This is an archive of the discontinued LLVM Phabricator instance.

Differential D62766

[Attributor] Deduce "nosync" function attribute.
ClosedPublic

Authored by sstefan1 on May 31 2019, 7:50 PM.

Details

Reviewers

jdoerfert
jfb
nhaehnle
arsenm

Commits

rG0626367202ce: [Attributor] Deduce "nosync" function attribute.
rL365830: [Attributor] Deduce "nosync" function attribute.

Summary

Introduce and deduce the "nosync" function attribute, which indicates that a function does not synchronize with another thread in a way that the other thread might free memory.

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 33336
Build 33335: arc lint + arc unit

Event Timeline

Herald added a reviewer: jdoerfert. May 31 2019, 7:50 PM

Herald added a project: Restricted Project.

Herald added subscribers: llvm-commits, jfb, hiraditya.

Harbormaster completed remote builds in B32763: Diff 202526. May 31 2019, 7:50 PM

uenoku added a subscriber: uenoku. May 31 2019, 9:31 PM

jdoerfert added inline comments. Jun 1 2019, 1:00 AM

llvm/docs/LangRef.rst
1479	The part after "causing" is too specific. We want nosync to be generic.
llvm/lib/Transforms/IPO/Attributor.cpp
5	Leftover comments?
15	(only a leftover of an old attributor patch version)
90	You have to check against "nosync".
294	Copy & paste
301	Make it `static` or member function. Also, describe what it does in the comment.
314	Description missing.
334	What about calls? Maybe you need to look at all side-effect instructions?
358	This is not a fixpoint. UpdateImpl is called multiple times (potentially). Remove the fixpoint call.

small fix
comments and LangRef

Harbormaster completed remote builds in B32786: Diff 202599. Jun 2 2019, 6:28 AM

removing fixpoint call

Harbormaster completed remote builds in B32787: Diff 202600. Jun 2 2019, 6:31 AM

The diff seems to include changes in the attributor. Maybe download the latest version of the Attributor patch, rebase this one, and remove everything that is not part of the nosync change. Also, please include the changes to the test cases you have in this patch.

llvm/lib/Transforms/IPO/Attributor.cpp
290	Typo: "if"
339	You have to add fences here as well and look at calls explicitly (as they are not in the above opcode list). Alternatively you could do: Use the `InformationCache::getReadOrWriteInstsForFunction` method to get all potential read & write instructions. That will include calls you need to look at and everything above. You will need to look at calls first, if they are fine, you can check for volatile and atomic. The code to determine if a call is OK is already present down there.
347	Please use the type here and start variables with an upper case letter.

Checking calls first. Adding checks for fences. Now using InfoCache.

Harbormaster completed remote builds in B32799: Diff 202629. Jun 2 2019, 6:28 PM

sstefan1 added inline comments. Jun 2 2019, 6:32 PM

llvm/lib/Transforms/IPO/Attributor.cpp
546	Just to make sure, when using `InformationCache::getReadOrWriteInstsForFunction` I don't need this, right?

Some more comments but it looks much better already. The test changes are missing though.

llvm/lib/Transforms/IPO/Attributor.cpp
293	"if"
304	Put the documentation on the declaration.
322	Can you describe the logic here?
351	Only do the stuff below if `I` is actually a call, so if `ICS` is not `null`. If you run this, it should crash on you right now because you access ICS unconditionally.
354	I think `getReadOrWriteInstsForFunction` will never pick up a `readnone` call as it will neither read nor write memory.
366	Can we have a single call to `isVolatile`? Maybe always call that one and `getOrdering`, and decide what to do based on the result. That would mean moving `I->isAtomic()` into `getOrdering()` and ensuring we catch all opcodes in the switch (so the default prints an error).
546	Correct.

Tests almost done. I'll update in a couple of hours.

llvm/lib/Transforms/IPO/Attributor.cpp
322	I had to return one. So if `Success` isn't interesting, it returns the `Failure` ordering. Otherwise it doesn't matter, since `Success` already syncs. I didn't give this much thought; if you have any suggestions, I'll apply them.
354	So is it safe to remove it then, as it does not have side-effects?
366	If I do it this way, I think it would be better to change `AtomicOrdering getOrdering()` to `bool isSyncOrdering()` or whatever is appropriate for the name. It can then return true if the ordering is not Unordered or Monotonic. That way everything can be checked in one `if`.
> and ensure we catch all opcodes in the switch (so the default prints an error)
I only miss GetElementPtr and alloca, which are not of great interest here, if I'm not wrong. But I can add them as well.

addressed most of the comments.

Harbormaster completed remote builds in B32845: Diff 202831. Jun 3 2019, 5:43 PM

jdoerfert added inline comments. Jun 4 2019, 1:28 PM

llvm/lib/Transforms/IPO/Attributor.cpp
330–331	Shouldn't we here directly assume sync as it is atomic but we don't know what kind?
364	You need to check volatile and atomic for all instructions I guess and for calls nosync as well

Does everything else look ok?

llvm/lib/Transforms/IPO/Attributor.cpp
330–331	Yes, my bad. I'll return true.

small fixes

Harbormaster completed remote builds in B32905: Diff 203021. Jun 4 2019, 2:13 PM

More comments including various small style suggestions.

You also need to rewrite the commit message and the test case impact is missing.
For the commit message it is probably enough to drop the last part, thus:

Introduce and deduce the "nosync" function attribute which indicates that a function does not synchronize with another thread in any way.

Remind me, is there a language ref patch for nosync somewhere? If not, we need to add a description in the LangRef.doc as well.

llvm/lib/Transforms/IPO/Attributor.cpp
253	I forgot that before but I think it is better if we split it into a `struct AANoSync` in the header and a `struct AANoSyncFunction` in the cpp file. Some functions would go in the generic header struct, but not getState, getManifestPosition, updateImpl, and ID. This makes it easier to use the result in other attributes.
268–272	2 new lines.
288–289	Comment needs to be updated.
289–292	Make it a static member function if possible; that way we can reuse it more easily. Same for isVolatile.
315–316	Given that Success and Failure are only needed in this case you can declare them here. To do so you need to add brackets around the case: case Instruction::AtomicCmpXchg: { ... }
320–321	Why is it sufficient that one ordering is "weak enough"? Don't we have to test both? Either way, we need a comment to explain what is happening here.
321–327	No worries, all good. Please also add a comment to explain what this means and why we return true.
329	Indention. And maybe add a few more words here ;)
364–365	I think the `!` in front of ICS is a problem. Did you run this?

Addressing comments.

LangRef was here from the beginning, I just messed up the diffs. Now it's here.

Harbormaster completed remote builds in B32938: Diff 203183. Jun 5 2019, 9:53 AM

Inline comments are now not in original order, so I'll reply here.

I think the `!` in front of ICS is a problem. Did you run this?

Yes.

Why is it sufficient that one ordering is "weak enough"? Don't we have to test both? Either way, we need a comment to explain what is happening here.

My thinking is that if either one of them is 'weak enough', then "no-sync" is no longer possible since at any point it can be one of the orderings. If you disagree, I can change and require both.

Indention. And maybe add a few more words here ;)

I updated the function comment, hope that's enough.

In D62766#1531219, @sstefan1 wrote:

Inline comments are now not in original order, so I'll reply here.

I think the `!` in front of ICS is a problem. Did you run this?

Yes.

So remove it ;)

Why is it sufficient that one ordering is "weak enough"? Don't we have to test both? Either way, we need a comment to explain what is happening here.

My thinking is that if either one of them is 'weak enough', then "no-sync" is no longer possible since at any point it can be one of the orderings. If you disagree, I can change and require both.

I mixed up the meaning of the return value. It looks fine once I read the comment.

Indention. And maybe add a few more words here ;)

I updated the function comment, hope that's enough.

Looks good.

I added more comments but I think this is almost done. Go through the code and tests yourself and make sure there are no spurious newlines or other changes you did not intend.

llvm/docs/LangRef.rst
1477	Maybe add something like: If the function does ever synchronize with another thread, the behavior is undefined.
llvm/include/llvm/Transforms/IPO/Attributor.h
647	You don't need to inherit from `BooleanState` here. That is an implementation detail we probably want to hide. Let `AANoSyncFunction` inherit from `BooleanState` but keep the functions `isAssumedNoSync` and `isKnownNoSync` here. They will not have an implementation and are overwritten in `AANoSyncFunction`.
667	Copy and paste, this is not a namespace ;)
llvm/test/Transforms/FunctionAttrs/nosync.ll
115	Remove the commented instruction here and in the next test. Also, fix the indention.
139	Isn't the "nosync" attribute missing for this function?
197	The function names don't match the IR names.

nosync small fixes.
fixing tests.

Harbormaster completed remote builds in B33037: Diff 203466. Jun 6 2019, 4:29 PM

jdoerfert added inline comments. Jun 7 2019, 7:52 AM

llvm/lib/Transforms/IPO/Attributor.cpp
285	The above functions should have a comment referring to the base class.
288	This has to go in the base class I think.
324	I'm still confused. The pessimistic return value is `true`, correct? If so, Why can we return `false` after we've seen only the success ordering? Don't we have to look at both success and failure ordering and only if both are "fine" we can return `false`?

sstefan1 marked an inline comment as done. Jun 7 2019, 8:05 AM

sstefan1 added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
324	I agree. I messed this up. Before I update, does this look ok?

if (Success != AtomicOrdering::Unordered ||
    Success != AtomicOrdering::Monotonic)
  return true;

if (Failure != AtomicOrdering::Unordered ||
    Failure != AtomicOrdering::Monotonic)
  return true;

return false;
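
One thing worth checking in a snippet of this shape: a condition of the form `X != A || X != B` is true for every `X` whenever `A` and `B` are distinct, so the function as posted above would return `true` unconditionally. The sketch below contrasts that form with an `&&` variant that returns `false` only when both orderings are unordered or monotonic. It is an illustration only: `Ordering`, `nonRelaxedAsPosted`, and `nonRelaxedWithAnd` are hypothetical stand-ins, not `llvm::AtomicOrdering` or the code that was committed.

```cpp
#include <cassert>

// Hypothetical stand-in for an atomic-ordering enum; not the LLVM type.
enum class Ordering { Unordered, Monotonic, Acquire, Release, SeqCst };

// Condition as posted: `O != A || O != B` holds for every O because
// Unordered and Monotonic are distinct values, so this always returns true.
bool nonRelaxedAsPosted(Ordering Success, Ordering Failure) {
  if (Success != Ordering::Unordered || Success != Ordering::Monotonic)
    return true;
  if (Failure != Ordering::Unordered || Failure != Ordering::Monotonic)
    return true;
  return false;
}

// Variant using `&&`: an ordering counts as non-relaxed only if it is
// neither Unordered nor Monotonic. Both the success and the failure
// ordering must be relaxed for the function to return false.
bool nonRelaxedWithAnd(Ordering Success, Ordering Failure) {
  if (Success != Ordering::Unordered && Success != Ordering::Monotonic)
    return true;
  if (Failure != Ordering::Unordered && Failure != Ordering::Monotonic)
    return true;
  return false;
}
```

With the `&&` form the pessimistic answer (`true`) is still returned as soon as either ordering is stronger than monotonic, which matches the reviewer's request to look at both orderings.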

fixed isNonRelaxedAtomic

Harbormaster completed remote builds in B33093: Diff 203655. Jun 7 2019, 6:05 PM

LGTM

This revision is now accepted and ready to land. Jun 8 2019, 9:41 AM

arsenm added a subscriber: arsenm. Jun 12 2019, 4:56 PM

arsenm added inline comments.

llvm/docs/LangRef.rst
1476–1478	I think this is a bit vague. In particular I don't think the LangRef defines what a "thread" means anywhere. I also think this needs to be more clear on what kinds of synchronization is allowed. Is this only communication through some addressable memory? What about GPU cross lane communication operations? I'm wondering if this is sufficient to solve this problem: http://lists.llvm.org/pipermail/llvm-dev/2013-November/067359.html TLDR, memory instructions can currently be hoisted over an arbitrary call if they are accessing a noalias argument

arsenm added inline comments. Jun 12 2019, 4:57 PM

llvm/docs/LangRef.rst
1476–1478	This is also mentioned as a proper attribute here (which I would greatly prefer to adding another string attribute), but only handled as a string attribute

jdoerfert requested changes to this revision. Jun 12 2019, 5:15 PM

jdoerfert added inline comments.

llvm/docs/LangRef.rst
1476–1478	That is a good point. I was initially thinking string attributes are fine, but D62784 seems to be stuck, which makes the testing of them hard. Long story short, let's make them enum attributes.
@sstefan1 could you please make this a proper enum attribute? This will require some additional "mechanics" in:
`llvm/lib/AsmParser/LLParser.cpp`
`llvm/lib/Bitcode/Reader/BitcodeReader.cpp`
`llvm/lib/Bitcode/Writer/BitcodeWriter.cpp`
`llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp`
`llvm/lib/IR/Attributes.cpp`
`llvm/lib/IR/Verifier.cpp`
Could be more though. Look for an existing attribute, e.g. Cold, and how that is handled.
@uenoku Could you please also make `nofree` an enum attribute?

This revision now requires changes to proceed. Jun 12 2019, 5:15 PM

jdoerfert mentioned this in D62313: Add a test for "nofree" function attribute. Jun 12 2019, 5:16 PM

jdoerfert added a child revision: D63243: [WIP] Adjust the users of dereferenceable wrt. dereferenceable_globally. Jun 12 2019, 11:05 PM

jdoerfert added inline comments. Jun 12 2019, 11:17 PM

llvm/docs/LangRef.rst
1476–1478	> I think this is a bit vague. In particular I don't think the LangRef defines what a "thread" means anywhere.
I did think/hope we do not have to. There is the implicit execution thread and `nosync` says there is "nothing else" while the function is executed. Basically, there are no side-effects that did not originate from the code we see. Please object if you think this is not sufficient.
> I also think this needs to be more clear on what kinds of synchronization is allowed.
None, if `nosync` is present.
> Is this only communication through some addressable memory? What about GPU cross lane communication operations?
I'd say, not allowed if `nosync` is present.
> TLDR, memory instructions can currently be hoisted over an arbitrary call if they are accessing a noalias argument
I tried to expose that lately [1] but failed, do you have an example?
[1] https://bugs.llvm.org/show_bug.cgi?id=41781

sstefan1 updated this revision to Diff 204510. Jun 13 2019, 6:20 AM

Making nosync an enum attribute.

Herald added subscribers: dexonsmith, steven_wu, mehdi_amini. Jun 13 2019, 6:20 AM

Harbormaster completed remote builds in B33336: Diff 204510. Jun 13 2019, 6:21 AM

fixing diff

Harbormaster completed remote builds in B33338: Diff 204513. Jun 13 2019, 6:30 AM

Sorry for my delayed review, I want to move this ahead now and commit it.

Can you make sure this works on top-of-trunk (origin/master), hence, rebase it please. Also make sure make check-all works without problems.

llvm/lib/Transforms/IPO/Attributor.cpp
93	This should now be checked in the switch below, it is an enum attribute now.
281	Delete this please.
llvm/test/Transforms/FunctionAttrs/nosync.ll
2	You need to enable the attributor explicitly, for now.

Just realized that the basic attribute test in test/Bitcode/attributes.ll is missing, see https://reviews.llvm.org/D49165#change-K8gwLFRSXwEe for an example.

Please add tests for the things I mention in comments, as well as:

relaxed volatile atomic load / store
inline assembly

llvm/include/llvm/Transforms/IPO/Attributor.h
381	"Underlying"
llvm/lib/Transforms/IPO/Attributor.cpp
314	A fence with `singlethread` sync scope doesn't sync with other threads, even if `seq_cst`.
331	We probably want `llvm_unreachable` here, so the code gets updated if we add new atomic operations.
341	You're missing `memcpy` and similar intrinsics, I think you want to handle them here and not in the generic intrinsic handling.
llvm/test/Transforms/FunctionAttrs/nosync.ll
5	"designet"?
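
The `singlethread` fence remark above can be illustrated with a small model. This is a sketch only: `SyncScope`, `Ordering`, `Fence`, and `fenceMaySyncWithOtherThreads` are hypothetical stand-ins rather than LLVM's classes, and the only rule encoded is the one stated in the review comment, namely that a fence scoped to a single thread does not synchronize with other threads even when its ordering is `seq_cst`.

```cpp
#include <cassert>

// Hypothetical stand-ins; they only mirror the terms used in the discussion.
enum class SyncScope { SingleThread, System };
enum class Ordering { Acquire, Release, AcquireRelease, SeqCst };

struct Fence {
  Ordering Order;
  SyncScope Scope;
};

// A fence restricted to the current thread cannot order anything with
// respect to other threads, so only wider scopes count as potential
// cross-thread synchronization. The ordering is deliberately ignored here.
bool fenceMaySyncWithOtherThreads(const Fence &F) {
  return F.Scope != SyncScope::SingleThread;
}
```

In a deduction pass, a check of this kind would let a function containing only single-thread fences remain a candidate for `nosync`, while any fence with a wider scope would rule it out.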

addressing comments.

Harbormaster completed remote builds in B33830: Diff 206312. Jun 24 2019, 3:30 PM

sstefan1 added a reviewer: jfb. Jun 24 2019, 3:31 PM

arsenm added inline comments. Jun 24 2019, 3:42 PM

llvm/docs/LangRef.rst
1476–1478	Is it then disallowed to merge any calls that aren't nosync? E.g., given `if (foo) bar(x) readnone else bar(y) readnone`, is it no longer legal to combine these as `bar(foo ? x : y) readnone`?

small fix
added include
Fix cast

Harbormaster completed remote builds in B33838: Diff 206325. Jun 24 2019, 4:36 PM

In D62766#1555926, @jfb wrote:

Please add tests for the things I mention in comments, as well as:

relaxed volatile atomic load / store

inline assembly

Only added one test for now. I will add more tomorrow.

In D62766#1555578, @jdoerfert wrote:

Sorry for my delayed review, I want to move this ahead now and commit it.

Can you make sure this works on top-of-trunk (origin/master), hence, rebase it please. Also make sure make check-all works without problems.

@jdoerfert As for the intrinsics: I added checks for memcpy, memmove & memset as @jfb suggested. What do you think? I also kept the FIXME comment, which can indicate that we might take a different approach. I ran make check-all; there were a few problems with some FunctionAttrs tests not checking for nosync attributes. I will fix that tomorrow. Also, I seem to have a problem with test/Bitcode/attributes.ll and the nobuiltin attribute.

updated tests

Harbormaster completed remote builds in B33975: Diff 206764. Jun 26 2019, 4:44 PM

This does seem useful, although the description is overly narrow (what does nosync on its own have to do with freeing memory?).

I also think that the definition of nosync needs some work, as just "synchronization" is a rather vague term. Can you define it in terms of fences and atomic instructions instead, e.g. by saying that a nosync function does not perform such operations (or some subset of such operations)?

llvm/docs/LangRef.rst
1476–1478	No, I think that would still be allowed. The sync (aka not-nosync) functions have a potential side effect in terms of the memory model, but it's the same side effect in either case since the memory model at this point doesn't care about subgroups. I guess you're thinking of subgroup operations, but the issue with those is that the set of threads with which communication occurs is a function of where the operation occurs in control flow. It makes sense to keep that issue separate from this attribute.
llvm/test/Transforms/FunctionAttrs/nosync.ll
293–295	The negative check line here is missing.

This revision now requires changes to proceed. Jun 27 2019, 1:52 AM

In D62766#1560305, @nhaehnle wrote:

This does seem useful, although the description is overly narrow (what does nosync on its own have to do with freeing memory?).

The idea was to use this with nofree for dereferenceable, like @hfinkel proposed in this email.

I also think that the definition of nosync needs some work, as just "synchronization" is a rather vague term. Can you define it in terms of fences and atomic instructions instead, e.g. by saying that a nosync function does not perform such operations (or some subset of such operations)?

I will update the definition with more details. Maybe I should put that in patch description instead of the current (narrow) one?

jdoerfert added inline comments. Jun 27 2019, 12:44 PM

llvm/docs/LangRef.rst
1476–1478	`readnone` implies `nosync` in my opinion. If we forget about `readnone` in the example, I think the above merge is still legal.

arsenm added inline comments. Jun 27 2019, 1:01 PM

llvm/docs/LangRef.rst
1476–1478	This should mention that synchronization means through some kind of memory side-effect. This needs to be distinguished from cross-lane operations, which could be interpreted as a kind of synchronization where treating it as a memory dependence is not sufficient.

changed nosync LangRef definition

@arsenm I used most of your comment/suggestion.

Harbormaster completed remote builds in B34022: Diff 206925. Jun 27 2019, 1:46 PM

You should also add a test function with inline assembly.

llvm/lib/Transforms/IPO/Attributor.cpp
319	"then"
382	"improve"
llvm/test/Transforms/FunctionAttrs/nosync.ll
317	I don't think you can generally treat intrinsics as `nosync`. Unless you know they're actually `nosync` you should assume that intrinsics might synchronize. For example:
  int a;
  void i_totally_sync() { __builtin_ia32_clflush(&a); }
Corresponds to:
  tail call void @llvm.x86.sse2.clflush(i8* bitcast (i32* @a to i8*))
You should have a test for this, and it should definitely not be `nosync`.
The other option here is to go and add a field to all intrinsics, so when creating a new one we have to figure out whether it'll definitely sync, maybe sync, or never sync. I don't think that's in scope for this patch.

nhaehnle added inline comments. Jul 1 2019, 3:10 AM

llvm/docs/LangRef.rst
1475–1482	Thanks, I think this is better, but there are still some problems:
There are no relaxed atomics in LLVM, only unordered, monotonic, and stronger orderings.
What about fences?
I would put the part about cross-lane operations at the end and rephrase it slightly for clarity. Suggestion:
This attribute is only concerned with synchronization through memory operations and is therefore orthogonal to cross-lane and convergent operations. In particular, an operation such as a barrier can be `convergent` but also `nosync`.
Assuming we can agree about the actual statement of that last sentence...

jdoerfert added inline comments. Jul 1 2019, 1:31 PM

llvm/docs/LangRef.rst
1475–1482	> This attribute is only concerned with synchronization through memory operations and is therefore orthogonal to cross-lane and convergent operations. In particular, an operation such as a barrier can be convergent but also nosync. Assuming we can agree about the actual statement of that last sentence...
This proposed change, and the one requested earlier and integrated (sync goes through memory), are problematic. I first thought they were fine, but they will probably make the attribute unusable. An alternative proposal would be:
This function attribute indicates that the function does not communicate (synchronize) with another thread through memory or other well-defined means. Synchronization is considered possible in the presence of `atomic` accesses that enforce an order, thus not "unordered" and "monotonic", `volatile` accesses, as well as `convergent` function calls. Note that through the latter non-memory communication, e.g., cross-lane operations, is also considered synchronization. If an annotated function does ever synchronize with another thread, the behavior is undefined.
If this is where we are heading, we need to make sure we test:
`non-convergent` does not allow `nosync`, e.g., `readnone` does not imply `nosync`
`readnone` and `non-convergent` does imply `nosync`
@arsenm, @nhaehnle, @jfb, what do you think?
llvm/test/Transforms/FunctionAttrs/nosync.ll
317	> I don't think you can generally treat intrinsics as nosync. Unless you know they're actually nosync you should assume that intrinsics might synchronize.
Good point. Maybe the best way (for now and in general) is to "not look for" intrinsics. Use the same logic for all instructions. That is, if it is a call and not annotated as no-sync it may-sync. The test with `llvm.cos` can be adjusted by adding `readnone` to the declaration of `llvm.cos`.
> The other option here is to go and add a field to all intrinsics, so when creating a new one we have to figure out whether it'll definitely sync, maybe sync, or never sync. I don't think that's in scope for this patch.
Agreed, we will have to do that for various attributes at some point (soon) but not in this patch.

jfb added inline comments. Jul 1 2019, 1:36 PM

llvm/test/Transforms/FunctionAttrs/nosync.ll
317	I'd still accept the volatile `mem*` intrinsics as is already done, but otherwise yeah intrinsics should be assumed to synchronize.

arsenm added inline comments. Jul 1 2019, 2:05 PM

llvm/docs/LangRef.rst
1475–1482	What makes it unusable exactly? This wording confuses me:
> Note that through the latter non-memory communication, e.g., cross-lane operations, is also considered synchronization.
I'm not 100% comfortable specifically referring to convergent, since I'm still worried about the yet-to-be-defined anticonvergent attribute. Though it is hard to define something around an unsolved problem. This phrasing also implies to me that call site merging is not legal, which is what I thought you were trying to avoid.
Conclusion 2 sounds OK to me. Conclusion 1 sounds like the opposite of what the goal is?

jdoerfert added inline comments. Jul 1 2019, 4:02 PM

llvm/docs/LangRef.rst
1475–1482	> What makes it unusable exactly?
`nosync` would then still allow non-memory synchronization which it shouldn't.
I think the IRC conversation helped. What I want us to have is: `nosync` means no synchronization/communication between "threads". Any potential synchronization, e.g., through memory or registers, precludes `nosync`.
llvm/test/Transforms/FunctionAttrs/nosync.ll
317	Agreed.

nhaehnle added inline comments. Jul 2 2019, 12:18 AM

llvm/docs/LangRef.rst
1475–1482	> I think the IRC conversation helped.
Is that recorded somewhere?
> `nosync` would then still allow non-memory synchronization which it shouldn't.
This is questionable. There are `convergent` operations that do not imply synchronization. For example, some of the `llvm.amdgcn.image.sample.*` intrinsics are convergent, but they do not imply any kind of synchronization in the memory model. In Vulkan/SPIR-V parlance, the intrinsic may have an implied control barrier, but it definitely has no memory barrier (the control barrier part isn't fully spec'd out in SPIR-V either at the moment).
For the initial intended usage of this attribute: if there is a pointer that you know to be dereferenceable before the image sample, then you still know it to be dereferenceable afterwards. So it seems reasonable to want the intrinsic to be marked both `convergent` and `nosync`.
That said, I'm okay with this part of it:
> If this is where we are heading, we need to make sure we test: non-convergent does not allow nosync, e.g., readnone does not imply nosync; readnone and non-convergent does imply nosync
... so long as it's understood that those are "merely" the rules for the attributor.

jdoerfert added inline comments. Jul 2 2019, 3:25 PM

llvm/docs/LangRef.rst
1475–1482	> This is questionable. There are convergent operations that do not imply synchronization. For example, some of the llvm.amdgcn.image.sample.* intrinsics are convergent, but they do not imply any kind of synchronization in the memory model.
For me, `nosync` has to mean absence of any kind of synchronization, including control barriers.
> For the initial intended usage of this attribute: if there is a pointer that you know to be dereferenceable before the image sample, then you still know it to be dereferenceable afterwards. So it seems reasonable to want the intrinsic to be marked both convergent and nosync.
I see why you want this but I don't think that is what it should mean. `nosync` should not allow control synchronization as it will inevitably cause problems down the road. So, let me rephrase my earlier comment:
By default, we have to assume a `convergent` & `readnone` function might cause control synchronization between threads and is therefore not `nosync`. However, a function can be `convergent` and `nosync`. Finally, a function that is not-`convergent` and `readonly` is `nosync`.

hfinkel added inline comments. Jul 2 2019, 4:39 PM

llvm/docs/LangRef.rst
1475–1482	> However, a function can be convergent and nosync.
I think that this is important. We can mark convergent intrinsics that don't provide synchronizing semantics as nosync. In general, we need a nosync attribute to mean that, in the function marked as nosync, the current thread cannot complete communication with any other threads (e.g., it can't send a value to another thread). The interesting thing, to me, that has been highlighted in this discussion is: convergent functions, by default, can have things like inter-thread register shuffles, but are otherwise readnone, and so must be excluded from automated nosync deduction (because they can, without accessing memory at all, communicate values to other threads).
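
The default deduction rules discussed in this sub-thread can be summarized in a small model. This is a sketch under stated assumptions, not the Attributor's implementation: `CalleeAttrs` and `calleeAssumedNoSync` are hypothetical names, and the three flags merely stand in for the `nosync`, `readnone`, and `convergent` attributes. It encodes the rule stated earlier in the thread that `readnone` together with non-`convergent` implies `nosync`, while `convergent` functions are excluded from automatic deduction unless explicitly annotated.

```cpp
#include <cassert>

// Hypothetical summary of the attributes a callee is known to carry.
struct CalleeAttrs {
  bool NoSync = false;
  bool ReadNone = false;
  bool Convergent = false;
};

// Rules as stated in the discussion:
//  - an explicit nosync annotation is trusted, even on a convergent callee;
//  - readnone plus non-convergent implies nosync;
//  - everything else, including convergent readnone, may synchronize.
bool calleeAssumedNoSync(const CalleeAttrs &A) {
  if (A.NoSync)
    return true;
  return A.ReadNone && !A.Convergent;
}
```

The second rule is what keeps something like an inter-thread register shuffle, which touches no memory, from being deduced `nosync` by default.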

Add inline assembly test.

Harbormaster completed remote builds in B34321: Diff 207893. Jul 3 2019, 2:39 PM

sstefan1 marked 2 inline comments as done. Jul 3 2019, 2:48 PM

sstefan1 added inline comments.

llvm/test/Transforms/FunctionAttrs/nosync.ll
293–295	I skipped the negative check line, because for now only nosync is deduced and if function is not nosync there will be no Function Attrs at all. Once we have some more attributes, I'll add the negative check. Is that ok with you?
317	Replaced llvm.cos test with inline assembly. llvm.cos test was my mistake, since with the current implementation it would be considered sync.

jdoerfert added inline comments. Jul 3 2019, 3:02 PM

llvm/test/Transforms/FunctionAttrs/nosync.ll
293–295	Just give the function the `nounwind` attribute and then add the negative check line.
317	The test comment is off, and again add `nounwind` to allow for the check lines

fixed tests & improved definition of nosync in langRef

Harbormaster completed remote builds in B34323: Diff 207903. Jul 3 2019, 3:34 PM

non-convergent and readnone check.
Changed handling of intrinsics.
Added more tests.

@jdoerfert, @jfb, @arsenm, @nhaehnle does this look alright now?

Harbormaster completed remote builds in B34519: Diff 208469. Jul 8 2019, 11:36 AM

I added last minor comments from my side. Other than that I think this looks fine. We will have to wait for the others though.

(You will need to rebase and make sure ninja check-all passes because there are other new attributes.)

llvm/lib/Transforms/IPO/Attributor.cpp
360	I think the "unknown" case is handled the wrong way here. Shouldn't it be:
  if (Arg->getType()->isIntegerTy(1) && cast<ConstantInt>(Arg)->getValue() == 0)
    return true;
  return false;
such that "unknown" values, e.g., `%cmp = icmp ...` used as the 4th argument will conservatively make it sync? (+ Test case for this)
415	I was puzzled by this check for a second, add a comment indicating that the above loop handles calls with read/write effects already. Mention that the fact there is a read/write effect caused us already to make sure it is `nosync` and there is consequently no need to check for `convergent`.
llvm/test/Transforms/FunctionAttrs/nosync.ll
311	Copy&paste
350	Shouldn't this be `nosync`? Is it?

sstefan1 marked 2 inline comments as done. Jul 9 2019, 1:44 AM

sstefan1 added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
360	The 4th argument, isvolatile, is `immarg`, so I guess this is not necessary?
llvm/test/Transforms/FunctionAttrs/nosync.ll
350	Yes, this falls under copy & paste as well.

jdoerfert added inline comments. Jul 9 2019, 9:04 PM

llvm/lib/Bitcode/Reader/BitcodeReader.cpp
1195	I think, you can remove this change. All should be fine without.
llvm/lib/Transforms/IPO/Attributor.cpp
360	Agreed, not necessary. However, if you keep it this way, add the above reasoning to the comment, it confused me now and it can easily confuse the next person. My advice, just swap the order to make it easier for people now and in the future ;)
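
The "swap the order" advice above amounts to making the known-non-volatile case the only one that returns `true`, so anything else falls through to the conservative answer. A minimal sketch, assuming a hypothetical three-state enum `VolatileArg` and a hypothetical helper `memIntrinsicIsNoSync`; it does not use LLVM's `ConstantInt` or `MemIntrinsic` classes:

```cpp
#include <cassert>

// Hypothetical three-state view of a mem* intrinsic's isvolatile argument.
enum class VolatileArg { KnownFalse, KnownTrue, Unknown };

// Only a flag known to be false lets the intrinsic count as nosync. A
// non-constant (Unknown) flag is conservatively treated like a volatile one.
bool memIntrinsicIsNoSync(VolatileArg IsVolatile) {
  return IsVolatile == VolatileArg::KnownFalse;
}
```

As noted in the thread, the argument is `immarg`, so the `Unknown` state should not arise in valid IR; writing the check this way just makes the fallback explicit for the next reader.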

arsenm added inline comments. Jul 10 2019, 7:38 AM

llvm/lib/Transforms/IPO/Attributor.cpp
355–358	It would be clearer to cast to MemIntrinsic and check isVolatile
369–382	I'm pretty sure this is repeated in several passes, and incomplete. Target intrinsics can also be considered volatile, as there is a hook to get the memory properties for them

jdoerfert added inline comments. Jul 10 2019, 7:57 AM

llvm/lib/Transforms/IPO/Attributor.cpp
369–382	I guess we should not reach this function with calls. If that seems reasonable, we need an assert here and change the source below to skip these checks if a call is assumed/known nosync.

rebase
addressing comments
ninja check-all passed

Harbormaster completed remote builds in B34708: Diff 209013.Jul 10 2019, 10:38 AM

LGTM, assuming check-all passes.

LGTM with nits

llvm/include/llvm/Transforms/IPO/Attributor.h
665	Don't need virtual, only override
llvm/lib/Transforms/IPO/Attributor.cpp
399	No virtual necessary (and for the rest of the overrides)
llvm/test/Bitcode/attributes.ll
367 ↗	(On Diff #209013)	Brace placement

Herald added a subscriber: wdng. Jul 11 2019, 7:11 AM

This revision was not accepted when it landed; it landed in state Needs Review. Jul 11 2019, 2:38 PM

Closed by commit rL365830: [Attributor] Deduce "nosync" function attribute. (authored by sstefan).

This revision was automatically updated to reflect the committed changes.

efriedma mentioned this in D115302: GlobalsModRef should treat functions w/o nosync conservatively. Dec 14 2021, 1:48 PM

Revision Contents

Path	Size
llvm/docs/LangRef.rst	3 lines
llvm/include/llvm/Bitcode/LLVMBitCodes.h	3 lines
llvm/include/llvm/IR/Attributes.td	2 lines
llvm/include/llvm/Transforms/IPO/Attributor.h	18 lines
llvm/lib/AsmParser/LLLexer.cpp	1 line
llvm/lib/AsmParser/LLParser.cpp	1 line
llvm/lib/AsmParser/LLToken.h	1 line
llvm/lib/Bitcode/Reader/BitcodeReader.cpp	4 lines
llvm/lib/Bitcode/Writer/BitcodeWriter.cpp	2 lines
llvm/lib/IR/Attributes.cpp	2 lines
llvm/lib/IR/Verifier.cpp	1 line
llvm/lib/Transforms/IPO/Attributor.cpp	43 lines
llvm/test/Transforms/FunctionAttrs/nosync.ll	144 lines

Diff 204510

llvm/docs/LangRef.rst

	``noreturn``			``noreturn``
	This function attribute indicates that the function never returns			This function attribute indicates that the function never returns
	normally. This produces undefined behavior at runtime if the			normally. This produces undefined behavior at runtime if the
	function ever does dynamically return.			function ever does dynamically return.
	``norecurse``			``norecurse``
	This function attribute indicates that the function does not call itself			This function attribute indicates that the function does not call itself
	either directly or indirectly down any possible call path. This produces			either directly or indirectly down any possible call path. This produces
	undefined behavior at runtime if the function ever does recurse.			undefined behavior at runtime if the function ever does recurse.
	``nosync``			``nosync``
	This function attribute indicates that the function does not communicate			This function attribute indicates that the function does not communicate
	(synchronize) with another thread.			(synchronize) with another thread. If the function does ever synchronize
				jdoerfert: Maybe add something like: "If the function does ever synchronize with another thread, the behavior is undefined."
				with another thread, the behavior is undefined.
				arsenm: I think this is a bit vague. In particular I don't think the LangRef defines what a "thread" means anywhere. I also think this needs to be more clear on what kinds of synchronization is allowed. Is this only communication through some addressable memory? What about GPU cross lane communication operations? I'm wondering if this is sufficient to solve this problem: http://lists.llvm.org/pipermail/llvm-dev/2013-November/067359.html TLDR, memory instructions can currently be hoisted over an arbitrary call if they are accessing a noalias argument.
				arsenm: This is also mentioned as a proper attribute here (which I would greatly prefer to adding another string attribute), but only handled as a string attribute.
				jdoerfert: That is a good point. I was initially thinking string attributes are fine but D62784 seems to be stuck, which makes the testing of them hard. Long story short, let's make them enum attributes. @sstefan1 could you please make this a proper enum attribute? This will require some additional "mechanics" in: `llvm/lib/AsmParser/LLParser.cpp`, `llvm/lib/Bitcode/Reader/BitcodeReader.cpp`, `llvm/lib/Bitcode/Writer/BitcodeWriter.cpp`, `llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp`, `llvm/lib/IR/Attributes.cpp`, `llvm/lib/IR/Verifier.cpp`. Could be more though. Look for an existing attribute, e.g. Cold, and how that is handled. @uenoku Could you please also make `nofree` an enum attribute?
				jdoerfert:
				> I think this is a bit vague. In particular I don't think the LangRef defines what a "thread" means anywhere.
				I did think/hope we do not have to. There is the implicit execution thread and `nosync` says there is "nothing else" while the function is executed. Basically, there are no side-effects that did not originate from the code we see. Please object if you think this is not sufficient.
				> I also think this needs to be more clear on what kinds of synchronization is allowed.
				None, if `nosync` is present.
				> Is this only communication through some addressable memory? What about GPU cross lane communication operations?
				I'd say, not allowed if `nosync` is present.
				> TLDR, memory instructions can currently be hoisted over an arbitrary call if they are accessing a noalias argument
				I tried to expose that lately [1] but failed, do you have an example?
				[1] https://bugs.llvm.org/show_bug.cgi?id=41781
				arsenm: Is it then disallowed to merge any calls that aren't nosync? E.g., given `if (foo) bar(x) readnone else bar(y) readnone`, is it no longer legal to combine these as `bar(foo ? x : y) readnone`?
				nhaehnle: No, I think that would still be allowed. The sync (aka not-nosync) functions have a potential side effect in terms of the memory model, but it's the same side effect in either case since the memory model at this point doesn't care about subgroups. I guess you're thinking of subgroup operations, but the issue with those is that the set of threads with which communication occurs is a function of where the operation occurs in control flow. It makes sense to keep that issue separate from this attribute.
				jdoerfert: 1) `readnone` implies `nosync` in my opinion. 2) If we forget about `readnone` in the example, I think the above merge is still legal.
				arsenm: This should mention that synchronization means through some kind of memory side-effect. This needs to be distinguished from cross-lane operations, which could be interpreted as a kind of synchronization where treating it as a memory dependence is not sufficient.
	``nounwind``			``nounwind``
				jdoerfert: The part after "causing" is too specific. We want nosync to be generic.
	This function attribute indicates that the function never raises an			This function attribute indicates that the function never raises an
	exception. If the function does raise an exception, its runtime			exception. If the function does raise an exception, its runtime
	behavior is undefined. However, functions marked nounwind may still			behavior is undefined. However, functions marked nounwind may still
				nhaehnle: Thanks, I think this is better, but there are still some problems:
				* There are no relaxed atomics in LLVM, only unordered, monotonic, and stronger orderings.
				* What about fences?
				* I would put the part about cross-lane operations at the end and rephrase it slightly for clarity.
				Suggestion:
				> This attribute is only concerned with synchronization through memory operations and is therefore orthogonal to cross-lane and convergent operations. In particular, an operation such as a barrier can be `convergent` but also `nosync`.
				Assuming we can agree about the actual statement of that last sentence...
				jdoerfert:
				> This attribute is only concerned with synchronization through memory operations and is therefore orthogonal to cross-lane and convergent operations. In particular, an operation such as a barrier can be convergent but also nosync. Assuming we can agree about the actual statement of that last sentence...
				This proposed change, and the one requested earlier and integrated (sync goes through memory), are problematic. I first thought they were fine but they will probably make the attribute unusable. An alternative proposal would be:
				> This function attribute indicates that the function does not communicate (synchronize) with another thread through memory or other well-defined means. Synchronization is considered possible in the presence of `atomic` accesses that enforce an order, thus not "unordered" and "monotonic", `volatile` accesses, as well as `convergent` function calls. Note that through the latter non-memory communication, e.g., cross-lane operations, is also considered synchronization. If an annotated function does ever synchronize with another thread, the behavior is undefined.
				If this is where we are heading, we need to make sure we test:
				1) `non-convergent` does not allow `nosync`, e.g., `readnone` does not imply `nosync`
				2) `readnone` and `non-convergent` does imply `nosync`
				@arsenm, @nhaehnle, @jfb, what do you think?
				arsenm: What makes it unusable exactly? This wording confuses me:
				> Note that through the latter non-memory communication, e.g., cross-lane operations, is also considered synchronization.
				I'm not 100% comfortable specifically referring to convergent, since I'm still worried about the yet-to-be-defined anticonvergent attribute. Though it is hard to define something around an unsolved problem. This phrasing also implies to me that call site merging is not legal, which is what I thought you were trying to avoid. Conclusion 2 sounds OK to me. Conclusion 1 sounds like the opposite of what the goal is?
				jdoerfert:
				> What makes it unusable exactly?
				`nosync` would then still allow non-memory synchronization, which it shouldn't. I think the IRC conversation helped. What I want us to have is: `nosync` means no synchronization/communication between "threads". Any potential synchronization, e.g., through memory or registers, precludes `nosync`.
				nhaehnle:
				> I think the IRC conversation helped.
				Is that recorded somewhere?
				> `nosync` would then still allow non-memory synchronization which it shouldn't.
				This is questionable. There are `convergent` operations that do not imply synchronization. For example, some of the `llvm.amdgcn.image.sample.*` intrinsics are convergent, but they do not imply any kind of synchronization in the memory model. In Vulkan/SPIR-V parlance, the intrinsic may have an implied control barrier, but it definitely has no memory barrier (the control barrier part isn't fully spec'd out in SPIR-V either at the moment). For the initial intended usage of this attribute: if there is a pointer that you know to be dereferencable before the image sample, then you still know it to be dereferencable afterwards. So it seems reasonable to want the intrinsic to be marked both `convergent` and `nosync`. That said, I'm okay with this part of it:
				> If this is where we are heading, we need to make sure we test:
				> 1) non-convergent does not allow nosync, e.g., readnone does not imply nosync
				> 2) readnone and non-convergent does imply nosync
				... so long as it's understood that those are "merely" the rules for the attributor.
				jdoerfert:
				> This is questionable. There are convergent operations that do not imply synchronization. For example, some of the llvm.amdgcn.image.sample.* intrinsics are convergent, but they do not imply any kind of synchronization in the memory model.
				For me, `nosync` has to mean absence of any kind of synchronization, including control barriers.
				> For the initial intended usage of this attribute: if there is a pointer that you know to be dereferencable before the image sample, then you still know it to be dereferencable afterwards. So it seems reasonable to want the intrinsic to be marked both convergent and nosync.
				I see why you want this but I don't think that is what it should mean. `nosync` should not allow control synchronization as it will inevitably cause problems down the road. So, let me rephrase my earlier comment: By default, we have to assume a `convergent` & `readnone` function might cause control synchronization between threads and is therefore not `nosync`. However, a function can be `convergent` and `nosync`. Finally, a function that is not-`convergent` and `readonly` is `nosync`.
				hfinkel:
				> However, a function can be convergent and nosync.
				I think that this is important. We can mark convergent intrinsics that don't provide synchronizing semantics as nosync. In general, we need a nosync attribute to mean that, in the function marked as nosync, the current thread cannot complete communication with any other threads (e.g., it can't send a value to another thread). The interesting thing, to me, that has been highlighted in this discussion is: convergent functions, by default, can have things like inter-thread register shuffles, but are otherwise readnone, and so must be excluded from automated nosync deduction (because they can, without accessing memory at all, communicate values to other threads).
	trap or generate asynchronous exceptions. Exception handling schemes			trap or generate asynchronous exceptions. Exception handling schemes
	that are recognized by LLVM to handle asynchronous exceptions, such			that are recognized by LLVM to handle asynchronous exceptions, such
	as SEH, will still provide their implementation defined semantics.			as SEH, will still provide their implementation defined semantics.
	``"null-pointer-is-valid"``			``"null-pointer-is-valid"``
	If ``"null-pointer-is-valid"`` is set to ``"true"``, then ``null`` address			If ``"null-pointer-is-valid"`` is set to ``"true"``, then ``null`` address
	in address-space 0 is considered to be a valid address for memory loads and			in address-space 0 is considered to be a valid address for memory loads and
	stores. Any analysis or optimization should not treat dereferencing a			stores. Any analysis or optimization should not treat dereferencing a
	pointer to ``null`` as undefined behavior in this function.			pointer to ``null`` as undefined behavior in this function.
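
The inline discussion on the `nosync` wording above names three things that make synchronization possible: atomic accesses with an ordering stronger than "monotonic", `volatile` accesses, and `convergent` calls. The sketch below models only that proposed wording as a standalone C++ check. It is an illustration under stated assumptions, not the Attributor's implementation: `Ordering`, `Operation`, `mayHaveSyncEffect`, and `canDeduceNoSync` are hypothetical names, and a real deduction would inspect LLVM IR instructions rather than this simplified record.

```cpp
#include <cassert>
#include <vector>

// Hypothetical, simplified view of the atomic orderings mentioned in the
// review ("unordered", "monotonic", and stronger orderings).
enum class Ordering {
  NotAtomic,
  Unordered,
  Monotonic,
  Acquire,
  Release,
  AcquireRelease,
  SequentiallyConsistent
};

// Hypothetical record describing one operation in a function body.
struct Operation {
  Ordering Order;
  bool IsVolatile;
  bool IsConvergentCall;
};

// Returns true if, under the proposed wording, the operation could be used
// to synchronize with another thread.
bool mayHaveSyncEffect(const Operation &Op) {
  if (Op.IsVolatile || Op.IsConvergentCall)
    return true;
  switch (Op.Order) {
  case Ordering::NotAtomic:
  case Ordering::Unordered:
  case Ordering::Monotonic:
    return false; // These orderings do not enforce an order between threads.
  default:
    return true;  // Acquire, release, and stronger orderings.
  }
}

// A body is a nosync candidate only if no operation may synchronize.
bool canDeduceNoSync(const std::vector<Operation> &Body) {
  for (const Operation &Op : Body)
    if (mayHaveSyncEffect(Op))
      return false;
  return true;
}
```

Treating every convergent call as possibly synchronizing matches the point made at the end of the thread: a function may be both `convergent` and `nosync`, but that has to be stated explicitly rather than deduced automatically.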

llvm/include/llvm/Bitcode/LLVMBitCodes.h

enum AttributeKindCodes {
ATTR_KIND_WRITEONLY = 52,		ATTR_KIND_WRITEONLY = 52,
ATTR_KIND_SPECULATABLE = 53,		ATTR_KIND_SPECULATABLE = 53,
ATTR_KIND_STRICT_FP = 54,		ATTR_KIND_STRICT_FP = 54,
ATTR_KIND_SANITIZE_HWADDRESS = 55,		ATTR_KIND_SANITIZE_HWADDRESS = 55,
ATTR_KIND_NOCF_CHECK = 56,		ATTR_KIND_NOCF_CHECK = 56,
ATTR_KIND_OPT_FOR_FUZZING = 57,		ATTR_KIND_OPT_FOR_FUZZING = 57,
ATTR_KIND_SHADOWCALLSTACK = 58,		ATTR_KIND_SHADOWCALLSTACK = 58,
ATTR_KIND_SPECULATIVE_LOAD_HARDENING = 59,		ATTR_KIND_SPECULATIVE_LOAD_HARDENING = 59,
ATTR_KIND_IMMARG = 60		ATTR_KIND_IMMARG = 60,
		ATTR_KIND_NOSYNC = 61
};		};

enum ComdatSelectionKindCodes {		enum ComdatSelectionKindCodes {
COMDAT_SELECTION_KIND_ANY = 1,		COMDAT_SELECTION_KIND_ANY = 1,
COMDAT_SELECTION_KIND_EXACT_MATCH = 2,		COMDAT_SELECTION_KIND_EXACT_MATCH = 2,
COMDAT_SELECTION_KIND_LARGEST = 3,		COMDAT_SELECTION_KIND_LARGEST = 3,
COMDAT_SELECTION_KIND_NO_DUPLICATES = 4,		COMDAT_SELECTION_KIND_NO_DUPLICATES = 4,
COMDAT_SELECTION_KIND_SAME_SIZE = 5,		COMDAT_SELECTION_KIND_SAME_SIZE = 5,

llvm/include/llvm/IR/Attributes.td


	/// Disable redzone.			/// Disable redzone.
	def NoRedZone : EnumAttr<"noredzone">;			def NoRedZone : EnumAttr<"noredzone">;

	/// Mark the function as not returning.			/// Mark the function as not returning.
	def NoReturn : EnumAttr<"noreturn">;			def NoReturn : EnumAttr<"noreturn">;

	/// Function does not synchronize.			/// Function does not synchronize.
	def NoSync : StrBoolAttr<"nosync">;			def NoSync : EnumAttr<"nosync">;

	/// Disable Indirect Branch Tracking.			/// Disable Indirect Branch Tracking.
	def NoCfCheck : EnumAttr<"nocf_check">;			def NoCfCheck : EnumAttr<"nocf_check">;

	/// Function doesn't unwind stack.			/// Function doesn't unwind stack.
	def NoUnwind : EnumAttr<"nounwind">;			def NoUnwind : EnumAttr<"nounwind">;

	/// Select optimizations for best fuzzing signal.			/// Select optimizations for best fuzzing signal.

llvm/include/llvm/Transforms/IPO/Attributor.h

	/// bits. Users can only add known bits and, except through adding known bits,			/// bits. Users can only add known bits and, except through adding known bits,
	/// they can only remove assumed bits. This should guarantee monotoniticy and			/// they can only remove assumed bits. This should guarantee monotoniticy and
	/// thereby the existence of a fixpoint (if used corretly). The fixpoint is			/// thereby the existence of a fixpoint (if used corretly). The fixpoint is
	/// reached when the assumed and known state/bits are equal. Users can			/// reached when the assumed and known state/bits are equal. Users can
	/// force/inidicate a fixpoint. If an optimistic one is indicated, the known			/// force/inidicate a fixpoint. If an optimistic one is indicated, the known
	/// state will catch up with the assumed one, for a pessimistic fixpoint it is			/// state will catch up with the assumed one, for a pessimistic fixpoint it is
	/// the other way around.			/// the other way around.
	struct IntegerState : public AbstractState {			struct IntegerState : public AbstractState {
	/// Undrlying integer type, we assume 32 bits to be enough.			/// Undrlying integer type, we assume 32 bits to be enough.
				jfb: "Underlying"
	using base_t = uint32_t;			using base_t = uint32_t;

	/// Initialize the (best) state.			/// Initialize the (best) state.
	IntegerState(base_t BestState = ~0) : Assumed(BestState) {}			IntegerState(base_t BestState = ~0) : Assumed(BestState) {}

	/// Return the worst possible representable state.			/// Return the worst possible representable state.
	static constexpr base_t getWorstState() { return 0; }			static constexpr base_t getWorstState() { return 0; }
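
The comment on `IntegerState` above describes a known/assumed bit lattice: known bits can only be added, assumed bits can only be removed (except where they are known), and a fixpoint is reached once both sets are equal. Below is a minimal standalone sketch of that idea. It is written from the comment alone, so the struct name `KnownAssumedBits` and its member functions are hypothetical and do not claim to match the interface in Attributor.h.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of a known/assumed bit state. Invariant: every known
// bit is also assumed.
struct KnownAssumedBits {
  using base_t = std::uint32_t;

  base_t Known;   // Bits proven to hold; starts at the worst state (0).
  base_t Assumed; // Bits optimistically assumed; starts at the best state.

  explicit KnownAssumedBits(base_t Best = ~base_t(0))
      : Known(0), Assumed(Best) {}

  bool isAtFixpoint() const { return Known == Assumed; }

  // Adding known bits also makes them assumed, which keeps the invariant.
  void addKnownBits(base_t Bits) {
    Known |= Bits;
    Assumed |= Known;
    assert((Known & ~Assumed) == 0 && "known bits must be assumed");
  }

  // Assumed bits can be dropped, but never below what is already known.
  void removeAssumedBits(base_t Bits) {
    Assumed = (Assumed & ~Bits) | Known;
  }

  // Optimistic fixpoint: the known state catches up with the assumed one.
  void indicateOptimisticFixpoint() { Known = Assumed; }

  // Pessimistic fixpoint: the assumed state falls back to the known one.
  void indicatePessimisticFixpoint() { Assumed = Known; }
};
```

Because known bits only grow and assumed bits only shrink, repeated updates move the two sets toward each other, which is the monotonicity argument the original comment relies on.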

	(249 lines not shown)
	};			};

	Pass *createAttributorLegacyPass();			Pass *createAttributorLegacyPass();

	/// ----------------------------------------------------------------------------			/// ----------------------------------------------------------------------------
	/// Abstract Attribute Classes			/// Abstract Attribute Classes
	/// ----------------------------------------------------------------------------			/// ----------------------------------------------------------------------------

	struct AANoSync : public AbstractAttribute, BooleanState {			struct AANoSync : public AbstractAttribute {
				jdoerfert: You don't need to inherit from `BooleanState` here. That is an implementation detail we probably want to hide. Let `AANoSyncFunction` inherit from `BooleanState` but keep the functions `isAssumedNoSync` and `isKnownNoSync` here. They will not have an implementation and are overridden in `AANoSyncFunction`.
	/// An abstract interface for all nosync attributes.			/// An abstract interface for all nosync attributes.
	AANoSync(Value &V, InformationCache &InfoCache)			AANoSync(Value &V, InformationCache &InfoCache)
	: AbstractAttribute(V, InfoCache) {}			: AbstractAttribute(V, InfoCache) {}

	/// See AbstractAttribute::getAsStr().
	virtual const std::string getAsStr() const override {
	return getAssumed() ? "nosync" : "may-sync";
	}

	/// See AbstractAttribute::getAttrKind().			/// See AbstractAttribute::getAttrKind().
	virtual Attribute::AttrKind getAttrKind() const override {			virtual Attribute::AttrKind getAttrKind() const override {
	return Attribute::None;			return ID;
	}			}

				static constexpr Attribute::AttrKind ID =
				Attribute::AttrKind(Attribute::NoSync);

	/// Returns true if "nosync" is assumed.			/// Returns true if "nosync" is assumed.
	bool isAssumedNoSync() const { return getAssumed(); }			virtual bool isAssumedNoSync() const = 0;

	/// Returns true if "nosync" is known.			/// Returns true if "nosync" is known.
	bool isKnownNoSync() const { return getKnown(); }			virtual bool isKnownNoSync() const = 0;
	}; // namespace llvm			};
				arsenm: Don't need virtual, only override

	} // end namespace llvm			} // end namespace llvm
				jdoerfert: Copy and paste, this is not a namespace ;)

	#endif // LLVM_TRANSFORMS_IPO_FUNCTIONATTRS_H			#endif // LLVM_TRANSFORMS_IPO_FUNCTIONATTRS_H
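
The review comments on `AANoSync` ask for two things: keep the boolean state out of the abstract interface, and write `override` without repeating `virtual` in the implementing class. The sketch below shows that layering in standalone C++. All names (`BooleanStateModel`, `NoSyncInterface`, `NoSyncFunctionModel`) are hypothetical stand-ins chosen for this illustration and are not the classes from the patch.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for the boolean known/assumed state.
struct BooleanStateModel {
  bool Known = false;  // Nothing is proven initially.
  bool Assumed = true; // Optimistically assume the attribute holds.
  bool getKnown() const { return Known; }
  bool getAssumed() const { return Assumed; }
};

// Abstract interface: exposes the queries only, hides how state is stored.
struct NoSyncInterface {
  virtual ~NoSyncInterface() = default;
  virtual bool isAssumedNoSync() const = 0;
  virtual bool isKnownNoSync() const = 0;
};

// The implementation owns the state. `override` alone is enough here; it
// already implies that the function is virtual.
struct NoSyncFunctionModel final : NoSyncInterface, BooleanStateModel {
  bool isAssumedNoSync() const override { return getAssumed(); }
  bool isKnownNoSync() const override { return getKnown(); }
  std::string getAsStr() const { return getAssumed() ? "nosync" : "may-sync"; }
};
```

Clients that hold only a `NoSyncInterface` reference can query the attribute without depending on the state representation, which is the hiding the review asked for.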

llvm/lib/AsmParser/LLLexer.cpp

#define KEYWORD(STR) \
KEYWORD(noduplicate);		KEYWORD(noduplicate);
KEYWORD(noimplicitfloat);		KEYWORD(noimplicitfloat);
KEYWORD(noinline);		KEYWORD(noinline);
KEYWORD(norecurse);		KEYWORD(norecurse);
KEYWORD(nonlazybind);		KEYWORD(nonlazybind);
KEYWORD(nonnull);		KEYWORD(nonnull);
KEYWORD(noredzone);		KEYWORD(noredzone);
KEYWORD(noreturn);		KEYWORD(noreturn);
		KEYWORD(nosync);
KEYWORD(nocf_check);		KEYWORD(nocf_check);
KEYWORD(nounwind);		KEYWORD(nounwind);
KEYWORD(optforfuzzing);		KEYWORD(optforfuzzing);
KEYWORD(optnone);		KEYWORD(optnone);
KEYWORD(optsize);		KEYWORD(optsize);
KEYWORD(readnone);		KEYWORD(readnone);
KEYWORD(readonly);		KEYWORD(readonly);
KEYWORD(returned);		KEYWORD(returned);

llvm/lib/AsmParser/LLParser.cpp

while (true) {
case lltok::kw_nobuiltin: B.addAttribute(Attribute::NoBuiltin); break;		case lltok::kw_nobuiltin: B.addAttribute(Attribute::NoBuiltin); break;
case lltok::kw_noduplicate: B.addAttribute(Attribute::NoDuplicate); break;		case lltok::kw_noduplicate: B.addAttribute(Attribute::NoDuplicate); break;
case lltok::kw_noimplicitfloat:		case lltok::kw_noimplicitfloat:
B.addAttribute(Attribute::NoImplicitFloat); break;		B.addAttribute(Attribute::NoImplicitFloat); break;
case lltok::kw_noinline: B.addAttribute(Attribute::NoInline); break;		case lltok::kw_noinline: B.addAttribute(Attribute::NoInline); break;
case lltok::kw_nonlazybind: B.addAttribute(Attribute::NonLazyBind); break;		case lltok::kw_nonlazybind: B.addAttribute(Attribute::NonLazyBind); break;
case lltok::kw_noredzone: B.addAttribute(Attribute::NoRedZone); break;		case lltok::kw_noredzone: B.addAttribute(Attribute::NoRedZone); break;
case lltok::kw_noreturn: B.addAttribute(Attribute::NoReturn); break;		case lltok::kw_noreturn: B.addAttribute(Attribute::NoReturn); break;
		case lltok::kw_nosync: B.addAttribute(Attribute::NoSync); break;
case lltok::kw_nocf_check: B.addAttribute(Attribute::NoCfCheck); break;		case lltok::kw_nocf_check: B.addAttribute(Attribute::NoCfCheck); break;
case lltok::kw_norecurse: B.addAttribute(Attribute::NoRecurse); break;		case lltok::kw_norecurse: B.addAttribute(Attribute::NoRecurse); break;
case lltok::kw_nounwind: B.addAttribute(Attribute::NoUnwind); break;		case lltok::kw_nounwind: B.addAttribute(Attribute::NoUnwind); break;
case lltok::kw_optforfuzzing:		case lltok::kw_optforfuzzing:
B.addAttribute(Attribute::OptForFuzzing); break;		B.addAttribute(Attribute::OptForFuzzing); break;
case lltok::kw_optnone: B.addAttribute(Attribute::OptimizeNone); break;		case lltok::kw_optnone: B.addAttribute(Attribute::OptimizeNone); break;
case lltok::kw_optsize: B.addAttribute(Attribute::OptimizeForSize); break;		case lltok::kw_optsize: B.addAttribute(Attribute::OptimizeForSize); break;
case lltok::kw_readnone: B.addAttribute(Attribute::ReadNone); break;		case lltok::kw_readnone: B.addAttribute(Attribute::ReadNone); break;

llvm/lib/AsmParser/LLToken.h

enum Kind {
kw_noduplicate,		kw_noduplicate,
kw_noimplicitfloat,		kw_noimplicitfloat,
kw_noinline,		kw_noinline,
kw_norecurse,		kw_norecurse,
kw_nonlazybind,		kw_nonlazybind,
kw_nonnull,		kw_nonnull,
kw_noredzone,		kw_noredzone,
kw_noreturn,		kw_noreturn,
		kw_nosync,
kw_nocf_check,		kw_nocf_check,
kw_nounwind,		kw_nounwind,
kw_optforfuzzing,		kw_optforfuzzing,
kw_optnone,		kw_optnone,
kw_optsize,		kw_optsize,
kw_readnone,		kw_readnone,
kw_readonly,		kw_readonly,
kw_returned,		kw_returned,

llvm/lib/Bitcode/Reader/BitcodeReader.cpp

static uint64_t getRawAttributeMask(Attribute::AttrKind Val) {
case Attribute::SanitizeHWAddress: return 1ULL << 56;		case Attribute::SanitizeHWAddress: return 1ULL << 56;
case Attribute::NoCfCheck: return 1ULL << 57;		case Attribute::NoCfCheck: return 1ULL << 57;
case Attribute::OptForFuzzing: return 1ULL << 58;		case Attribute::OptForFuzzing: return 1ULL << 58;
case Attribute::ShadowCallStack: return 1ULL << 59;		case Attribute::ShadowCallStack: return 1ULL << 59;
case Attribute::SpeculativeLoadHardening:		case Attribute::SpeculativeLoadHardening:
return 1ULL << 60;		return 1ULL << 60;
case Attribute::ImmArg:		case Attribute::ImmArg:
return 1ULL << 61;		return 1ULL << 61;
		case Attribute::NoSync:
		return 1ULL << 62;
		jdoerfert: I think, you can remove this change. All should be fine without.
case Attribute::Dereferenceable:		case Attribute::Dereferenceable:
llvm_unreachable("dereferenceable attribute not supported in raw format");		llvm_unreachable("dereferenceable attribute not supported in raw format");
break;		break;
case Attribute::DereferenceableOrNull:		case Attribute::DereferenceableOrNull:
llvm_unreachable("dereferenceable_or_null attribute not supported in raw "		llvm_unreachable("dereferenceable_or_null attribute not supported in raw "
"format");		"format");
break;		break;
case Attribute::ArgMemOnly:		case Attribute::ArgMemOnly:
static Attribute::AttrKind getAttrFromCode(uint64_t Code) {
case bitc::ATTR_KIND_DEREFERENCEABLE_OR_NULL:		case bitc::ATTR_KIND_DEREFERENCEABLE_OR_NULL:
return Attribute::DereferenceableOrNull;		return Attribute::DereferenceableOrNull;
case bitc::ATTR_KIND_ALLOC_SIZE:		case bitc::ATTR_KIND_ALLOC_SIZE:
return Attribute::AllocSize;		return Attribute::AllocSize;
case bitc::ATTR_KIND_NO_RED_ZONE:		case bitc::ATTR_KIND_NO_RED_ZONE:
return Attribute::NoRedZone;		return Attribute::NoRedZone;
case bitc::ATTR_KIND_NO_RETURN:		case bitc::ATTR_KIND_NO_RETURN:
return Attribute::NoReturn;		return Attribute::NoReturn;
		case bitc::ATTR_KIND_NOSYNC:
		return Attribute::NoSync;
case bitc::ATTR_KIND_NOCF_CHECK:		case bitc::ATTR_KIND_NOCF_CHECK:
return Attribute::NoCfCheck;		return Attribute::NoCfCheck;
case bitc::ATTR_KIND_NO_UNWIND:		case bitc::ATTR_KIND_NO_UNWIND:
return Attribute::NoUnwind;		return Attribute::NoUnwind;
case bitc::ATTR_KIND_OPT_FOR_FUZZING:		case bitc::ATTR_KIND_OPT_FOR_FUZZING:
return Attribute::OptForFuzzing;		return Attribute::OptForFuzzing;
case bitc::ATTR_KIND_OPTIMIZE_FOR_SIZE:		case bitc::ATTR_KIND_OPTIMIZE_FOR_SIZE:
return Attribute::OptimizeForSize;		return Attribute::OptimizeForSize;

llvm/lib/Bitcode/Writer/BitcodeWriter.cpp

static uint64_t getAttrKindEncoding(Attribute::AttrKind Kind) {
case Attribute::Dereferenceable:		case Attribute::Dereferenceable:
return bitc::ATTR_KIND_DEREFERENCEABLE;		return bitc::ATTR_KIND_DEREFERENCEABLE;
case Attribute::DereferenceableOrNull:		case Attribute::DereferenceableOrNull:
return bitc::ATTR_KIND_DEREFERENCEABLE_OR_NULL;		return bitc::ATTR_KIND_DEREFERENCEABLE_OR_NULL;
case Attribute::NoRedZone:		case Attribute::NoRedZone:
return bitc::ATTR_KIND_NO_RED_ZONE;		return bitc::ATTR_KIND_NO_RED_ZONE;
case Attribute::NoReturn:		case Attribute::NoReturn:
return bitc::ATTR_KIND_NO_RETURN;		return bitc::ATTR_KIND_NO_RETURN;
		case Attribute::NoSync:
		return bitc::ATTR_KIND_NOSYNC;
case Attribute::NoCfCheck:		case Attribute::NoCfCheck:
return bitc::ATTR_KIND_NOCF_CHECK;		return bitc::ATTR_KIND_NOCF_CHECK;
case Attribute::NoUnwind:		case Attribute::NoUnwind:
return bitc::ATTR_KIND_NO_UNWIND;		return bitc::ATTR_KIND_NO_UNWIND;
case Attribute::OptForFuzzing:		case Attribute::OptForFuzzing:
return bitc::ATTR_KIND_OPT_FOR_FUZZING;		return bitc::ATTR_KIND_OPT_FOR_FUZZING;
case Attribute::OptimizeForSize:		case Attribute::OptimizeForSize:
return bitc::ATTR_KIND_OPTIMIZE_FOR_SIZE;		return bitc::ATTR_KIND_OPTIMIZE_FOR_SIZE;
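
The reader and writer hunks above add the two directions of one mapping: `getAttrKindEncoding` turns the attribute kind into a bitcode code, and `getAttrFromCode` turns the code back into the kind. The following standalone sketch shows why the two switches have to stay in step. Only the numeric codes 60 and 61 are taken from the `LLVMBitCodes.h` hunk. The names `AttrKindModel`, `encodeKind`, and `decodeKind`, and the use of 0 as a "no encoding" value, are assumptions made for this illustration.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical miniature of the attribute kind enum.
enum class AttrKindModel { None, ImmArg, NoSync };

// Codes as they appear in the diff: ATTR_KIND_IMMARG = 60 and
// ATTR_KIND_NOSYNC = 61.
constexpr std::uint64_t CodeImmArg = 60;
constexpr std::uint64_t CodeNoSync = 61;

// Writer direction: kind to code. Returning 0 for "no encoding" is an
// assumption of this sketch.
std::uint64_t encodeKind(AttrKindModel Kind) {
  switch (Kind) {
  case AttrKindModel::ImmArg:
    return CodeImmArg;
  case AttrKindModel::NoSync:
    return CodeNoSync;
  case AttrKindModel::None:
    break;
  }
  return 0;
}

// Reader direction: code to kind. Unknown codes map to None.
AttrKindModel decodeKind(std::uint64_t Code) {
  switch (Code) {
  case CodeImmArg:
    return AttrKindModel::ImmArg;
  case CodeNoSync:
    return AttrKindModel::NoSync;
  default:
    return AttrKindModel::None;
  }
}
```

A round-trip check such as `decodeKind(encodeKind(K)) == K` for every encodable kind is a cheap way to catch a case that was added to only one of the two switches.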

llvm/lib/IR/Attributes.cpp

std::string Attribute::getAsString(bool InAttrGrp) const {
if (hasAttribute(Attribute::NonLazyBind))		if (hasAttribute(Attribute::NonLazyBind))
return "nonlazybind";		return "nonlazybind";
if (hasAttribute(Attribute::NonNull))		if (hasAttribute(Attribute::NonNull))
return "nonnull";		return "nonnull";
if (hasAttribute(Attribute::NoRedZone))		if (hasAttribute(Attribute::NoRedZone))
return "noredzone";		return "noredzone";
if (hasAttribute(Attribute::NoReturn))		if (hasAttribute(Attribute::NoReturn))
return "noreturn";		return "noreturn";
		if(hasAttribute(Attribute::NoSync))
		return "nosync";
if (hasAttribute(Attribute::NoCfCheck))		if (hasAttribute(Attribute::NoCfCheck))
return "nocf_check";		return "nocf_check";
if (hasAttribute(Attribute::NoRecurse))		if (hasAttribute(Attribute::NoRecurse))
return "norecurse";		return "norecurse";
if (hasAttribute(Attribute::NoUnwind))		if (hasAttribute(Attribute::NoUnwind))
return "nounwind";		return "nounwind";
if (hasAttribute(Attribute::OptForFuzzing))		if (hasAttribute(Attribute::OptForFuzzing))
return "optforfuzzing";		return "optforfuzzing";

llvm/lib/IR/Verifier.cpp

...	void Verifier::visitModuleFlagCGProfileEntry(const MDOperand &MDO) {
Assert(Count && Count->getType()->isIntegerTy(),		Assert(Count && Count->getType()->isIntegerTy(),
"expected an integer constant", Node->getOperand(2));		"expected an integer constant", Node->getOperand(2));
}		}

/// Return true if this attribute kind only applies to functions.		/// Return true if this attribute kind only applies to functions.
static bool isFuncOnlyAttr(Attribute::AttrKind Kind) {		static bool isFuncOnlyAttr(Attribute::AttrKind Kind) {
switch (Kind) {		switch (Kind) {
case Attribute::NoReturn:		case Attribute::NoReturn:
		case Attribute::NoSync:
case Attribute::NoCfCheck:		case Attribute::NoCfCheck:
case Attribute::NoUnwind:		case Attribute::NoUnwind:
case Attribute::NoInline:		case Attribute::NoInline:
case Attribute::AlwaysInline:		case Attribute::AlwaysInline:
case Attribute::OptimizeForSize:		case Attribute::OptimizeForSize:
case Attribute::StackProtect:		case Attribute::StackProtect:
case Attribute::StackProtectReq:		case Attribute::StackProtectReq:
case Attribute::StackProtectStrong:		case Attribute::StackProtectStrong:

llvm/lib/Transforms/IPO/Attributor.cpp

//===- Attributor.cpp - Module-wide attribute deduction -------------------===//		//===- Attributor.cpp - Module-wide attribute deduction -------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
		jdoerfert: Leftover comments?
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file implements an inter procedural pass that deduces and/or propagating		// This file implements an inter procedural pass that deduces and/or propagating
// attributes. This is done in an abstract interpretation style fixpoint		// attributes. This is done in an abstract interpretation style fixpoint
// iteration. See the Attributor.h file comment and the class descriptions in		// iteration. See the Attributor.h file comment and the class descriptions in
// that file for more information.		// that file for more information.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

		jdoerfert: (only a leftover of an old attributor patch version)
#include "llvm/Transforms/IPO/Attributor.h"		#include "llvm/Transforms/IPO/Attributor.h"

#include "llvm/ADT/SetVector.h"		#include "llvm/ADT/SetVector.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/Analysis/GlobalsModRef.h"		#include "llvm/Analysis/GlobalsModRef.h"
#include "llvm/IR/Argument.h"		#include "llvm/IR/Argument.h"
...
ChangeStatus llvm::operator|(ChangeStatus l, ChangeStatus r) {		ChangeStatus llvm::operator|(ChangeStatus l, ChangeStatus r) {
return l == ChangeStatus::CHANGED ? l : r;		return l == ChangeStatus::CHANGED ? l : r;
}		}
ChangeStatus llvm::operator&(ChangeStatus l, ChangeStatus r) {		ChangeStatus llvm::operator&(ChangeStatus l, ChangeStatus r) {
return l == ChangeStatus::UNCHANGED ? l : r;		return l == ChangeStatus::UNCHANGED ? l : r;
}		}
///}		///}
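Editorial sketch of how the two `ChangeStatus` operators above combine results. The enum here is a local stand-in (its enumerator order is an assumption, not taken from Attributor.h), and `combineAll` and `demoCombine` are hypothetical helpers added only for illustration.

```cpp
#include <cassert>

// Local stand-in for llvm::ChangeStatus; the enumerator order is an assumption.
enum class ChangeStatus { UNCHANGED, CHANGED };

// CHANGED wins: the result is CHANGED if either side is CHANGED.
constexpr ChangeStatus operator|(ChangeStatus L, ChangeStatus R) {
  return L == ChangeStatus::CHANGED ? L : R;
}

// UNCHANGED wins: the result is CHANGED only if both sides are CHANGED.
constexpr ChangeStatus operator&(ChangeStatus L, ChangeStatus R) {
  return L == ChangeStatus::UNCHANGED ? L : R;
}

// Hypothetical helper: fold the results of several update steps with `|`.
inline ChangeStatus combineAll(const ChangeStatus *Results, int N) {
  ChangeStatus S = ChangeStatus::UNCHANGED;
  for (int I = 0; I < N; ++I)
    S = S | Results[I];
  return S;
}

// Hypothetical usage: one changed step is enough to report CHANGED overall.
inline void demoCombine() {
  const ChangeStatus Steps[] = {ChangeStatus::UNCHANGED, ChangeStatus::CHANGED,
                                ChangeStatus::UNCHANGED};
  assert(combineAll(Steps, 3) == ChangeStatus::CHANGED);
}
```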



/// Helper to adjust the statistics.		/// Helper to adjust the statistics.
static void bookkeeping(AbstractAttribute::ManifestPosition MP,		static void bookkeeping(AbstractAttribute::ManifestPosition MP,
const Attribute &Attr) {		const Attribute &Attr) {
if (!AreStatisticsEnabled())		if (!AreStatisticsEnabled())
return;		return;

if (Attr.isStringAttribute()) {		if (Attr.isStringAttribute()) {
StringRef StringAttr = Attr.getKindAsString();		StringRef StringAttr = Attr.getKindAsString();
if (StringAttr == "nosync")		if (StringAttr == "nosync")
		jdoerfert: You have to check against "nosync".
NumFnNoSync++;		NumFnNoSync++;
return;		return;
}		}
		jdoerfert: This should now be checked in the switch below, it is an enum attribute now.

if (!Attr.isEnumAttribute())		if (!Attr.isEnumAttribute())
return;		return;
switch (Attr.getKindAsEnum()) {		switch (Attr.getKindAsEnum()) {
default:		default:
return;		return;
}		}
}		}
...
}		}

const Function &AbstractAttribute::getAnchorScope() const {		const Function &AbstractAttribute::getAnchorScope() const {
return const_cast<AbstractAttribute *>(this)->getAnchorScope();		return const_cast<AbstractAttribute *>(this)->getAnchorScope();
}		}

/// ------------------------ NoSync Function Attribute -------------------------		/// ------------------------ NoSync Function Attribute -------------------------

struct AANoSyncFunction : AANoSync {		struct AANoSyncFunction : AANoSync, BooleanState {
		jdoerfert: I forgot that before but I think it is better if we split it into a `struct AANoSync` in the header and a `struct AANoSyncFunction` in the cpp file. Some functions would go in the generic header struct, but not getState, getManifestPosition, updateImpl, and ID. This makes it easier to use the result in other attributes.

AANoSyncFunction(Function &F, InformationCache &InfoCache)		AANoSyncFunction(Function &F, InformationCache &InfoCache)
: AANoSync(F, InfoCache) {}		: AANoSync(F, InfoCache) {}

/// See AbstractAttribute::getState()		/// See AbstractAttribute::getState()
/// {		/// {
AbstractState &getState() override { return *this; }		AbstractState &getState() override { return *this; }
const AbstractState &getState() const override { return *this; }		const AbstractState &getState() const override { return *this; }
/// }		/// }

/// See AbstractAttribute::getManifestPosition().		/// See AbstractAttribute::getManifestPosition().
virtual ManifestPosition getManifestPosition() const override {		virtual ManifestPosition getManifestPosition() const override {
return MP_FUNCTION;		return MP_FUNCTION;
}		}

		virtual const std::string getAsStr() const override {
		return getAssumed() ? "nosync" : "may-sync";
		}

		jdoerfert: 2 new lines.
/// See AbstractAttribute::updateImpl(...).		/// See AbstractAttribute::updateImpl(...).
virtual ChangeStatus updateImpl(Attributor &A) override;		virtual ChangeStatus updateImpl(Attributor &A) override;

/// Return deduced attributes in \p Attrs.		/// Return deduced attributes in \p Attrs.
virtual void		// virtual void
getDeducedAttributes(SmallVectorImpl<Attribute> &Attrs) const override {		// getDeducedAttributes(SmallVectorImpl<Attribute> &Attrs) const override {
LLVMContext &Ctx = AnchoredVal.getContext();		// LLVMContext &Ctx = AnchoredVal.getContext();
Attrs.emplace_back(Attribute::get(Ctx, "nosync"));		// Attrs.emplace_back(Attribute::get(Ctx, "nosync"));
}		// }
		jdoerfert: Delete this please.

static constexpr Attribute::AttrKind ID =		/// See AANoSync::isAssumedNoSync()
Attribute::AttrKind(Attribute::None + 1);		virtual bool isAssumedNoSync() const override { return getAssumed(); }

		jdoerfert: The above functions should have a comment referring to the base class.
		/// See AANoSync::isKnownNoSync()
		virtual bool isKnownNoSync() const override { return getKnown(); }

		jdoerfert: This has to go in the base class I think.
/// Helper function used to determine whether an instruction is non-relaxed		/// Helper function used to determine whether an instruction is non-relaxed
		jdoerfert: Comment needs to be updated.
/// atomic. In other words, if an atomic instruction does not have unordered		/// atomic. In other words, if an atomic instruction does not have unordered
		jdoerfert: Typo: "if"
/// or monotonic ordering		/// or monotonic ordering
static bool isNonRelaxedAtomic(Instruction *I);		static bool isNonRelaxedAtomic(Instruction *I);
		jdoerfert: Make it a static member function if possible; that way we can reuse it more easily. Same for isVolatile.

		jdoerfert: "if"
/// Helper function used to determine whether an instruction is volatile.		/// Helper function used to determine whether an instruction is volatile.
		jdoerfert: Copy & paste
static bool isVolatile(Instruction *I);		static bool isVolatile(Instruction *I);
};		};

bool AANoSyncFunction::isNonRelaxedAtomic(Instruction *I) {		bool AANoSyncFunction::isNonRelaxedAtomic(Instruction *I) {
if (!I->isAtomic())		if (!I->isAtomic())
return false;		return false;

		jdoerfert: Make it `static` or member function. Also, describe what it does in the comment.
AtomicOrdering Ordering;		AtomicOrdering Ordering;
switch (I->getOpcode()) {		switch (I->getOpcode()) {
case Instruction::AtomicRMW:		case Instruction::AtomicRMW:
		jdoerfert: Put the documentation on the declaration.
Ordering = cast<AtomicRMWInst>(I)->getOrdering();		Ordering = cast<AtomicRMWInst>(I)->getOrdering();
break;		break;
case Instruction::Store:		case Instruction::Store:
Ordering = cast<StoreInst>(I)->getOrdering();		Ordering = cast<StoreInst>(I)->getOrdering();
break;		break;
case Instruction::Load:		case Instruction::Load:
Ordering = cast<LoadInst>(I)->getOrdering();		Ordering = cast<LoadInst>(I)->getOrdering();
break;		break;
case Instruction::Fence:		case Instruction::Fence:
Ordering = cast<FenceInst>(I)->getOrdering();		Ordering = cast<FenceInst>(I)->getOrdering();
		jdoerfert: Description missing.
		jfb: A fence with `singlethread` sync scope doesn't sync with other threads, even if `seq_cst`.
break;		break;
case Instruction::AtomicCmpXchg: {		case Instruction::AtomicCmpXchg: {
		jdoerfert: Given that Success and Failure are only needed in this case you can declare them here. To do so you need to add brackets around the case: `case Instruction::AtomicCmpXchg: { ... }`
AtomicOrdering Success = cast<AtomicCmpXchgInst>(I)->getSuccessOrdering();		AtomicOrdering Success = cast<AtomicCmpXchgInst>(I)->getSuccessOrdering();
AtomicOrdering Failure = cast<AtomicCmpXchgInst>(I)->getFailureOrdering();		AtomicOrdering Failure = cast<AtomicCmpXchgInst>(I)->getFailureOrdering();
// Only if both are relaxed, than it can be treated as relaxed.		// Only if both are relaxed, than it can be treated as relaxed.
		jfb: "then"
// Otherwise it is non-relaxed.		// Otherwise it is non-relaxed.
if (Success == AtomicOrdering::Unordered ||		if (Success != AtomicOrdering::Unordered &&
		jdoerfert: Why is it sufficient that one ordering is "weak enough"? Don't we have to test both? Either way, we need a comment to explain what is happening here.
Success == AtomicOrdering::Monotonic)		Success != AtomicOrdering::Monotonic)
		jdoerfert: Can you describe the logic here?
		sstefan1 (author): I had to return one. So if `Success` isn't interesting it returns `Failure` ordering. Otherwise it doesn't matter since `Success` already syncs. I didn't give this much thought; if you have any suggestions, I'll apply them.
return false;
if (Failure == AtomicOrdering::Unordered ||
Failure == AtomicOrdering::Monotonic)
return false;
return true;		return true;
		if (Failure != AtomicOrdering::Unordered &&
		jdoerfert: I'm still confused. The pessimistic return value is `true`, correct? If so, why can we return `false` after we've seen only the success ordering? Don't we have to look at both success and failure ordering and only if both are "fine" we can return `false`?
		sstefan1 (author): I agree. I messed this up. Before I update, does this look ok? `if (Success != AtomicOrdering::Unordered || Success != AtomicOrdering::Monotonic) return true; if (Failure != AtomicOrdering::Unordered || Failure != AtomicOrdering::Monotonic) return true; return false;`
		Failure != AtomicOrdering::Monotonic)
		return true;
		return false;
		jdoerfert: No worries, all good. Please also add a comment to explain what this means and why we return true.
}		}
default:		default:
		jdoerfert: Indentation. And maybe add a few more words here ;)
// Unknown atomic, assume non-relaxed.		// Unknown atomic, assume non-relaxed.
return true;		return true;
		jdoerfert: Shouldn't we here directly assume sync as it is atomic but we don't know what kind?
		sstefan1 (author): Yes, my bad. I'll return true.
		jfb: We probably want `llvm_unreachable` here, so the code gets updated if we add new atomic operations.
}		}

// Relaxed.		// Relaxed.
		jdoerfert: What about calls? Maybe you need to look at all side-effect instructions?
if (Ordering == AtomicOrdering::Unordered ||		if (Ordering == AtomicOrdering::Unordered ||
Ordering == AtomicOrdering::Monotonic)		Ordering == AtomicOrdering::Monotonic)
return false;		return false;
return true;		return true;
}		}
		jdoerfert: You have to add fences here as well and look at calls explicitly (as they are not in the above opcode list). Alternatively you could do: Use the `InformationCache::getReadOrWriteInstsForFunction` method to get all potential read & write instructions. That will include calls you need to look at and everything above. You will need to look at calls first, if they are fine, you can check for volatile and atomic. The code to determine if a call is OK is already present down there.
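Editorial sketch of the cmpxchg ordering check discussed in the comments above. It uses a local `Ordering` enum as a stand-in for `llvm::AtomicOrdering`, and the helper names `isRelaxed`, `isNonRelaxedCmpXchg`, `proposedWithOr`, and `checkOrderings` are hypothetical. The point it illustrates: a cmpxchg counts as relaxed only when both the success and the failure ordering are relaxed, which is what the `&&` form on the right-hand side of the diff computes. The `||` form floated in the review reply (`X != Unordered || X != Monotonic`) holds for every ordering, since no value equals both constants at once.

```cpp
#include <cassert>

// Local stand-in for llvm::AtomicOrdering; for illustration only.
enum class Ordering {
  Unordered,
  Monotonic,
  Acquire,
  Release,
  AcquireRelease,
  SequentiallyConsistent
};

// "Relaxed" in the sense used by the patch: unordered or monotonic.
constexpr bool isRelaxed(Ordering O) {
  return O == Ordering::Unordered || O == Ordering::Monotonic;
}

// Non-relaxed if either ordering is stronger than monotonic, i.e. relaxed
// only when both the success and the failure ordering are relaxed.
constexpr bool isNonRelaxedCmpXchg(Ordering Success, Ordering Failure) {
  return !isRelaxed(Success) || !isRelaxed(Failure);
}

// The `||` form from the review reply: true for every possible ordering.
constexpr bool proposedWithOr(Ordering O) {
  return O != Ordering::Unordered || O != Ordering::Monotonic;
}

// Hypothetical usage checks.
inline void checkOrderings() {
  assert(!isNonRelaxedCmpXchg(Ordering::Monotonic, Ordering::Monotonic));
  assert(isNonRelaxedCmpXchg(Ordering::Acquire, Ordering::Monotonic));
  assert(isNonRelaxedCmpXchg(Ordering::Monotonic, Ordering::Acquire));
  assert(proposedWithOr(Ordering::Monotonic)); // tautology, even when relaxed
}
```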

bool AANoSyncFunction::isVolatile(Instruction *I) {		bool AANoSyncFunction::isVolatile(Instruction *I) {
		jfb: You're missing `memcpy` and similar intrinsics, I think you want to handle them here and not in the generic intrinsic handling.
switch (I->getOpcode()) {		switch (I->getOpcode()) {
case Instruction::AtomicRMW:		case Instruction::AtomicRMW:
return cast<AtomicRMWInst>(I)->isVolatile();		return cast<AtomicRMWInst>(I)->isVolatile();
case Instruction::Store:		case Instruction::Store:
return cast<StoreInst>(I)->isVolatile();		return cast<StoreInst>(I)->isVolatile();
case Instruction::Load:		case Instruction::Load:
		jdoerfert: Please use the type here and start variables with an upper case letter.
return cast<LoadInst>(I)->isVolatile();		return cast<LoadInst>(I)->isVolatile();
case Instruction::AtomicCmpXchg:		case Instruction::AtomicCmpXchg:
return cast<AtomicCmpXchgInst>(I)->isVolatile();		return cast<AtomicCmpXchgInst>(I)->isVolatile();
default:		default:
		jdoerfert: Only do the stuff below if `I` is actually a call, so if `ICS` is not `null`. If you run this, it should crash on you right now because you access ICS unconditionally.
return false;		return false;
}		}
}		}
		jdoerfert: I think `getReadOrWriteInstsForFunction` will never pick up a `readnone` call as it will neither read nor write memory.
		sstefan1 (author): So is it safe to remove it then, as it does not have side-effects?

ChangeStatus AANoSyncFunction::updateImpl(Attributor &A) {		ChangeStatus AANoSyncFunction::updateImpl(Attributor &A) {
Function &F = getAnchorScope();		Function &F = getAnchorScope();

		jdoerfert: This is not a fixpoint. UpdateImpl is called multiple times (potentially). Remove the fixpoint call.
		arsenm: It would be clearer to cast to MemIntrinsic and check isVolatile
/// We are looking for volatile instructions or Non-Relaxed atomics.		/// We are looking for volatile instructions or Non-Relaxed atomics.
/// FIXME: We should ipmrove the handling of intrinsics.		/// FIXME: We should ipmrove the handling of intrinsics.
		jdoerfert: I think the "unknown" case is handled the wrong way here. Shouldn't it be: `if (Arg->getType()->isIntegerTy(1) && cast<ConstantInt>(Arg)->getValue() == 0) return true; return false;` such that "unknown" values, e.g., `%cmp = icmp ...` used as the 4th argument will conservatively make it sync? (+ Test case for this)
		sstefan1 (author): 4th argument, isvolatile, is `immarg`, so I guess this is not necessary?
		jdoerfert: Agreed, not necessary. However, if you keep it this way, add the above reasoning to the comment, it confused me now and it can easily confuse the next person. My advice, just swap the order to make it easier for people now and in the future ;)
for (Instruction *I : InfoCache.getReadOrWriteInstsForFunction(F)) {		for (Instruction *I : InfoCache.getReadOrWriteInstsForFunction(F)) {
ImmutableCallSite ICS(I);		ImmutableCallSite ICS(I);
auto NoSyncAA = A.getAAFor<AANoSyncFunction>(this, *I);		auto NoSyncAA = A.getAAFor<AANoSyncFunction>(this, *I);

		jdoerfert: You need to check volatile and atomic for all instructions I guess and for calls nosync as well
if (!ICS && (!NoSyncAA || !NoSyncAA->isAssumedNoSync()) &&		if (ICS && (!NoSyncAA || !NoSyncAA->isAssumedNoSync()) &&
		jdoerfert: I think the `!` in front of ICS is a problem. Did you run this?
!ICS.hasFnAttr("nosync")) {		!ICS.hasFnAttr(Attribute::NoSync)) {
		jdoerfert: Can we have a single call to `isVolatile`, maybe always call that one and `getOrdering` and decide on the result what to do. That would mean move `I->isAtomic()` into `getOrdering()` and ensure we catch all opcodes in the switch (so the default prints an error).
		sstefan1 (author): If I do it this way, I think it would be better to change `AtomicOrdering getOrdering()` to `bool isSyncOrdering()` or whatever is appropriate for the name. It can then return true if ordering is not Unordered or Monotonic. That way everything can be checked in one `if`. Regarding "and ensure we catch all opcodes in the switch (so the default prints an error)": I only miss GetElementPtr and alloca, which are not of great interest here, if I'm not wrong. But I can add them as well.
indicatePessimisticFixpoint();		indicatePessimisticFixpoint();
return ChangeStatus::CHANGED;		return ChangeStatus::CHANGED;
}		}

if (!isVolatile(I) && !isNonRelaxedAtomic(I))		if (!isVolatile(I) && !isNonRelaxedAtomic(I))
continue;		continue;

indicatePessimisticFixpoint();		indicatePessimisticFixpoint();
return ChangeStatus::CHANGED;		return ChangeStatus::CHANGED;
}		}

return ChangeStatus::UNCHANGED;		return ChangeStatus::UNCHANGED;
}		}

/// ----------------------------------------------------------------------------		/// ----------------------------------------------------------------------------
/// Attributor		/// Attributor
		jfb: "improve"
		arsenm: I'm pretty sure this is repeated in several passes, and incomplete. Target intrinsics can also be considered volatile, as there is a hook to get the memory properties for them
		jdoerfert: I guess we should not reach this function with calls. If that seems reasonable, we need an assert here and change the source below to skip these checks if a call is assumed/known nosync.
/// ----------------------------------------------------------------------------		/// ----------------------------------------------------------------------------

ChangeStatus Attributor::run() {		ChangeStatus Attributor::run() {
// Initialize all abstract attributes.		// Initialize all abstract attributes.
for (AbstractAttribute *AA : AllAbstractAttributes)		for (AbstractAttribute *AA : AllAbstractAttributes)
AA->initialize(*this);		AA->initialize(*this);

LLVM_DEBUG(dbgs() << "[Attributor] Identified and initialized "		LLVM_DEBUG(dbgs() << "[Attributor] Identified and initialized "
<< AllAbstractAttributes.size()		<< AllAbstractAttributes.size()
<< " abstract attributes.\n");		<< " abstract attributes.\n");

// Now that all abstract attributes are collected and initialized we start the		// Now that all abstract attributes are collected and initialized we start the
// abstract analysis.		// abstract analysis.

unsigned IterationCounter = 1;		unsigned IterationCounter = 1;

SmallVector<AbstractAttribute *, 64> ChangedAAs;		SmallVector<AbstractAttribute *, 64> ChangedAAs;
		arsenm: No virtual necessary (and for the rest of the overrides)
SetVector<AbstractAttribute *> Worklist;		SetVector<AbstractAttribute *> Worklist;
Worklist.insert(AllAbstractAttributes.begin(), AllAbstractAttributes.end());		Worklist.insert(AllAbstractAttributes.begin(), AllAbstractAttributes.end());

do {		do {
LLVM_DEBUG(dbgs() << "\n\n[Attributor] #Iteration: " << IterationCounter		LLVM_DEBUG(dbgs() << "\n\n[Attributor] #Iteration: " << IterationCounter
<< ", Worklist size: " << Worklist.size() << "\n");		<< ", Worklist size: " << Worklist.size() << "\n");

// Add all abstract attributes that are potentially dependent on one that		// Add all abstract attributes that are potentially dependent on one that
// changed to the work list.		// changed to the work list.
for (AbstractAttribute *ChangedAA : ChangedAAs) {		for (AbstractAttribute *ChangedAA : ChangedAAs) {
auto &QuerriedAAs = QueryMap[ChangedAA];		auto &QuerriedAAs = QueryMap[ChangedAA];
Worklist.insert(QuerriedAAs.begin(), QuerriedAAs.end());		Worklist.insert(QuerriedAAs.begin(), QuerriedAAs.end());
}		}

// Reset the changed set.		// Reset the changed set.
ChangedAAs.clear();		ChangedAAs.clear();
		jdoerfert: I was puzzled by this check for a second, add a comment indicating that the above loop handles calls with read/write effects already. Mention that the fact there is a read/write effect caused us already to make sure it is `nosync` and there is consequently no need to check for `convergent`.

// Update all abstract attribute in the work list and record the ones that		// Update all abstract attribute in the work list and record the ones that
// changed.		// changed.
for (AbstractAttribute *AA : Worklist)		for (AbstractAttribute *AA : Worklist)
if (AA->update(*this) == ChangeStatus::CHANGED)		if (AA->update(*this) == ChangeStatus::CHANGED)
ChangedAAs.push_back(AA);		ChangedAAs.push_back(AA);

// Reset the work list and repopulate with the changed abstract attributes.		// Reset the work list and repopulate with the changed abstract attributes.
...	for (Instruction &I : instructions(&F)) {
// the following switch.		// the following switch.
// Note: There are no concrete attributes now so this is initially empty.		// Note: There are no concrete attributes now so this is initially empty.
switch (I.getOpcode()) {		switch (I.getOpcode()) {
default:		default:
break;		break;
}		}
if (IsInterestingOpcode)		if (IsInterestingOpcode)
InstOpcodeMap[I.getOpcode()].push_back(&I);		InstOpcodeMap[I.getOpcode()].push_back(&I);
if (I.mayReadOrWriteMemory())		if (I.mayReadOrWriteMemory())
		sstefan1 (author): Just to make sure, when using `InformationCache::getReadOrWriteInstsForFunction` I don't need this, right?
		jdoerfert: Correct.
ReadOrWriteInsts.push_back(&I);		ReadOrWriteInsts.push_back(&I);
}		}
}		}

/// Helpers to ease debugging through output streams and print calls.		/// Helpers to ease debugging through output streams and print calls.
///		///
///{		///{
raw_ostream &llvm::operator<<(raw_ostream &OS, ChangeStatus S) {		raw_ostream &llvm::operator<<(raw_ostream &OS, ChangeStatus S) {

llvm/test/Transforms/FunctionAttrs/nosync.ll

	; RUN: opt -functionattrs -S < %s | FileCheck %s --check-prefix=FNATTR			; RUN: opt -functionattrs -S < %s | FileCheck %s --check-prefix=FNATTR
	; RUN: opt -attributor -S < %s | FileCheck %s --check-prefix=ATTRIBUTOR			; RUN: opt -attributor -S < %s | FileCheck %s --check-prefix=ATTRIBUTOR
				jdoerfert: You need to enable the attributor explicitly, for now.
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"

	; Test cases designet for the "nosync" function attribute.			; Test cases designet for the nosync function attribute.
				jfb: "designet"?
	; FIXME's are used to indicate problems and missing attributes.			; FIXME's are used to indicate problems and missing attributes.

	; struct RT {			; struct RT {
	; char A;			; char A;
	; int B[10][20];			; int B[10][20];
	; char C;			; char C;
	; };			; };
	; struct ST {			; struct ST {
	; int X;			; int X;
	; double Y;			; double Y;
	; struct RT Z;			; struct RT Z;
	; };			; };
	;			;
	; int foo(struct ST s) {			; int foo(struct ST s) {
	; return &s[1].Z.B[5][13];			; return &s[1].Z.B[5][13];
	; }			; }

	; TEST 1			; TEST 1
	; attribute readnone implies "nosync"			; attribute readnone implies nosync
	%struct.RT = type { i8, [10 x [20 x i32]], i8 }			%struct.RT = type { i8, [10 x [20 x i32]], i8 }
	%struct.ST = type { i32, double, %struct.RT }			%struct.ST = type { i32, double, %struct.RT }

	; FNATTR: Function Attrs: nounwind uwtable readnone optsize ssp			; FNATTR: Function Attrs: norecurse nounwind optsize readnone ssp uwtable
	; FNATTR-NEXT: define i32 @foo(%struct.ST %s)			; FNATTR-NEXT: define nonnull i32* @foo(%struct.ST* readnone %s)
	; ATTRIBUTOR: Function Attrs: nounwind uwtable readnone optsize ssp "nosync"			; ATTRIBUTOR: Function Attrs: nosync nounwind optsize readnone ssp uwtable
	; ATTRIBUTOR-NEXT: define i32 @foo(%struct.ST %s)			; ATTRIBUTOR-NEXT: define i32* @foo(%struct.ST* %s)
	define i32* @foo(%struct.ST* %s) nounwind uwtable readnone optsize ssp {			define i32* @foo(%struct.ST* %s) nounwind uwtable readnone optsize ssp {
	entry:			entry:
	%arrayidx = getelementptr inbounds %struct.ST, %struct.ST* %s, i64 1, i32 2, i32 1, i64 5, i64 13			%arrayidx = getelementptr inbounds %struct.ST, %struct.ST* %s, i64 1, i32 2, i32 1, i64 5, i64 13
	ret i32* %arrayidx			ret i32* %arrayidx
	}			}

	; TEST 2			; TEST 2
	; atomic load with monotonic ordering			; atomic load with monotonic ordering
	; int load_monotonic(_Atomic int *num) {			; int load_monotonic(_Atomic int *num) {
	; int n = atomic_load_explicit(num, memory_order_relaxed);			; int n = atomic_load_explicit(num, memory_order_relaxed);
	; return n;			; return n;
	; }			; }

	; FNATTR: Function Attrs: norecurse nounwind uwtable			; FNATTR: Function Attrs: norecurse nounwind uwtable
	; FNATTR-NEXT: define i32 @load_monotonic(i32* nocapture readonly)			; FNATTR-NEXT: define i32 @load_monotonic(i32* nocapture readonly)
	; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable "nosync"			; ATTRIBUTOR: Function Attrs: norecurse nosync nounwind uwtable
	; ATTRIBUTOR-NEXT: define i32 @load_monotonic(i32* nocapture readonly)			; ATTRIBUTOR-NEXT: define i32 @load_monotonic(i32* nocapture readonly)
	define i32 @load_monotonic(i32* nocapture readonly) norecurse nounwind uwtable {			define i32 @load_monotonic(i32* nocapture readonly) norecurse nounwind uwtable {
	%2 = load atomic i32, i32* %0 monotonic, align 4			%2 = load atomic i32, i32* %0 monotonic, align 4
	ret i32 %2			ret i32 %2
	}			}


	; TEST 3			; TEST 3
	; atomic store with monotonic ordering.			; atomic store with monotonic ordering.
	; void store_monotonic(_Atomic int *num) {			; void store_monotonic(_Atomic int *num) {
	; atomic_load_explicit(num, memory_order_relaxed);			; atomic_load_explicit(num, memory_order_relaxed);
	; }			; }

	; FNATTR: Function Attrs: norecurse nounwind uwtable			; FNATTR: Function Attrs: norecurse nounwind uwtable
	; FNATTR-NEXT: define void @store_monotonic(i32* nocapture)			; FNATTR-NEXT: define void @store_monotonic(i32* nocapture)
	; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable "nosync"			; ATTRIBUTOR: Function Attrs: norecurse nosync nounwind uwtable
	; ATTRIBUTOR-NEXT: define void @store_monotonic(i32* nocapture)			; ATTRIBUTOR-NEXT: define void @store_monotonic(i32* nocapture)
	define void @store_monotonic(i32* nocapture) norecurse nounwind uwtable {			define void @store_monotonic(i32* nocapture) norecurse nounwind uwtable {
	store atomic i32 10, i32* %0 monotonic, align 4			store atomic i32 10, i32* %0 monotonic, align 4
	ret void			ret void
	}			}

	; TEST 4 - negative, should not deduce "nosync"			; TEST 4 - negative, should not deduce nosync
	; atomic load with acquire ordering.			; atomic load with acquire ordering.
	; int load_acquire(_Atomic int *num) {			; int load_acquire(_Atomic int *num) {
	; int n = atomic_load_explicit(num, memory_order_acquire);			; int n = atomic_load_explicit(num, memory_order_acquire);
	; return n;			; return n;
	; }			; }

	; FNATTR: Function Attrs: norecurse nounwind uwtable			; FNATTR: Function Attrs: norecurse nounwind uwtable
	; FNATTR-NEXT: define i32 @load_acquire(i32* nocapture readonly)			; FNATTR-NEXT: define i32 @load_acquire(i32* nocapture readonly)
	; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable			; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable
	; ATTRIBUTOR-NOT: "nosync"			; ATTRIBUTOR-NOT: nosync
	; ATTRIBUTOR-NEXT: define i32 @load_acquire(i32* nocapture readonly)			; ATTRIBUTOR-NEXT: define i32 @load_acquire(i32* nocapture readonly)
	define i32 @load_acquire(i32* nocapture readonly) norecurse nounwind uwtable {			define i32 @load_acquire(i32* nocapture readonly) norecurse nounwind uwtable {
	%2 = load atomic i32, i32* %0 acquire, align 4			%2 = load atomic i32, i32* %0 acquire, align 4
	ret i32 %2			ret i32 %2
	}			}

	; TEST 5 - negative, should not deduce "nosync"			; TEST 5 - negative, should not deduce nosync
	; atomic load with release ordering			; atomic load with release ordering
	; void load_release(_Atomic int *num) {			; void load_release(_Atomic int *num) {
	; atomic_store_explicit(num, 10, memory_order_release);			; atomic_store_explicit(num, 10, memory_order_release);
	; }			; }

	; FNATTR: Function Attrs: norecurse nounwind uwtable			; FNATTR: Function Attrs: norecurse nounwind uwtable
	; FNATTR-NEXT: define void @load_release(i32* nocapture)			; FNATTR-NEXT: define void @load_release(i32* nocapture)
	; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable			; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable
	; ATTRIBUTOR-NOT: "nosync"			; ATTRIBUTOR-NOT: nosync
	; ATTRIBUTOR-NEXT: define void @load_release(i32* nocapture)			; ATTRIBUTOR-NEXT: define void @load_release(i32* nocapture)
	define void @load_release(i32* nocapture) norecurse nounwind uwtable {			define void @load_release(i32* nocapture) norecurse nounwind uwtable {
	store atomic i32 10, i32* %0 release, align 4			store atomic i32 10, i32* %0 release, align 4
	ret void			ret void
	}			}

	; TEST 6 - negative, should not deduce "nosync"			; TEST 6 - negative, should not deduce nosync
	; volatile store.			; volatile store.
	; void volatile_store(volatile int *num) {			; void volatile_store(volatile int *num) {
	; *num = 14;			; *num = 14;
	; }			; }

	; FNATTR: Function Attrs: norecurse nounwind uwtable			; FNATTR: Function Attrs: norecurse nounwind uwtable
	; FNATTR-NEXT: define void @volatile_store(i32*)			; FNATTR-NEXT: define void @volatile_store(i32*)
	; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable			; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable
	; ATTRIBUTOR-NOT: "nosync"			; ATTRIBUTOR-NOT: nosync
	; ATTRIBUTOR-NEXT: define void @volatile_store(i32*)			; ATTRIBUTOR-NEXT: define void @volatile_store(i32*)
	define void @volatile_store(i32*) norecurse nounwind uwtable {			define void @volatile_store(i32*) norecurse nounwind uwtable {
	;store volatile i32 14, i32* %0, align 4, !tbaa !2
	store volatile i32 14, i32* %0, align 4			store volatile i32 14, i32* %0, align 4
				jdoerfert: Remove the commented instruction here and in the next test. Also, fix the indentation.
	ret void			ret void
	}			}

	; TEST 7 - negative, should not deduce "nosync"			; TEST 7 - negative, should not deduce nosync
	; volatile load.			; volatile load.
	; int volatile_load(volatile int *num) {			; int volatile_load(volatile int *num) {
	; int n = *num;			; int n = *num;
	; return n;			; return n;
	; }			; }

	; FNATTR: Function Attrs: norecurse nounwind uwtable			; FNATTR: Function Attrs: norecurse nounwind uwtable
	; FNATTR-NEXT: define i32 @volatile_load(i32*)			; FNATTR-NEXT: define i32 @volatile_load(i32*)
	; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable			; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable
	; ATTRIBUTOR-NOT: "nosync"			; ATTRIBUTOR-NOT: nosync
	; ATTRIBUTOR-NEXT: define i32 @volatile_load(i32*)			; ATTRIBUTOR-NEXT: define i32 @volatile_load(i32*)
	define i32 @volatile_load(i32*) norecurse nounwind uwtable {			define i32 @volatile_load(i32*) norecurse nounwind uwtable {
	;%2 = load volatile i32, i32* %0, align 4, !tbaa !2
	%2 = load volatile i32, i32* %0, align 4			%2 = load volatile i32, i32* %0, align 4
	ret i32 %2			ret i32 %2
	}			}

	; TEST 8			; TEST 8
	declare void @nosync_function() noinline nounwind uwtable
				; FNATTR: Function Attrs: noinline nosync nounwind uwtable
				; FNATTR-NEXT: declare void @nosync_function()
				jdoerfert: Isn't the "nosync" attribute missing for this function?
				; ATTRIBUTOR: Function Attrs: noinline nosync nounwind uwtable
				; ATTRIBUTOR-NEXT: declare void @nosync_function()
				declare void @nosync_function() noinline nounwind uwtable nosync

	; FNATTR: Function Attrs: noinline nounwind uwtable			; FNATTR: Function Attrs: noinline nounwind uwtable
	; FNATTR: define void @call_nosync_function()			; FNATTR-NEXT: define void @call_nosync_function()
	; ATTRIBUTOR: Function Attrs: noinline nounwind uwtable "nosync"			; ATTRIBUTOR: Function Attrs: noinline nosync nounwind uwtable
	; ATTRIBUTOR: define void @call_nosync_function()			; ATTRIBUTOR-next: define void @call_nosync_function()
	define void @call_nosync_function() nounwind uwtable noinline {			define void @call_nosync_function() nounwind uwtable noinline {
	tail call void @nosync_function() noinline nounwind uwtable			tail call void @nosync_function() noinline nounwind uwtable
	ret void			ret void
	}			}

	; TEST 9 - negative, should not deduce "nosync"			; TEST 9 - negative, should not deduce nosync

				; FNATTR: Function Attrs: noinline nounwind uwtable
				; FNATTR-NEXT: declare void @might_sync()
				; ATTRIBUTOR: Function Attrs: noinline nounwind uwtable
				; ATTRIBUTOR-NEXT: declare void @might_sync()
	declare void @might_sync() noinline nounwind uwtable			declare void @might_sync() noinline nounwind uwtable

	; FNATTR: Function Attrs: noinline nounwind uwtable			; FNATTR: Function Attrs: noinline nounwind uwtable
	; FNATTR: define void @call_might_sync()			; FNATTR-NEXT: define void @call_might_sync()
	; ATTRIBUTOR: Function Attrs: noinline nounwind uwtable			; ATTRIBUTOR: Function Attrs: noinline nounwind uwtable
	; ATTRIBUTOR-NOT: "nosync"			; ATTRIBUTOR-NOT: nosync
	; ATTRIBUTOR: define void @call_might_sync()			; ATTRIBUTOR-NEXT: define void @call_might_sync()
	define void @call_might_sync() nounwind uwtable noinline {			define void @call_might_sync() nounwind uwtable noinline {
	tail call void @might_sync() noinline nounwind uwtable			tail call void @might_sync() noinline nounwind uwtable
	ret void			ret void
	}			}

	; TEST 10 - negative, should not deduce "nosync"			; TEST 10 - negative, should not deduce nosync
	; volatile operation in same scc. Call volatile_load defined in TEST 7.			; volatile operation in same scc. Call volatile_load defined in TEST 7.

	; FNATTR: Function Attrs: noinline nounwind uwtable			; FNATTR: Function Attrs: noinline nounwind uwtable
	; FNATTR-NEXT: define i32 @scc1(i32*)			; FNATTR-NEXT: define i32 @scc1(i32*)
	; ATTRIBUTOR: Function Attrs: noinline nounwind uwtable			; ATTRIBUTOR: Function Attrs: noinline nounwind uwtable
	; ATTRIBUTOR-NOT: "nosync"			; ATTRIBUTOR-NOT: nosync
	; ATTRIBUTOR-NEXT: define i32 @scc1(i32*)			; ATTRIBUTOR-NEXT: define i32 @scc1(i32*)
	define i32 @scc1(i32*) noinline nounwind uwtable {			define i32 @scc1(i32*) noinline nounwind uwtable {
	tail call void @scc2(i32* %0);			tail call void @scc2(i32* %0);
	%val = tail call i32 @volatile_load(i32* %0);			%val = tail call i32 @volatile_load(i32* %0);
	ret i32 %val;			ret i32 %val;
	}			}

	; FNATTR: Function Attrs: noinline nounwind uwtable			; FNATTR: Function Attrs: noinline nounwind uwtable
	; FNATTR-NEXT: define void @scc2(i32*)			; FNATTR-NEXT: define void @scc2(i32*)
	; ATTRIBUTOR: Function Attrs: noinline nounwind uwtable			; ATTRIBUTOR: Function Attrs: noinline nounwind uwtable
	; ATTRIBUTOR-NOT: "nosync"			; ATTRIBUTOR-NOT: nosync
	; ATTRIBUTOR-NEXT: define void @scc2(i32*)			; ATTRIBUTOR-NEXT: define void @scc2(i32*)
	define void @scc2(i32*) noinline nounwind uwtable {			define void @scc2(i32*) noinline nounwind uwtable {
	tail call i32 @scc1(i32* %0);			tail call i32 @scc1(i32* %0);
	ret void;			ret void;
	}			}

	; TEST 11 - fences, negative			; TEST 11 - fences, negative
	; std::atomic<bool> flag(false);
	; int a;
	;			;
	; void func1(){			; void foo1(int *a, std::atomic<bool> flag){
				jdoerfert: The function names don't match the IR names.
	; a = 100;			; *a = 100;
	; atomic_thread_fence(std::memory_order_release);			; atomic_thread_fence(std::memory_order_release);
	; flag.store(true, std::memory_order_relaxed);			; flag.store(true, std::memory_order_relaxed);
	; }			; }
	;			;
	; void foo(){			; void bar(int *a, std::atomic<bool> flag){
	; while(!flag.load(std::memory_order_relaxed))			; while(!flag.load(std::memory_order_relaxed))
	; ;			; ;
	;			;
	; atomic_thread_fence(std::memory_order_acquire);			; atomic_thread_fence(std::memory_order_acquire);
	; int b = a;			; int b = *a;
	; }			; }

	%"struct.std::atomic" = type { %"struct.std::__atomic_base" }			%"struct.std::atomic" = type { %"struct.std::__atomic_base" }
	%"struct.std::__atomic_base" = type { i8 }			%"struct.std::__atomic_base" = type { i8 }

	; FNATTR: Function Attrs: norecurse nounwind uwtable			; FNATTR: Function Attrs: norecurse nounwind
	; FNATTR-NEXT: define void @foo1()			; FNATTR-NEXT: define void @foo1(i32* nocapture, %"struct.std::atomic"* nocapture)
	; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable			; ATTRIBUTOR-NOT: nosync
	; ATTRIBUTOR-NOT: "nosync"			; ATTRIBUTOR: define void @foo1(i32, %"struct.std::atomic")
	; ATTRIBUTOR-NEXT: define void @foo1()
	define void @foo1(i32 , %"struct.std::atomic") {			define void @foo1(i32, %"struct.std::atomic") {
	store i32 100, i32* %0, align 4			store i32 100, i32* %0, align 4
	fence release			fence release
	%3 = getelementptr inbounds %"struct.std::atomic", %"struct.std::atomic"* %1, i64 0, i32 0, i32 0			%3 = getelementptr inbounds %"struct.std::atomic", %"struct.std::atomic"* %1, i64 0, i32 0, i32 0
	store atomic i8 1, i8* %3 monotonic, align 1			store atomic i8 1, i8* %3 monotonic, align 1
	ret void			ret void
	}			}

	; FNATTR: Function Attrs: norecurse nounwind uwtable			; FNATTR: Function Attrs: norecurse nounwind
	; FNATTR-NEXT: define void @bar()			; FNATTR-NEXT: define void @bar(i32* nocapture readnone, %"struct.std::atomic"* nocapture readonly)
	; ATTRIBUTOR: Function Attrs: norecurse nounwind uwtable			; ATTRIBUTOR-NOT: nosync
	; ATTRIBUTOR-NOT: "nosync"			; ATTRIBUTOR: define void @bar(i32, %"struct.std::atomic")
	; ATTRIBUTOR-NEXT: define void @bar()
	define void @bar(i32 , %"struct.std::atomic") {			define void @bar(i32 , %"struct.std::atomic") {
	%3 = getelementptr inbounds %"struct.std::atomic", %"struct.std::atomic"* %1, i64 0, i32 0, i32 0			%3 = getelementptr inbounds %"struct.std::atomic", %"struct.std::atomic"* %1, i64 0, i32 0, i32 0
	br label %4			br label %4

	4: ; preds = %4, %2			4: ; preds = %4, %2
	%5 = load atomic i8, i8* %3 monotonic, align 1			%5 = load atomic i8, i8* %3 monotonic, align 1
	%6 = and i8 %5, 1			%6 = and i8 %5, 1
	%7 = icmp eq i8 %6, 0			%7 = icmp eq i8 %6, 0
	br i1 %7, label %4, label %8			br i1 %7, label %4, label %8

	8: ; preds = %4			8: ; preds = %4
	fence acquire			fence acquire
	ret void			ret void
	}			}
				nhaehnle: The negative check line here is missing.
				sstefan1 (author): I skipped the negative check line because, for now, only nosync is deduced, and if a function is not nosync there will be no Function Attrs line at all. Once we have some more attributes, I'll add the negative check. Is that ok with you?
				jdoerfert: Just give the function the `nounwind` attribute and then add the negative check line.
				jfb: I don't think you can generally treat intrinsics as `nosync`. Unless you know they're actually `nosync` you should assume that intrinsics might synchronize. For example, `int a; void i_totally_sync() { __builtin_ia32_clflush(&a); }` corresponds to `tail call void @llvm.x86.sse2.clflush(i8* bitcast (i32* @a to i8*))`. You should have a test for this, and it should definitely not be `nosync`. The other option here is to go and add a field to all intrinsics, so when creating a new one we have to figure out whether it'll definitely sync, maybe sync, or never sync. I don't think that's in scope for this patch.
				jdoerfert:
				> I don't think you can generally treat intrinsics as nosync. Unless you know they're actually nosync you should assume that intrinsics might synchronize.
				Good point. Maybe the best way (for now and in general) is to "not look for" intrinsics. Use the same logic for all instructions. That is, if it is a call and not annotated as no-sync, it may sync. The test with `llvm.cos` can be adjusted by adding `readnone` to the declaration of `llvm.cos`.
				> The other option here is to go and add a field to all intrinsics, so when creating a new one we have to figure out whether it'll definitely sync, maybe sync, or never sync. I don't think that's in scope for this patch.
				Agreed, we will have to do that for various attributes at some point (soon) but not in this patch.
				jfb: I'd still accept the volatile `mem*` intrinsics as is already done, but otherwise yeah, intrinsics should be assumed to synchronize.
				jdoerfert: Agreed.
				sstefan1 (author): Replaced the llvm.cos test with inline assembly. The llvm.cos test was my mistake, since with the current implementation it would be considered sync.
				jdoerfert: The test comment is off, and again add `nounwind` to allow for the check lines.
				jdoerfert: Copy & paste.
				jdoerfert: Shouldn't this be `nosync`? Is it?
				sstefan1 (author): Yes, this falls under copy & paste as well.
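
The rules that the tests and the comments above converge on can be summarized in a small standalone model. This is only a sketch under stated assumptions, not the code in Attributor.cpp: `Inst`, `Kind`, `Ordering`, `maySync`, and `functionIsNoSync` are hypothetical names invented for illustration, every fence is conservatively treated as synchronizing, and the special handling of the `mem*` intrinsics mentioned by jfb is left out.

```cpp
#include <vector>

// Hypothetical, simplified stand-ins for IR concepts; these are NOT LLVM classes.
enum class Ordering { NotAtomic, Unordered, Monotonic, Acquire, Release,
                      AcquireRelease, SequentiallyConsistent };
enum class Kind { Load, Store, Fence, Call, Other };

struct Inst {
  Kind kind;
  Ordering ordering;    // only meaningful for loads, stores, and fences
  bool isVolatile;      // only meaningful for loads and stores
  bool calleeIsNoSync;  // only meaningful for calls: callee is annotated nosync
};

// Returns true if the instruction has to be assumed to synchronize.
constexpr bool maySync(Inst I) {
  switch (I.kind) {
  case Kind::Load:
  case Kind::Store:
    // Volatile accesses and anything stronger than monotonic may synchronize.
    return I.isVolatile || (I.ordering != Ordering::NotAtomic &&
                            I.ordering != Ordering::Unordered &&
                            I.ordering != Ordering::Monotonic);
  case Kind::Fence:
    return true; // assumption of this sketch: all fences are conservative
  case Kind::Call:
    // Calls, including intrinsics, may sync unless the callee is known nosync.
    return !I.calleeIsNoSync;
  case Kind::Other:
    return false;
  }
  return true;
}

// A function body is nosync only if no instruction in it may synchronize.
bool functionIsNoSync(const std::vector<Inst> &Body) {
  for (const Inst &I : Body)
    if (maySync(I))
      return false;
  return true;
}

// Compile-time checks mirroring some of the test cases in nosync.ll.
static_assert(!maySync({Kind::Load, Ordering::Monotonic, false, false}), "TEST 2");
static_assert(maySync({Kind::Load, Ordering::Acquire, false, false}), "TEST 4");
static_assert(maySync({Kind::Store, Ordering::NotAtomic, true, false}), "TEST 6");
static_assert(!maySync({Kind::Call, Ordering::NotAtomic, false, true}), "TEST 8");
static_assert(maySync({Kind::Call, Ordering::NotAtomic, false, false}), "TEST 9");
static_assert(maySync({Kind::Fence, Ordering::Release, false, false}), "TEST 11");
```

In this model a body like that of `@foo1` from TEST 11 (a plain store, a `fence release`, and a monotonic atomic store) is rejected because of the fence alone, while `@store_monotonic` from TEST 3 is accepted.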

Revision Contents

Diff 204510
