This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
docs/
11/18
LangRef.rst
-
include/llvm/
-
llvm/
-
Bitcode/
1
LLVMBitCodes.h
-
IR/
-
Attributes.h
-
Attributes.td
-
Intrinsics.td
-
LLVMContext.h
-
lib/
-
AsmParser/
-
LLLexer.cpp
-
LLParser.h
-
LLParser.cpp
-
LLToken.h
-
Bitcode/
-
Reader/
-
BitcodeReader.cpp
-
Writer/
-
BitcodeWriter.cpp
-
IR/
-
AsmWriter.cpp
-
AttributeImpl.h
-
Attributes.cpp
-
LLVMContext.cpp
12/13
Verifier.cpp
-
Transforms/Utils/
-
Utils/
-
CodeExtractor.cpp
-
test/
-
Assembler/
-
invalid-byval-type3.ll
-
Bitcode/
-
attributes.ll
-
operand-bundles-bc-analyzer.ll
-
Verifier/
-
preallocated-invalid.ll
-
preallocated-valid.ll

Differential D74651

Add IR constructs for inalloca replacement preallocated call setup
ClosedPublic

Authored by aeubanks on Feb 14 2020, 2:18 PM.

Download Raw Diff

Details

Reviewers

rnk
efriedma

Commits

rG3b0450acecb6: Add IR constructs for preallocated (inalloca replacement)

Summary

Add llvm.call.setup and llvm.call.alloc instrinsics.
Add callsetup operand bundle which takes a token produced by llvm.call.setup.
Add preallocated parameter attribute, which is like byval but without the copy.

Verifier changes for these IR constructs.

See https://github.com/rnk/llvm-project/blob/call-setup-docs/llvm/docs/CallSetup.md

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

aeubanks created this revision.Feb 14 2020, 2:18 PM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 14 2020, 2:18 PM

Herald added subscribers: llvm-commits, jdoerfert, hiraditya. · View Herald Transcript

Harbormaster completed remote builds in B46561: Diff 244772.Feb 14 2020, 2:20 PM

aeubanks retitled this revision from Add intrinsics and operand bundles for inalloca replacement llvm.call.setup to [WIP] Add intrinsics and operand bundles for inalloca replacement llvm.call.setup.Feb 14 2020, 2:25 PM

Nice, that's the essence of the verifier checks, and the tests look pretty good. LLVM has an 80 character limit, and the easiest way to handle that is to use clang-format. I recommend setting up an editor integration so you can use it interactively as you edit code, or running git-clang-format before uploading.

llvm/lib/IR/Verifier.cpp
3085	You should add some tests for these checks.
3089–3090	The verifier shouldn't reject unknown bundles, the idea is to allow other bundles as an extension point.
4489	This is the right idea, but I'd make APInt locals for the getValue() results to make it shorter.
4497	I guess I'd try to do this with less conditionals: auto CallSetupBundle = ... Assert(CallSetupBundle, "using call site should have a call.setup bundle"); Assert(CallSetupBundle->Inputs.front().get() == &Call, ...);

Simplify some code, add more tests, check operand bundle type instead of checking that the value is a ConstantTokenNone

llvm/lib/IR/Verifier.cpp
3089–3090	Done. I had (and still have) no idea how "callsetup" was getting read as a LLVMContext::OB_callsetup so I wanted to make sure that this branch was actually taken.

Harbormaster completed remote builds in B46768: Diff 245288.Feb 18 2020, 3:11 PM

Add preallocated attribute, major cleanups

Fix some stuff
Add preallocated
Actually add callsetup as an operand bundle
Parse "preallocated" in ll files, add proper ptr casts in tests
Check that callsetup token is from llvm.call.setup
Remove "preallocated" from function declaration
Add comment to Attributes.td
Remove unused function
Trailing spaces

Herald added subscribers: dexonsmith, steven_wu. · View Herald TranscriptFeb 20 2020, 9:02 AM

aeubanks retitled this revision from [WIP] Add intrinsics and operand bundles for inalloca replacement llvm.call.setup to Add IR constructs for inalloca replacement llvm.call.setup.Feb 20 2020, 9:06 AM

aeubanks edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B46925: Diff 245669.Feb 20 2020, 9:06 AM

aeubanks added a reviewer: rnk.Feb 20 2020, 9:06 AM

Add and enforce matching preallocated attribute in callee parameter

Harbormaster completed remote builds in B47064: Diff 246006.Feb 21 2020, 3:16 PM

Fix bad arc diff

Harbormaster completed remote builds in B47069: Diff 246021.Feb 21 2020, 3:35 PM

Fix typo

Fix arc diff

Harbormaster completed remote builds in B47599: Diff 247282.Feb 28 2020, 9:10 AM

Harbormaster failed remote builds in B47601: Diff 247283!Feb 28 2020, 9:28 AM

I would add a test to llvm/test/Bitcode/attributes.ll for preallocated. Other than that, I think we can assume that the attribute machinery works, and we don't need dedicated round-trip testing like we did for inalloca (llvm/test/(Assembler|Bitcode)/inalloca.ll).

I basically think this looks good, but I think we need rules in llvm/docs/LangRef.rst before landing this, which means we need to wrap up the discussion on llvm-dev with Eli.

llvm/include/llvm/Bitcode/LLVMBitCodes.h
636	This set off my "is this used as the RHS of a left shift?" spidey sense, but I checked, and I think it's all good.

Add preallocated attribute to function declaration parameters in tests
Add preallocated attribute to attributes test
Based on attribute test, found some places that I missed regarding reading/writing asm files
Format

Harbormaster failed remote builds in B47843: Diff 247756!Mar 2 2020, 5:25 PM

Skip preallocated in addRawAttributeValue()
Add extra CHECK-NEXT in operand-bundles-bc-analyzer.ll

Harbormaster failed remote builds in B47939: Diff 247957!Mar 3 2020, 11:57 AM

aeubanks added a child revision: D77689: [X86] Codegen for preallocated.Apr 7 2020, 4:31 PM

Add extra verifier check

Harbormaster failed remote builds in B52265: Diff 255854!Apr 7 2020, 4:56 PM

Rebase

Harbormaster failed remote builds in B53197: Diff 257422!Apr 14 2020, 12:58 PM

Harbormaster failed remote builds in B53195: Diff 257420!

rnk added a reviewer: efriedma.Apr 15 2020, 5:34 PM

I looked through this again, and basically think all the code looks good, but I want to get input from Eli, so I added him as a reviewer.

Eli, any concerns with the design or naming before landing this? I was surprised we didn't get more name suggestions upstream.

Arthur, now that you've worked with the names a bit, do you feel like they make sense? Would you change them?

We still need a LangRef patch at some point.

I suspect we might end up tweaking the signature of llvm.call.setup, based on the llvm-dev discussion, but we can experiment in-tree, I guess.

llvm/lib/IR/Verifier.cpp
1675	Do you need to check `Attrs.getPreallocatedType()->isSized()`, or something like that? Or is that checked elsewhere?

Check isSized()
Add missing preallocated type handling in AttrBuilder

Harbormaster failed remote builds in B53476: Diff 257922!Apr 15 2020, 6:15 PM

In D74651#1985474, @rnk wrote:

Arthur, now that you've worked with the names a bit, do you feel like they make sense? Would you change them?

@llvm.call.alloc is weird because it's not allocating anything, the allocation happens in @llvm.call.setup. Maybe @llvm.call.arg is better?

llvm/lib/IR/Verifier.cpp
1675	Piggybacked off existing checks below, thanks for catching.

In D74651#1985561, @aeubanks wrote:

In D74651#1985474, @rnk wrote:

Arthur, now that you've worked with the names a bit, do you feel like they make sense? Would you change them?

@llvm.call.alloc is weird because it's not allocating anything, the allocation happens in @llvm.call.setup. Maybe @llvm.call.arg is better?

I think when I came up with this terminology, I was thinking that alloc would actually do the allocation, so there would be a staircase of ESP adjustments. However, it's true, in the current implementation, alloc is a misnomer, and we might never implement SP staircasing (I don't have a better name for this code pattern...).

Consider the code MSVC generates for a call like this:
https://gcc.godbolt.org/z/TeLRRa

struct Foo { Foo(int x); Foo(const Foo &o); ~Foo(); int x, y; };
void callee(int, Foo, int, Foo, int, Foo);
void bar() {
  callee(1, Foo(2), 3, Foo(4), 5, Foo(6));
}

Each ESP adjustment happens immediately before the argument is evaluated. It lets the compiler pass scalar memory arguments with the PUSH instruction, which is good for code size, and I hear performance, although I have no first hand evidence of this.

In any case, I think we should use a name that makes sense for either implementation strategy.

Should we double down on the preallocated terminology? llvm.call.preallocated.arg? The number arguments make more sense this way, they refer to the N'th "preallocated" argument. The pattern would be:

%cs = call token @llvm.call.setup(i32 3)
%x = call i8* @llvm.call.preallocated.arg(token %cs, i32 0)
%y = call i8* @llvm.call.preallocated.arg(token %cs, i32 1)
%z = call i8* @llvm.call.preallocated.arg(token %cs, i32 2)
...

(I suppose that if we ever wanted to do staircasing, we would have to require that the calls be evaluated right to left.)

In any case, I think we should use a name that makes sense for either implementation strategy.

Should we double down on the preallocated terminology? llvm.call.preallocated.arg? The number arguments make more sense this way, they refer to the N'th "preallocated" argument. The pattern would be:
%cs = call token @llvm.call.setup(i32 3)
%x = call i8* @llvm.call.preallocated.arg(token %cs, i32 0)
%y = call i8* @llvm.call.preallocated.arg(token %cs, i32 1)
%z = call i8* @llvm.call.preallocated.arg(token %cs, i32 2)
...
(I suppose that if we ever wanted to do staircasing, we would have to require that the calls be evaluated right to left.)

I like adding preallocated to the intrinsic name, but I think we should do it for all the intrinsics, especially since the @llvm.call prefix seems fairly generic.
Do @llvm.call.preallocated.setup, @llvm.call.preallocated.arg, and @llvm.call.preallocated.teardown sound good?

In D74651#1985503, @efriedma wrote:

We still need a LangRef patch at some point.

I suspect we might end up tweaking the signature of llvm.call.setup, based on the llvm-dev discussion, but we can experiment in-tree, I guess.

Does the LangRef patch typically come before, after, or in the same commit? After doing some experimenting?

In D74651#1987254, @aeubanks wrote:

I like adding preallocated to the intrinsic name, but I think we should do it for all the intrinsics, especially since the @llvm.call prefix seems fairly generic.
Do @llvm.call.preallocated.setup, @llvm.call.preallocated.arg, and @llvm.call.preallocated.teardown sound good?

I like that nomenclature. The the whole feature can be referred to as "calls with preallocated argument memory" or "preallocated call sites", and it sounds like grammatically correct English.

In D74651#1985503, @efriedma wrote:

We still need a LangRef patch at some point.

I suspect we might end up tweaking the signature of llvm.call.setup, based on the llvm-dev discussion, but we can experiment in-tree, I guess.

Does the LangRef patch typically come before, after, or in the same commit? After doing some experimenting?

I guess in this case it would be best to put together a LangRef patch to land before this one, in case anyone wants to look at it alone, without the boilerplate.

The main thing we need in LangRef is to document the verifier rules that were added, and the rules for how to use the intrinsics without causing UB. The intrinsics depend on some unobservable state (ESP), and have to be called in a particular order to work at runtime.

In D74651#1987364, @rnk wrote:

In D74651#1987254, @aeubanks wrote:

Does the LangRef patch typically come before, after, or in the same commit? After doing some experimenting?

I guess in this case it would be best to put together a LangRef patch to land before this one, in case anyone wants to look at it alone, without the boilerplate.

A nice feature of landing LangRef changes coincidentally with IR changes is having the documentation at https://www.llvm.org/docs/LangRef.html match the code in https://github.com/llvm/llvm-project. But if there's a good reason to split them up I agree with @rnk that "docs before" is the right order.

Add LangRef
Update some failing tests
Rename "callsetup" operand bundle to "preallocated"
Rename intrinsics to "llvm.call.preallocated.{setup,arg}"
Remove teardown intrinsic (will come later)

In D74651#1987461, @dexonsmith wrote:

In D74651#1987364, @rnk wrote:

In D74651#1987254, @aeubanks wrote:

Does the LangRef patch typically come before, after, or in the same commit? After doing some experimenting?

I guess in this case it would be best to put together a LangRef patch to land before this one, in case anyone wants to look at it alone, without the boilerplate.

A nice feature of landing LangRef changes coincidentally with IR changes is having the documentation at https://www.llvm.org/docs/LangRef.html match the code in https://github.com/llvm/llvm-project. But if there's a good reason to split them up I agree with @rnk that "docs before" is the right order.

I added the LangRef change to this change, it shouldn't be too hard to just look at the LangRef changes.
(also, how do you look at the rendered version before submitting?)

In D74651#1987364, @rnk wrote:

In D74651#1987254, @aeubanks wrote:

I like adding preallocated to the intrinsic name, but I think we should do it for all the intrinsics, especially since the @llvm.call prefix seems fairly generic.
Do @llvm.call.preallocated.setup, @llvm.call.preallocated.arg, and @llvm.call.preallocated.teardown sound good?

I like that nomenclature. The the whole feature can be referred to as "calls with preallocated argument memory" or "preallocated call sites", and it sounds like grammatically correct English.

I renamed the "callsetup" operand bundle to "preallocated" because of this, lmk if that's too confusing since the attribute is also called "preallocated".

Harbormaster failed remote builds in B53790: Diff 258434!Apr 17 2020, 3:11 PM

efriedma added inline comments.Apr 17 2020, 3:24 PM

llvm/docs/LangRef.rst
1081	Are there any rules for what value has to be passed to a "preallocated" argument? Or is the value ignored?
11954	Probably worth mentioning the rules for nested llvm.call.preallocated.setup .
11978	Maybe worth mentioning what happens if you call llvm.call.preallocated.arg after the memory is deallocated?
llvm/lib/IR/Verifier.cpp
4519	"Prelalocated"?
4542	Do you need to check that there aren't any calls to llvm.call.preallocated.arg with the wrong token?

Rebase
Address code review comments

aeubanks added inline comments.Apr 20 2020, 2:32 PM

llvm/docs/LangRef.rst
1081	It's ignored, clarified.
11954	Added blurb about how t1 = setup() t2 = setup() foo() [t2] foo() [t1] is ok but not t1 = setup() t2 = setup() foo() [t1] foo() [t2], is that good enough?
11978	Said it's UB if you call llvm.call.preallocated.arg after the call, or after another llvm.call.preallocated.setup.
llvm/lib/IR/Verifier.cpp
4542	Good catch, I missed that. Done and added test case.

aeubanks marked an inline comment as done.Apr 20 2020, 2:33 PM

aeubanks added inline comments.

llvm/lib/IR/Verifier.cpp
4519	Done.

Harbormaster completed remote builds in B54008: Diff 258844.Apr 20 2020, 2:39 PM

efriedma added inline comments.Apr 20 2020, 3:36 PM

llvm/docs/LangRef.rst
1081	For the purpose of actually generating code, saying it's ignored is fine. It might be inconvenient for IPO transforms/analysis, though: any transform that examines a preallocated argument would have to explicitly blacklist optimizations on the argument. It might make sense to require that the argument has the "correct" value, even if code generation won't use it, for the sake of optimizations.
llvm/lib/IR/Verifier.cpp
1738	Missing comma

Address code review comments

aeubanks marked an inline comment as done.Apr 20 2020, 5:02 PM

aeubanks added inline comments.

llvm/docs/LangRef.rst
1081	Done, PTAL at new wording.
llvm/lib/IR/Verifier.cpp
1738	Done.

Harbormaster completed remote builds in B54022: Diff 258868.Apr 20 2020, 5:22 PM

LGTM. Please give Reid a chance to comment before you merge, though.

llvm/docs/LangRef.rst
1081	New wording seems fine.

This revision is now accepted and ready to land.Apr 20 2020, 5:25 PM

Require preallocated call site attribute on llvm.call.preallocated.arg

Reid and I talked a bit and decided that this hadn't handled the case of when there's a setup without a call (e.g. via DCE). The stack adjustment will happen, but without the call the stack won't be cleaned up. One solution is turn the calls to llvm.call.preallocated.arg into allocas. But we need to know the type of the argument, and without the call we don't have that info. One solution is to add a call site attribute to llvm.call.preallocated.arg with the type. I reused the preallocated attribute for this purpose.
One downside of this approach is that alloca returns a pointer of the proper type, but llvm.call.preallocated.arg returns i8*. And we likely are casting the i8* to the actual pointer type we want, so there will likely be two unnecessary casts when cleaning up setups without calls. But maybe that doesn't matter so much.

Thoughts on this approach?

Harbormaster failed remote builds in B54319: Diff 259411!Apr 22 2020, 3:48 PM

aeubanks retitled this revision from Add IR constructs for inalloca replacement llvm.call.setup to Add IR constructs for inalloca replacement preallocated call setup.Apr 22 2020, 3:53 PM

Your proposed solution to finding the correct type of an allocation with a corresponding call seems fine.

Not sure about rewrite llvm.call.preallocated.arg to an alloca. Not sure you'd actually want to explicitly rewrite it; probably simpler to come up with some fake stack layout and go through the normal lowering.

On a related note, I just realized there isn't any discussion of alignment in the documentation. Have you thought about how the alignment of preallocated arguments is specified?

Thanks for reviewing Eli! I had some wording suggestions and syntax nits worth reuploading for.

llvm/docs/LangRef.rst
1978–1984	I think this wording could be improved. I made a draft, didn't like it, and threw it away. :( I think "actual argument" is vague. I tried to say something like, "The type used on this attribute has to match the type used by the attribute of the corresponding argument at the final call site." I think it might be better to take the rest of this documentation and move it to the description of the intrinsic, and then refer to it here, after documenting the requirement on the attribute type. I'd replace the fragment "which would result in the stack being adjusted..." with something about the fact that, without a call site, we can't do any stack adjustments at all, since we don't know the argument types, and therefore don't know how much to adjust the stack by. From the standpoint of an optimizer, `llvm.call.preallocated.arg` is a special alloca: it allocates stack memory with some type. This attribute carries that type. If there is no corresponding call site, then it is legal and desirable to transform these intrinsics to plain static allocas using the type on this attribute.
2203	I guess I wrote the words on "on Win32", but perhaps we should make it more vague and say "... compatible with MSVC on some targets." I think this call setup stuff is also necessary on arm32/thumb.
2207	I think .rst files need double backticks, or it's a link, not monospace text.
2216	This should take the token as the first argument. I guess now it should also use the new call site attribute as well.
11961	You have to add `call token` between `= @` here to make it legal-ish IR
11963	You should close the bundle name string in the example, `"preallocated"(token...`
11985	We discussed the need to add the argument size here

In D74651#1998063, @efriedma wrote:

Your proposed solution to finding the correct type of an allocation with a corresponding call seems fine.

Not sure about rewrite llvm.call.preallocated.arg to an alloca. Not sure you'd actually want to explicitly rewrite it; probably simpler to come up with some fake stack layout and go through the normal lowering.

Based on what I know of the codegen design, that is feasible, but it will be discovered post-isel, which is pretty late. During instruction finalizations, the DenseMap entry for the call site will be missing. We can make a stack object of appropriate size and alignment at that point, but it's much less convenient than assuming we've taken care of it by CGP.

We already have to write an IR transform utility to rewrite these intrinsics to allocas, at the very least for the inliner. Functionattrs (or the new Attributor) should remove this preallocated attributes when possible, so that pass will want this utility. And, if a call site gets eliminated because it is unreachable, we'll want to call this utility from standard cleanup passes like instcombine. Once we have the utility, we might as well just call it from CGP and rely on it later, instead of implementing it a second time.

On a related note, I just realized there isn't any discussion of alignment in the documentation. Have you thought about how the alignment of preallocated arguments is specified?

The alignment can be surprising. For example, MSVC does not align double arguments or argument structs containing double, despite alignof(double) == 8. MSVC will align vectors and structs that carry __declspec(align) or alignas attributes, but in those cases, it passes the argument by address using a regular alloca.

Is it possible to put an align attribute on a return value? If so, we could put it on the return value at the call site. However, for our purposes, it would always be 4. That's never enough to do any interesting transforms, so it's almost as good as leaving it unspecified.

LangRef changes from code review

aeubanks added inline comments.Apr 22 2020, 11:35 PM

llvm/docs/LangRef.rst
11985	I thought we didn't need it since now we have the type as part of the preallocated call site attribute.

Harbormaster failed remote builds in B54361: Diff 259488!Apr 23 2020, 12:29 AM

Overlooked a comment about wording the reasoning behind adding the call site attribute

Harbormaster failed remote builds in B54413: Diff 259596!Apr 23 2020, 10:16 AM

lgtm, I think this is ready to land.

llvm/docs/LangRef.rst
11985	Sorry, that was a stale comment.

We already have to write an IR transform utility to rewrite these intrinsics to allocas, at the very least for the inliner. Functionattrs (or the new Attributor) should remove this preallocated attributes when possible, so that pass will want this utility. And, if a call site gets eliminated because it is unreachable, we'll want to call this utility from standard cleanup passes like instcombine. Once we have the utility, we might as well just call it from CGP and rely on it later, instead of implementing it a second time.

I edited my comment a couple of times and I think I dropped one important bit.

I'm not sure it's safe to rewrite an call_preallocated_setup to an alloca in the entry block, in general. There is no rule which prevents multiple calls to the same call_preallocated_setup in a loop, without deallocating the memory in between. Because of that, reusing the memory could lead to a miscompile. There wouldn't be any way to deallocate the result, but that isn't necessarily a problem. You could prove this doesn't happen in common cases, and clang wouldn't generate code like that. But LLVM optimizations could introduce it, I think, if there isn't a rule specifically preventing it.

Is it possible to put an align attribute on a return value? If so, we could put it on the return value at the call site. However, for our purposes, it would always be 4. That's never enough to do any interesting transforms, so it's almost as good as leaving it unspecified.

Saying that the alignment of preallocated arguments is always 4 without any way to tweak it seems a little obnoxious, if we ever want to use the intrinsics for any other purpose. I guess it works, though.

Is it possible to put an align attribute on a return value? If so, we could put it on the return value at the call site. However, for our purposes, it would always be 4. That's never enough to do any interesting transforms, so it's almost as good as leaving it unspecified.

Yes, you can put align on a return value. Probably more important would be putting align on the call arguments itself.

Add blurb about alignment

I copied the blurb from byval about align to preallocated, since they should share the same codepath.

Harbormaster failed remote builds in B54471: Diff 259681!Apr 23 2020, 2:08 PM

In D74651#1999995, @efriedma wrote:

I'm not sure it's safe to rewrite an call_preallocated_setup to an alloca in the entry block, in general. There is no rule which prevents multiple calls to the same call_preallocated_setup in a loop, without deallocating the memory in between. Because of that, reusing the memory could lead to a miscompile. There wouldn't be any way to deallocate the result, but that isn't necessarily a problem. You could prove this doesn't happen in common cases, and clang wouldn't generate code like that. But LLVM optimizations could introduce it, I think, if there isn't a rule specifically preventing it.

I think if we model calls with preallocated bundles as modifying inaccessible state (i.e. the stack pointer), LLVM transforms won't do this.

I'm imagining some kind of loop result code sinking transform that wants to sink a readonly function call out of a loop, because the result is only used after the last iteration.

I think if we model calls with preallocated bundles as modifying inaccessible state (i.e. the stack pointer), LLVM transforms won't do this.

I don't think that's enough in general. Well, maybe it's enough for the transforms we actually implement... LLVM currently has very few transforms that fold code together.

But a simple example, suppose we had a transform that took void g(); void f() { g();g();g();g();g(); } and transformed it into something like void g(); void f() { for (int i =0; i < n; ++i) g(); }. Looks fine generally. If the loop body is a sequence containing calls to llvm.call.preallocated.setup()/llvm.call.preallocated.arg(), maybe not so fine. (I think this is only an issue if there is no call actually consuming the stack; if there were, the token would block the transform.)

rnk mentioned this in D78659: Add nomerge function attribute to supress tail merge optimization in simplifyCFG.Apr 24 2020, 6:19 PM

In D74651#2000661, @efriedma wrote:

I think if we model calls with preallocated bundles as modifying inaccessible state (i.e. the stack pointer), LLVM transforms won't do this.

I don't think that's enough in general. Well, maybe it's enough for the transforms we actually implement... LLVM currently has very few transforms that fold code together.

But a simple example, suppose we had a transform that took void g(); void f() { g();g();g();g();g(); } and transformed it into something like void g(); void f() { for (int i =0; i < n; ++i) g(); }. Looks fine generally. If the loop body is a sequence containing calls to llvm.call.preallocated.setup()/llvm.call.preallocated.arg(), maybe not so fine. (I think this is only an issue if there is no call actually consuming the stack; if there were, the token would block the transform.)

What about if we directly replace the llvm.call.preallocated.arg() with an alloca (as opposed to an alloca in the entry block)?

What about if we directly replace the llvm.call.preallocated.arg() with an alloca (as opposed to an alloca in the entry block)?

There can be more than one llvm.call.preallocated.arg referring to the same argument, so probably you'd want to insert the alloca at the point of the llvm.call.preallocated.setup. But yes, that should work.

Update LangRef about replacing setup/arg with alloca/gep
Rebase
Rename call_setup*.ll -> preallocated*.ll
Preallocated EnumAttr -> TypeAttr

In D74651#2005985, @efriedma wrote:

What about if we directly replace the llvm.call.preallocated.arg() with an alloca (as opposed to an alloca in the entry block)?

There can be more than one llvm.call.preallocated.arg referring to the same argument, so probably you'd want to insert the alloca at the point of the llvm.call.preallocated.setup. But yes, that should work.

We could go down the route of doing something similar to inalloca where the llvm.call.preallocated.setup becomes an alloca of a structure containing all the arguments, and the llvm.call.preallocated.arg becomes a gep of the alloca. I've changed the LangRef to say that. I'm not sure if I'm overspecifying in the LangRef though, maybe that should be an implementation detail?

LGTM

I've changed the LangRef to say that. I'm not sure if I'm overspecifying in the LangRef though, maybe that should be an implementation detail?

LangRef can't require that lowering be implemented some specific way; that would break the as-if rule. The important bit is that it can be implemented that way, and that might be the simplest way to explain the point.

But maybe we don't actually need to include that sentence in LangRef at all; the fact that it's legal to lower that way should be implied from the other rules. And if it isn't implied, it's probably better to clarify the other rules.

Harbormaster failed remote builds in B54868: Diff 260450!Apr 27 2020, 3:07 PM

In D74651#2006367, @efriedma wrote:

I've changed the LangRef to say that. I'm not sure if I'm overspecifying in the LangRef though, maybe that should be an implementation detail?

LangRef can't require that lowering be implemented some specific way; that would break the as-if rule. The important bit is that it can be implemented that way, and that might be the simplest way to explain the point.

But maybe we don't actually need to include that sentence in LangRef at all; the fact that it's legal to lower that way should be implied from the other rules. And if it isn't implied, it's probably better to clarify the other rules.

I think it is derivable from the other rules. I've removed those lines (and we can experiment later with where to put the alloca).

Thanks for the reviews!

Remove implementation detail lines

I think in the end, the description of how this is lowered was left out, so this looks good and we can commit it as is.

However, I'd like to ensure that it is valid to replace preallocated call sites with static allocas (allocas in the entry block).

In D74651#2000661, @efriedma wrote:

I think if we model calls with preallocated bundles as modifying inaccessible state (i.e. the stack pointer), LLVM transforms won't do this.

I don't think that's enough in general. Well, maybe it's enough for the transforms we actually implement... LLVM currently has very few transforms that fold code together.

But a simple example, suppose we had a transform that took void g(); void f() { g();g();g();g();g(); } and transformed it into something like void g(); void f() { for (int i =0; i < n; ++i) g(); }. Looks fine generally. If the loop body is a sequence containing calls to llvm.call.preallocated.setup()/llvm.call.preallocated.arg(), maybe not so fine. (I think this is only an issue if there is no call actually consuming the stack; if there were, the token would block the transform.)

I don't think I understand this example. This looks like loop re-rolling to me, though, to give the transform a name.

If g() uses a preallocated call site, the loop re-roller would have to execute the exact same sequence of preallocated call intrinsics as it would in the unrolled form. That seems like it's all fine.

What about this transform rearranges the call site intrinsics to make them overlap, so that we would need two argument memory locations alive at the same time?

Harbormaster failed remote builds in B54896: Diff 260490!Apr 27 2020, 4:44 PM

Closed by commit rG3b0450acecb6: Add IR constructs for preallocated (inalloca replacement) (authored by aeubanks). · Explain WhyApr 27 2020, 4:44 PM

This revision was automatically updated to reflect the committed changes.

This looks like loop re-rolling to me, though, to give the transform a name.

Yes, that's the idea. (LLVM has a reroll pass, but it only handles rerolling loops.)

I guess I'll try to construct an actual C++ example. Suppose you have something like this:

extern "C" int printf(const char*, ...);
extern "C" void abort(void) __attribute((noreturn));
struct A;
A* ptrs[10];
struct A {
    int z;
    __attribute((noinline))
    A(int x) { z = 1; ptrs[z] = this; }
    A(const A&);
};
    __attribute((noinline))
void sum() { int sum = 0; for(A* p : ptrs) sum += p->z; printf("%d\n", sum); }
int g(int, A);
void f() { g(g(g(g(g((sum(), abort(), 6), A(0)), A(1)), A(2)), A(3)), A(4)); }

On 32-bit Windows, the IR looks something like this:

%1 = alloca inalloca <{ i32, %struct.A }>, align 4
%2 = getelementptr inbounds <{ i32, %struct.A }>, <{ i32, %struct.A }>* %1, i32 0, i32 1
%3 = call x86_thiscallcc %struct.A* @"??0A@@QAE@H@Z"(%struct.A* nonnull %2, i32 4)
%4 = alloca inalloca <{ i32, %struct.A }>, align 4
%5 = getelementptr inbounds <{ i32, %struct.A }>, <{ i32, %struct.A }>* %4, i32 0, i32 1
%6 = call x86_thiscallcc %struct.A* @"??0A@@QAE@H@Z"(%struct.A* nonnull %5, i32 3)
%7 = alloca inalloca <{ i32, %struct.A }>, align 4
%8 = getelementptr inbounds <{ i32, %struct.A }>, <{ i32, %struct.A }>* %7, i32 0, i32 1
%9 = call x86_thiscallcc %struct.A* @"??0A@@QAE@H@Z"(%struct.A* nonnull %8, i32 2)
%10 = alloca inalloca <{ i32, %struct.A }>, align 4
%11 = getelementptr inbounds <{ i32, %struct.A }>, <{ i32, %struct.A }>* %10, i32 0, i32 1
%12 = call x86_thiscallcc %struct.A* @"??0A@@QAE@H@Z"(%struct.A* nonnull %11, i32 1)
%13 = alloca inalloca <{ i32, %struct.A }>, align 4
%14 = getelementptr inbounds <{ i32, %struct.A }>, <{ i32, %struct.A }>* %13, i32 0, i32 1
%15 = call x86_thiscallcc %struct.A* @"??0A@@QAE@H@Z"(%struct.A* nonnull %14, i32 0)
call void @"?sum@@YAXXZ"()
call void @abort() #5
unreachable

Notice the repeated alloca/gep/call pattern. In the new world, the "alloca" will be replaced with preallocated.setup/preallocated.arg. Now say your reroller decides to reroll that. What's the result of "sum()"?

I see, the noreturn part of this is key here. I guess we solved this problem for Windows exception handling by threading the token through all the call instructions, so we can still color EH funclet regions in cases like this:

void maythrow();
void f() {
  try {
    maythrow();
  } catch(...) {
    throw;
  }
}

Funclet bundles have been an ongoing maintenance burden that I would like to ease at some point, so I don't want to use that solution here.

So far we've identified two kinds of transforms that should be semantics preserving according to the rules in LangRef, but present problems with our planned lowering. Neither transform exists in LLVM today:

Tail merging blocks ending in unreachable where some blocks are "inside" a preallocated call region and some are outside.
Rolling up loops of intrinsics that establish half-open preallocated call regions. The lack of the "region close" marker in the IR is what makes this transform possible.

Similar to the case of SEH try, I think we can really only solve this with a first class scope/region concept. In the meantime, I think we should move ahead with the implementation, and try to come up with a scope/region design that works for EH, call setup, SEH __try, and perhaps stack variable lifetime. I have a draft doc with a problem statement, but I don't have a good solution yet.

Similar to the case of SEH try, I think we can really only solve this with a first class scope/region concept. In the meantime, I think we should move ahead with the implementation, and try to come up with a scope/region design that works for EH, call setup, SEH __try, and perhaps stack variable lifetime. I have a draft doc with a problem statement, but I don't have a good solution yet.

Okay, that makes sense. (I still don't really see why we'd want this for variable lifetimes. It isn't fundamentally problematic if variable lifetimes don't properly nest.)

Revision Contents

Path

Size

llvm/

docs/

LangRef.rst

137 lines

include/

llvm/

Bitcode/

LLVMBitCodes.h

1 line

IR/

9 lines

3 lines

3 lines

4 lines

lib/

AsmParser/

1 line

1 line

40 lines

1 line

Bitcode/

Reader/

BitcodeReader.cpp

20 lines

Writer/

BitcodeWriter.cpp

2 lines

IR/

10 lines

1 line

72 lines

5 lines

129 lines

Transforms/

Utils/

CodeExtractor.cpp

1 line

test/

Assembler/

invalid-byval-type3.ll

2 lines

Bitcode/

attributes.ll

8 lines

operand-bundles-bc-analyzer.ll

1 line

Verifier/

preallocated-invalid.ll

118 lines

preallocated-valid.ll

32 lines

Diff 260498

llvm/docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,050 Lines • ▼ Show 20 Lines	``byval`` or ``byval(<ty>)``
The byval attribute also supports an optional type argument, which must be		The byval attribute also supports an optional type argument, which must be
the same as the pointee type of the argument.		the same as the pointee type of the argument.

The byval attribute also supports specifying an alignment with the		The byval attribute also supports specifying an alignment with the
align attribute. It indicates the alignment of the stack slot to		align attribute. It indicates the alignment of the stack slot to
form and the known alignment of the pointer specified to the call		form and the known alignment of the pointer specified to the call
site. If the alignment is not specified, then the code generator		site. If the alignment is not specified, then the code generator
makes a target-specific assumption.		makes a target-specific assumption.
		``preallocated(<ty>)``
		This indicates that the pointer parameter should really be passed by
		value to the function, and that the pointer parameter's pointee has
		already been initialized before the call instruction. This attribute
		is only valid on LLVM pointer arguments. The argument must be the value
		returned by the appropriate
		:ref:`llvm.call.preallocated.arg<int_call_preallocated_arg>`, although is
		ignored during codegen.

		Any function call with a ``preallocated`` attribute in any parameter
		must have a ``"preallocated"`` operand bundle.

		The preallocated attribute requires a type argument, which must be
		the same as the pointee type of the argument.

		The preallocated attribute also supports specifying an alignment with the
		align attribute. It indicates the alignment of the stack slot to
		form and the known alignment of the pointer specified to the call
		site. If the alignment is not specified, then the code generator
		makes a target-specific assumption.

.. _attr_inalloca:		.. _attr_inalloca:

		efriedmaUnsubmitted Done Reply Inline Actions Are there any rules for what value has to be passed to a "preallocated" argument? Or is the value ignored? efriedma: Are there any rules for what value has to be passed to a "preallocated" argument? Or is the…
		aeubanksAuthorUnsubmitted Done Reply Inline Actions It's ignored, clarified. aeubanks: It's ignored, clarified.
		efriedmaUnsubmitted Not Done Reply Inline Actions For the purpose of actually generating code, saying it's ignored is fine. It might be inconvenient for IPO transforms/analysis, though: any transform that examines a preallocated argument would have to explicitly blacklist optimizations on the argument. It might make sense to require that the argument has the "correct" value, even if code generation won't use it, for the sake of optimizations. efriedma: For the purpose of actually generating code, saying it's ignored is fine. It might be…
		aeubanksAuthorUnsubmitted Done Reply Inline Actions Done, PTAL at new wording. aeubanks: Done, PTAL at new wording.
		efriedmaUnsubmitted Not Done Reply Inline Actions New wording seems fine. efriedma: New wording seems fine.
``inalloca``		``inalloca``

The ``inalloca`` argument attribute allows the caller to take the		The ``inalloca`` argument attribute allows the caller to take the
address of outgoing stack arguments. An ``inalloca`` argument must		address of outgoing stack arguments. An ``inalloca`` argument must
be a pointer to stack memory produced by an ``alloca`` instruction.		be a pointer to stack memory produced by an ``alloca`` instruction.
The alloca, or argument allocation, must also be tagged with the		The alloca, or argument allocation, must also be tagged with the
inalloca keyword. Only the last argument may have the ``inalloca``		inalloca keyword. Only the last argument may have the ``inalloca``
attribute, and that argument is guaranteed to be passed in memory.		attribute, and that argument is guaranteed to be passed in memory.
▲ Show 20 Lines • Show All 878 Lines • ▼ Show 20 Lines	the remaining tokens can have the following values:::
\| Ls <pos> -> runtime linear with val modifier		\| Ls <pos> -> runtime linear with val modifier
\| Us <pos> -> runtime linear with uval modifier		\| Us <pos> -> runtime linear with uval modifier
\| u -> uniform		\| u -> uniform

<scalar_name>:= name of the scalar function		<scalar_name>:= name of the scalar function

<vector_redirection>:= optional, custom name of the vector function		<vector_redirection>:= optional, custom name of the vector function

		``preallocated(<ty>)``
		This attribute is required on calls to ``llvm.call.preallocated.arg``
		and cannot be used on any other call. See
		:ref:`llvm.call.preallocated.arg<int_call_preallocated_arg>` for more
		details.

.. _glattrs:		.. _glattrs:

Global Attributes		Global Attributes
		rnkUnsubmitted Not Done Reply Inline Actions I think this wording could be improved. I made a draft, didn't like it, and threw it away. :( I think "actual argument" is vague. I tried to say something like, "The type used on this attribute has to match the type used by the attribute of the corresponding argument at the final call site." I think it might be better to take the rest of this documentation and move it to the description of the intrinsic, and then refer to it here, after documenting the requirement on the attribute type. I'd replace the fragment "which would result in the stack being adjusted..." with something about the fact that, without a call site, we can't do any stack adjustments at all, since we don't know the argument types, and therefore don't know how much to adjust the stack by. From the standpoint of an optimizer, `llvm.call.preallocated.arg` is a special alloca: it allocates stack memory with some type. This attribute carries that type. If there is no corresponding call site, then it is legal and desirable to transform these intrinsics to plain static allocas using the type on this attribute. rnk: I think this wording could be improved. I made a draft, didn't like it, and threw it away.
-----------------		-----------------

Attributes may be set to communicate additional information about a global variable.		Attributes may be set to communicate additional information about a global variable.
Unlike :ref:`function attributes <fnattrs>`, attributes on a global variable		Unlike :ref:`function attributes <fnattrs>`, attributes on a global variable
are grouped into a single :ref:`attribute group <attrgrp>`.		are grouped into a single :ref:`attribute group <attrgrp>`.

.. _opbundles:		.. _opbundles:

▲ Show 20 Lines • Show All 193 Lines • ▼ Show 20 Lines	* Attributes that can be expressed via operand bundles are directly the
operand bundles removes the need for an instruction sequence that represents		operand bundles removes the need for an instruction sequence that represents
the property (e.g., `icmp ne i32* %p, null` for `nonnull`) and for the		the property (e.g., `icmp ne i32* %p, null` for `nonnull`) and for the
optimizer to deduce the property from that instruction sequence.		optimizer to deduce the property from that instruction sequence.
* Expressing the property using operand bundles makes it easy to identify the		* Expressing the property using operand bundles makes it easy to identify the
use of the value as a use in an :ref:`llvm.assume <int_assume>`. This then		use of the value as a use in an :ref:`llvm.assume <int_assume>`. This then
simplifies and improves heuristics, e.g., for use "use-sensitive"		simplifies and improves heuristics, e.g., for use "use-sensitive"
optimizations.		optimizations.

		.. _ob_preallocated:

		Preallocated Operand Bundles
		^^^^^^^^^^^^^^^^^^^^^^^^^^^^

		Preallocated operand bundles are characterized by the ``"preallocated"``
		operand bundle tag. These operand bundles allow separation of the allocation
		of the call argument memory from the call site. This is necessary to pass
		non-trivially copyable objects by value in a way that is compatible with MSVC
		on some targets. There can be at most one ``"preallocated"`` operand bundle
		rnkUnsubmitted Done Reply Inline Actions I guess I wrote the words on "on Win32", but perhaps we should make it more vague and say "... compatible with MSVC on some targets." I think this call setup stuff is also necessary on arm32/thumb. rnk: I guess I wrote the words on "on Win32", but perhaps we should make it more vague and say "...
		attached to a call site and it must have exactly one bundle operand, which is
		a token generated by ``@llvm.call.preallocated.setup``. A call with this
		operand bundle should not adjust the stack before entering the function, as
		that will have been done by one of the ``@llvm.call.preallocated.*`` intrinsics.
		rnkUnsubmitted Done Reply Inline Actions I think .rst files need double backticks, or it's a link, not monospace text. rnk: I think .rst files need double backticks, or it's a link, not monospace text.

		.. code-block:: llvm

		%foo = type { i64, i32 }

		...

		%t = call token @llvm.call.preallocated.setup(i32 1)
		%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%foo)
		rnkUnsubmitted Done Reply Inline Actions This should take the token as the first argument. I guess now it should also use the new call site attribute as well. rnk: This should take the token as the first argument. I guess now it should also use the new call…
		%b = bitcast i8* %a to %foo*
		; initialize %b
		call void @bar(i32 42, %foo* preallocated(%foo) %b) ["preallocated"(token %t)]

.. _moduleasm:		.. _moduleasm:

Module-Level Inline Assembly		Module-Level Inline Assembly
----------------------------		----------------------------

Modules may contain "module-level inline asm" blocks, which corresponds		Modules may contain "module-level inline asm" blocks, which corresponds
to the GCC "file scope inline asm" blocks. These blocks are internally		to the GCC "file scope inline asm" blocks. These blocks are internally
concatenated by LLVM and treated as a single unit, but may be separated		concatenated by LLVM and treated as a single unit, but may be separated
▲ Show 20 Lines • Show All 9,693 Lines • ▼ Show 20 Lines
The '``llvm.thread.pointer``' intrinsic returns a pointer to the TLS area		The '``llvm.thread.pointer``' intrinsic returns a pointer to the TLS area
for the current thread. The exact semantics of this value are target		for the current thread. The exact semantics of this value are target
specific: it may point to the start of TLS area, to the end, or somewhere		specific: it may point to the start of TLS area, to the end, or somewhere
in the middle. Depending on the target, this intrinsic may read a register,		in the middle. Depending on the target, this intrinsic may read a register,
call a helper function, read from an alternate memory space, or perform		call a helper function, read from an alternate memory space, or perform
other operations necessary to locate the TLS area. Not all targets support		other operations necessary to locate the TLS area. Not all targets support
this intrinsic.		this intrinsic.

		'``llvm.call.preallocated.setup``' Intrinsic
		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

		Syntax:
		"""""""

		::

		declare token @llvm.call.preallocated.setup(i32 %num_args)

		Overview:
		"""""""""

		The '``llvm.call.preallocated.setup``' intrinsic returns a token which can
		be used with a call's ``"preallocated"`` operand bundle to indicate that
		certain arguments are allocated and initialized before the call.

		Semantics:
		""""""""""

		The '``llvm.call.preallocated.setup``' intrinsic returns a token which is
		associated with at most one call. The token can be passed to
		'``@llvm.call.preallocated.arg``' to get a pointer to get that
		corresponding argument. The token must be the parameter to a
		``"preallocated"`` operand bundle for the corresponding call.
		efriedmaUnsubmitted Not Done Reply Inline Actions Probably worth mentioning the rules for nested llvm.call.preallocated.setup . efriedma: Probably worth mentioning the rules for nested llvm.call.preallocated.setup .
		aeubanksAuthorUnsubmitted Done Reply Inline Actions Added blurb about how t1 = setup() t2 = setup() foo() [t2] foo() [t1] is ok but not t1 = setup() t2 = setup() foo() [t1] foo() [t2], is that good enough? aeubanks: Added blurb about how t1 = setup() t2 = setup() foo() [t2] foo() [t1] is ok but not t1 =…

		Nested calls to '``llvm.call.preallocated.setup``' are allowed, but must
		be properly nested. e.g.

		:: code-block:: llvm

		%t1 = call token @llvm.call.preallocated.setup(i32 0)
		rnkUnsubmitted Done Reply Inline Actions You have to add `call token` between `= @` here to make it legal-ish IR rnk: You have to add `call token` between `= @` here to make it legal-ish IR
		%t2 = call token @llvm.call.preallocated.setup(i32 0)
		call void foo() ["preallocated"(token %t2)]
		rnkUnsubmitted Done Reply Inline Actions You should close the bundle name string in the example, `"preallocated"(token...` rnk: You should close the bundle name string in the example, `"preallocated"(token...`
		call void foo() ["preallocated"(token %t1)]

		is allowed, but not

		:: code-block:: llvm

		%t1 = call token @llvm.call.preallocated.setup(i32 0)
		%t2 = call token @llvm.call.preallocated.setup(i32 0)
		call void foo() ["preallocated"(token %t1)]
		call void foo() ["preallocated"(token %t2)]

		.. _int_call_preallocated_arg:

		'``llvm.call.preallocated.arg``' Intrinsic
		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
		efriedmaUnsubmitted Not Done Reply Inline Actions Maybe worth mentioning what happens if you call llvm.call.preallocated.arg after the memory is deallocated? efriedma: Maybe worth mentioning what happens if you call llvm.call.preallocated.arg after the memory is…
		aeubanksAuthorUnsubmitted Done Reply Inline Actions Said it's UB if you call llvm.call.preallocated.arg after the call, or after another llvm.call.preallocated.setup. aeubanks: Said it's UB if you call llvm.call.preallocated.arg after the call, or after another llvm.call.

		Syntax:
		"""""""

		::

		declare i8* @llvm.call.preallocated.arg(token %setup_token, i32 %arg_index)
		rnkUnsubmitted Not Done Reply Inline Actions We discussed the need to add the argument size here rnk: We discussed the need to add the argument size here
		aeubanksAuthorUnsubmitted Done Reply Inline Actions I thought we didn't need it since now we have the type as part of the preallocated call site attribute. aeubanks: I thought we didn't need it since now we have the type as part of the preallocated call site…
		rnkUnsubmitted Not Done Reply Inline Actions Sorry, that was a stale comment. rnk: Sorry, that was a stale comment.

		Overview:
		"""""""""

		The '``llvm.call.preallocated.arg``' intrinsic returns a pointer to the
		corresponding preallocated argument for the preallocated call.

		Semantics:
		""""""""""

		The '``llvm.call.preallocated.arg``' intrinsic returns a pointer to the
		``%arg_index``th argument with the ``preallocated`` attribute for
		the call associated with the ``%setup_token``, which must be from
		'``llvm.call.preallocated.setup``'.

		A call to '``llvm.call.preallocated.arg``' must have a call site
		``preallocated`` attribute. The type of the ``preallocated`` attribute must
		match the type used by the ``preallocated`` attribute of the corresponding
		argument at the preallocated call. The type is used in the case that an
		``llvm.call.preallocated.setup`` does not have a corresponding call (e.g. due
		to DCE), where otherwise we cannot know how large the arguments are.

		It is undefined behavior if this is called with a token from an
		'``llvm.call.preallocated.setup``' if another
		'``llvm.call.preallocated.setup``' has already been called or if the
		preallocated call corresponding to the '``llvm.call.preallocated.setup``'
		has already been called.

Standard C Library Intrinsics		Standard C Library Intrinsics
-----------------------------		-----------------------------

LLVM provides intrinsics for a few important standard C library		LLVM provides intrinsics for a few important standard C library
functions. These intrinsics allow source-language front-ends to pass		functions. These intrinsics allow source-language front-ends to pass
information about the alignment of the pointer arguments to the code		information about the alignment of the pointer arguments to the code
generator, providing opportunity for more efficient code generation.		generator, providing opportunity for more efficient code generation.

▲ Show 20 Lines • Show All 7,789 Lines • Show Last 20 Lines

llvm/include/llvm/Bitcode/LLVMBitCodes.h

Show First 20 Lines • Show All 627 Lines • ▼ Show 20 Lines	enum AttributeKindCodes {
ATTR_KIND_OPT_FOR_FUZZING = 57,		ATTR_KIND_OPT_FOR_FUZZING = 57,
ATTR_KIND_SHADOWCALLSTACK = 58,		ATTR_KIND_SHADOWCALLSTACK = 58,
ATTR_KIND_SPECULATIVE_LOAD_HARDENING = 59,		ATTR_KIND_SPECULATIVE_LOAD_HARDENING = 59,
ATTR_KIND_IMMARG = 60,		ATTR_KIND_IMMARG = 60,
ATTR_KIND_WILLRETURN = 61,		ATTR_KIND_WILLRETURN = 61,
ATTR_KIND_NOFREE = 62,		ATTR_KIND_NOFREE = 62,
ATTR_KIND_NOSYNC = 63,		ATTR_KIND_NOSYNC = 63,
ATTR_KIND_SANITIZE_MEMTAG = 64,		ATTR_KIND_SANITIZE_MEMTAG = 64,
		ATTR_KIND_PREALLOCATED = 65,
		rnkUnsubmitted Not Done Reply Inline Actions This set off my "is this used as the RHS of a left shift?" spidey sense, but I checked, and I think it's all good. rnk: This set off my "is this used as the RHS of a left shift?" spidey sense, but I checked, and I…
};		};

enum ComdatSelectionKindCodes {		enum ComdatSelectionKindCodes {
COMDAT_SELECTION_KIND_ANY = 1,		COMDAT_SELECTION_KIND_ANY = 1,
COMDAT_SELECTION_KIND_EXACT_MATCH = 2,		COMDAT_SELECTION_KIND_EXACT_MATCH = 2,
COMDAT_SELECTION_KIND_LARGEST = 3,		COMDAT_SELECTION_KIND_LARGEST = 3,
COMDAT_SELECTION_KIND_NO_DUPLICATES = 4,		COMDAT_SELECTION_KIND_NO_DUPLICATES = 4,
COMDAT_SELECTION_KIND_SAME_SIZE = 5,		COMDAT_SELECTION_KIND_SAME_SIZE = 5,
Show All 14 Lines

llvm/include/llvm/IR/Attributes.h

Show First 20 Lines • Show All 102 Lines • ▼ Show 20 Lines	public:
static Attribute getWithDereferenceableBytes(LLVMContext &Context,		static Attribute getWithDereferenceableBytes(LLVMContext &Context,
uint64_t Bytes);		uint64_t Bytes);
static Attribute getWithDereferenceableOrNullBytes(LLVMContext &Context,		static Attribute getWithDereferenceableOrNullBytes(LLVMContext &Context,
uint64_t Bytes);		uint64_t Bytes);
static Attribute getWithAllocSizeArgs(LLVMContext &Context,		static Attribute getWithAllocSizeArgs(LLVMContext &Context,
unsigned ElemSizeArg,		unsigned ElemSizeArg,
const Optional<unsigned> &NumElemsArg);		const Optional<unsigned> &NumElemsArg);
static Attribute getWithByValType(LLVMContext &Context, Type *Ty);		static Attribute getWithByValType(LLVMContext &Context, Type *Ty);
		static Attribute getWithPreallocatedType(LLVMContext &Context, Type *Ty);

static Attribute::AttrKind getAttrKindFromName(StringRef AttrName);		static Attribute::AttrKind getAttrKindFromName(StringRef AttrName);

static StringRef getNameFromAttrKind(Attribute::AttrKind AttrKind);		static StringRef getNameFromAttrKind(Attribute::AttrKind AttrKind);

/// Return true if and only if the attribute has an Argument.		/// Return true if and only if the attribute has an Argument.
static bool doesAttrKindHaveArgument(Attribute::AttrKind AttrKind);		static bool doesAttrKindHaveArgument(Attribute::AttrKind AttrKind);

▲ Show 20 Lines • Show All 178 Lines • ▼ Show 20 Lines	public:
/// Return the target-dependent attribute object.		/// Return the target-dependent attribute object.
Attribute getAttribute(StringRef Kind) const;		Attribute getAttribute(StringRef Kind) const;

MaybeAlign getAlignment() const;		MaybeAlign getAlignment() const;
MaybeAlign getStackAlignment() const;		MaybeAlign getStackAlignment() const;
uint64_t getDereferenceableBytes() const;		uint64_t getDereferenceableBytes() const;
uint64_t getDereferenceableOrNullBytes() const;		uint64_t getDereferenceableOrNullBytes() const;
Type *getByValType() const;		Type *getByValType() const;
		Type *getPreallocatedType() const;
std::pair<unsigned, Optional<unsigned>> getAllocSizeArgs() const;		std::pair<unsigned, Optional<unsigned>> getAllocSizeArgs() const;
std::string getAsString(bool InAttrGrp = false) const;		std::string getAsString(bool InAttrGrp = false) const;

using iterator = const Attribute *;		using iterator = const Attribute *;

iterator begin() const;		iterator begin() const;
iterator end() const;		iterator end() const;
#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)		#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
▲ Show 20 Lines • Show All 406 Lines • ▼ Show 20 Lines	class AttrBuilder {
std::bitset<Attribute::EndAttrKinds> Attrs;		std::bitset<Attribute::EndAttrKinds> Attrs;
std::map<std::string, std::string, std::less<>> TargetDepAttrs;		std::map<std::string, std::string, std::less<>> TargetDepAttrs;
MaybeAlign Alignment;		MaybeAlign Alignment;
MaybeAlign StackAlignment;		MaybeAlign StackAlignment;
uint64_t DerefBytes = 0;		uint64_t DerefBytes = 0;
uint64_t DerefOrNullBytes = 0;		uint64_t DerefOrNullBytes = 0;
uint64_t AllocSizeArgs = 0;		uint64_t AllocSizeArgs = 0;
Type *ByValType = nullptr;		Type *ByValType = nullptr;
		Type *PreallocatedType = nullptr;

public:		public:
AttrBuilder() = default;		AttrBuilder() = default;

AttrBuilder(const Attribute &A) {		AttrBuilder(const Attribute &A) {
addAttribute(A);		addAttribute(A);
}		}

▲ Show 20 Lines • Show All 62 Lines • ▼ Show 20 Lines	public:

/// Retrieve the number of dereferenceable_or_null bytes, if the		/// Retrieve the number of dereferenceable_or_null bytes, if the
/// dereferenceable_or_null attribute exists (zero is returned otherwise).		/// dereferenceable_or_null attribute exists (zero is returned otherwise).
uint64_t getDereferenceableOrNullBytes() const { return DerefOrNullBytes; }		uint64_t getDereferenceableOrNullBytes() const { return DerefOrNullBytes; }

/// Retrieve the byval type.		/// Retrieve the byval type.
Type *getByValType() const { return ByValType; }		Type *getByValType() const { return ByValType; }

		/// Retrieve the preallocated type.
		Type *getPreallocatedType() const { return PreallocatedType; }

/// Retrieve the allocsize args, if the allocsize attribute exists. If it		/// Retrieve the allocsize args, if the allocsize attribute exists. If it
/// doesn't exist, pair(0, 0) is returned.		/// doesn't exist, pair(0, 0) is returned.
std::pair<unsigned, Optional<unsigned>> getAllocSizeArgs() const;		std::pair<unsigned, Optional<unsigned>> getAllocSizeArgs() const;

/// This turns an alignment into the form used internally in Attribute.		/// This turns an alignment into the form used internally in Attribute.
/// This call has no effect if Align is not set.		/// This call has no effect if Align is not set.
AttrBuilder &addAlignmentAttr(MaybeAlign Align);		AttrBuilder &addAlignmentAttr(MaybeAlign Align);

Show All 27 Lines	public:

/// This turns one (or two) ints into the form used internally in Attribute.		/// This turns one (or two) ints into the form used internally in Attribute.
AttrBuilder &addAllocSizeAttr(unsigned ElemSizeArg,		AttrBuilder &addAllocSizeAttr(unsigned ElemSizeArg,
const Optional<unsigned> &NumElemsArg);		const Optional<unsigned> &NumElemsArg);

/// This turns a byval type into the form used internally in Attribute.		/// This turns a byval type into the form used internally in Attribute.
AttrBuilder &addByValAttr(Type *Ty);		AttrBuilder &addByValAttr(Type *Ty);

		/// This turns a preallocated type into the form used internally in Attribute.
		AttrBuilder &addPreallocatedAttr(Type *Ty);

/// Add an allocsize attribute, using the representation returned by		/// Add an allocsize attribute, using the representation returned by
/// Attribute.getIntValue().		/// Attribute.getIntValue().
AttrBuilder &addAllocSizeAttrFromRawRepr(uint64_t RawAllocSizeRepr);		AttrBuilder &addAllocSizeAttrFromRawRepr(uint64_t RawAllocSizeRepr);

/// Return true if the builder contains no target-independent		/// Return true if the builder contains no target-independent
/// attributes.		/// attributes.
bool empty() const { return Attrs.none(); }		bool empty() const { return Attrs.none(); }

▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines

llvm/include/llvm/IR/Attributes.td

	Show First 20 Lines • Show All 127 Lines • ▼ Show 20 Lines
	def OptForFuzzing : EnumAttr<"optforfuzzing">;			def OptForFuzzing : EnumAttr<"optforfuzzing">;

	/// opt_size.			/// opt_size.
	def OptimizeForSize : EnumAttr<"optsize">;			def OptimizeForSize : EnumAttr<"optsize">;

	/// Function must not be optimized.			/// Function must not be optimized.
	def OptimizeNone : EnumAttr<"optnone">;			def OptimizeNone : EnumAttr<"optnone">;

				/// Similar to byval but without a copy.
				def Preallocated : TypeAttr<"preallocated">;

	/// Function does not access memory.			/// Function does not access memory.
	def ReadNone : EnumAttr<"readnone">;			def ReadNone : EnumAttr<"readnone">;

	/// Function only reads from memory.			/// Function only reads from memory.
	def ReadOnly : EnumAttr<"readonly">;			def ReadOnly : EnumAttr<"readonly">;

	/// Return value is always equal to this argument.			/// Return value is always equal to this argument.
	def Returned : EnumAttr<"returned">;			def Returned : EnumAttr<"returned">;
	▲ Show 20 Lines • Show All 128 Lines • Show Last 20 Lines

llvm/include/llvm/IR/Intrinsics.td

	Show First 20 Lines • Show All 502 Lines • ▼ Show 20 Lines
	// A call to profile runtime for value profiling of target expressions			// A call to profile runtime for value profiling of target expressions
	// through instrumentation based profiling.			// through instrumentation based profiling.
	def int_instrprof_value_profile : Intrinsic<[],			def int_instrprof_value_profile : Intrinsic<[],
	[llvm_ptr_ty, llvm_i64_ty,			[llvm_ptr_ty, llvm_i64_ty,
	llvm_i64_ty, llvm_i32_ty,			llvm_i64_ty, llvm_i32_ty,
	llvm_i32_ty],			llvm_i32_ty],
	[]>;			[]>;

				def int_call_preallocated_setup : Intrinsic<[llvm_token_ty], [llvm_i32_ty]>;
				def int_call_preallocated_arg : Intrinsic<[llvm_ptr_ty], [llvm_token_ty, llvm_i32_ty]>;

	//===------------------- Standard C Library Intrinsics --------------------===//			//===------------------- Standard C Library Intrinsics --------------------===//
	//			//

	def int_memcpy : Intrinsic<[],			def int_memcpy : Intrinsic<[],
	[llvm_anyptr_ty, llvm_anyptr_ty, llvm_anyint_ty,			[llvm_anyptr_ty, llvm_anyptr_ty, llvm_anyint_ty,
	llvm_i1_ty],			llvm_i1_ty],
	[IntrArgMemOnly, IntrWillReturn, NoCapture<0>, NoCapture<1>,			[IntrArgMemOnly, IntrWillReturn, NoCapture<0>, NoCapture<1>,
	NoAlias<0>, NoAlias<1>, WriteOnly<0>, ReadOnly<1>, ImmArg<3>]>;			NoAlias<0>, NoAlias<1>, WriteOnly<0>, ReadOnly<1>, ImmArg<3>]>;
	▲ Show 20 Lines • Show All 958 Lines • Show Last 20 Lines

llvm/include/llvm/IR/LLVMContext.h

	Show First 20 Lines • Show All 77 Lines • ▼ Show 20 Lines
	#define LLVM_FIXED_MD_KIND(EnumID, Name, Value) EnumID = Value,			#define LLVM_FIXED_MD_KIND(EnumID, Name, Value) EnumID = Value,
	#include "llvm/IR/FixedMetadataKinds.def"			#include "llvm/IR/FixedMetadataKinds.def"
	#undef LLVM_FIXED_MD_KIND			#undef LLVM_FIXED_MD_KIND
	};			};

	/// Known operand bundle tag IDs, which always have the same value. All			/// Known operand bundle tag IDs, which always have the same value. All
	/// operand bundle tags that LLVM has special knowledge of are listed here.			/// operand bundle tags that LLVM has special knowledge of are listed here.
	/// Additionally, this scheme allows LLVM to efficiently check for specific			/// Additionally, this scheme allows LLVM to efficiently check for specific
	/// operand bundle tags without comparing strings.			/// operand bundle tags without comparing strings. Keep this in sync with
				/// LLVMContext::LLVMContext().
	enum : unsigned {			enum : unsigned {
	OB_deopt = 0, // "deopt"			OB_deopt = 0, // "deopt"
	OB_funclet = 1, // "funclet"			OB_funclet = 1, // "funclet"
	OB_gc_transition = 2, // "gc-transition"			OB_gc_transition = 2, // "gc-transition"
	OB_cfguardtarget = 3, // "cfguardtarget"			OB_cfguardtarget = 3, // "cfguardtarget"
				OB_preallocated = 4, // "preallocated"
	};			};

	/// getMDKindID - Return a unique non-zero ID for the specified metadata kind.			/// getMDKindID - Return a unique non-zero ID for the specified metadata kind.
	/// This ID is uniqued across modules in the current LLVMContext.			/// This ID is uniqued across modules in the current LLVMContext.
	unsigned getMDKindID(StringRef Name) const;			unsigned getMDKindID(StringRef Name) const;

	/// getMDKindNames - Populate client supplied SmallVector with the name for			/// getMDKindNames - Populate client supplied SmallVector with the name for
	/// custom metadata IDs registered in this LLVMContext.			/// custom metadata IDs registered in this LLVMContext.
	▲ Show 20 Lines • Show All 237 Lines • Show Last 20 Lines

llvm/lib/AsmParser/LLLexer.cpp

Show First 20 Lines • Show All 661 Lines • ▼ Show 20 Lines	#define KEYWORD(STR) \
KEYWORD(noredzone);		KEYWORD(noredzone);
KEYWORD(noreturn);		KEYWORD(noreturn);
KEYWORD(nosync);		KEYWORD(nosync);
KEYWORD(nocf_check);		KEYWORD(nocf_check);
KEYWORD(nounwind);		KEYWORD(nounwind);
KEYWORD(optforfuzzing);		KEYWORD(optforfuzzing);
KEYWORD(optnone);		KEYWORD(optnone);
KEYWORD(optsize);		KEYWORD(optsize);
		KEYWORD(preallocated);
KEYWORD(readnone);		KEYWORD(readnone);
KEYWORD(readonly);		KEYWORD(readonly);
KEYWORD(returned);		KEYWORD(returned);
KEYWORD(returns_twice);		KEYWORD(returns_twice);
KEYWORD(signext);		KEYWORD(signext);
KEYWORD(speculatable);		KEYWORD(speculatable);
KEYWORD(sret);		KEYWORD(sret);
KEYWORD(ssp);		KEYWORD(ssp);
▲ Show 20 Lines • Show All 473 Lines • Show Last 20 Lines

llvm/lib/AsmParser/LLParser.h

Show First 20 Lines • Show All 332 Lines • ▼ Show 20 Lines	private:
bool ParseNamedMetadata();		bool ParseNamedMetadata();
bool ParseMDString(MDString *&Result);		bool ParseMDString(MDString *&Result);
bool ParseMDNodeID(MDNode *&Result);		bool ParseMDNodeID(MDNode *&Result);
bool ParseUnnamedAttrGrp();		bool ParseUnnamedAttrGrp();
bool ParseFnAttributeValuePairs(AttrBuilder &B,		bool ParseFnAttributeValuePairs(AttrBuilder &B,
std::vector<unsigned> &FwdRefAttrGrps,		std::vector<unsigned> &FwdRefAttrGrps,
bool inAttrGrp, LocTy &BuiltinLoc);		bool inAttrGrp, LocTy &BuiltinLoc);
bool ParseByValWithOptionalType(Type *&Result);		bool ParseByValWithOptionalType(Type *&Result);
		bool ParsePreallocated(Type *&Result);

// Module Summary Index Parsing.		// Module Summary Index Parsing.
bool SkipModuleSummaryEntry();		bool SkipModuleSummaryEntry();
bool ParseSummaryEntry();		bool ParseSummaryEntry();
bool ParseModuleEntry(unsigned ID);		bool ParseModuleEntry(unsigned ID);
bool ParseModuleReference(StringRef &ModulePath);		bool ParseModuleReference(StringRef &ModulePath);
bool ParseGVReference(ValueInfo &VI, unsigned &GVId);		bool ParseGVReference(ValueInfo &VI, unsigned &GVId);
bool ParseSummaryIndexFlags();		bool ParseSummaryIndexFlags();
▲ Show 20 Lines • Show All 264 Lines • Show Last 20 Lines

llvm/lib/AsmParser/LLParser.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,339 Lines • ▼ Show 20 Lines	case lltok::kw_sanitize_memory:
B.addAttribute(Attribute::SanitizeMemory); break;		B.addAttribute(Attribute::SanitizeMemory); break;
case lltok::kw_speculative_load_hardening:		case lltok::kw_speculative_load_hardening:
B.addAttribute(Attribute::SpeculativeLoadHardening);		B.addAttribute(Attribute::SpeculativeLoadHardening);
break;		break;
case lltok::kw_strictfp: B.addAttribute(Attribute::StrictFP); break;		case lltok::kw_strictfp: B.addAttribute(Attribute::StrictFP); break;
case lltok::kw_uwtable: B.addAttribute(Attribute::UWTable); break;		case lltok::kw_uwtable: B.addAttribute(Attribute::UWTable); break;
case lltok::kw_willreturn: B.addAttribute(Attribute::WillReturn); break;		case lltok::kw_willreturn: B.addAttribute(Attribute::WillReturn); break;
case lltok::kw_writeonly: B.addAttribute(Attribute::WriteOnly); break;		case lltok::kw_writeonly: B.addAttribute(Attribute::WriteOnly); break;
		case lltok::kw_preallocated: {
		Type *Ty;
		if (ParsePreallocated(Ty))
		return true;
		B.addPreallocatedAttr(Ty);
		break;
		}

// Error handling.		// Error handling.
case lltok::kw_inreg:		case lltok::kw_inreg:
case lltok::kw_signext:		case lltok::kw_signext:
case lltok::kw_zeroext:		case lltok::kw_zeroext:
HaveError \|=		HaveError \|=
Error(Lex.getLoc(),		Error(Lex.getLoc(),
"invalid use of attribute on a function");		"invalid use of attribute on a function");
Show All 12 Lines	while (true) {
case lltok::kw_swiftself:		case lltok::kw_swiftself:
case lltok::kw_immarg:		case lltok::kw_immarg:
HaveError \|=		HaveError \|=
Error(Lex.getLoc(),		Error(Lex.getLoc(),
"invalid use of parameter-only attribute on a function");		"invalid use of parameter-only attribute on a function");
break;		break;
}		}

		// ParsePreallocated() consumes token
		if (Token != lltok::kw_preallocated)
Lex.Lex();		Lex.Lex();
}		}
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// GlobalValue Reference/Resolution Routines.		// GlobalValue Reference/Resolution Routines.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static inline GlobalValue createGlobalFwdRef(Module M, PointerType *PTy,		static inline GlobalValue createGlobalFwdRef(Module M, PointerType *PTy,
▲ Show 20 Lines • Show All 247 Lines • ▼ Show 20 Lines	while (true) {
}		}
case lltok::kw_byval: {		case lltok::kw_byval: {
Type *Ty;		Type *Ty;
if (ParseByValWithOptionalType(Ty))		if (ParseByValWithOptionalType(Ty))
return true;		return true;
B.addByValAttr(Ty);		B.addByValAttr(Ty);
continue;		continue;
}		}
		case lltok::kw_preallocated: {
		Type *Ty;
		if (ParsePreallocated(Ty))
		return true;
		B.addPreallocatedAttr(Ty);
		continue;
		}
case lltok::kw_dereferenceable: {		case lltok::kw_dereferenceable: {
uint64_t Bytes;		uint64_t Bytes;
if (ParseOptionalDerefAttrBytes(lltok::kw_dereferenceable, Bytes))		if (ParseOptionalDerefAttrBytes(lltok::kw_dereferenceable, Bytes))
return true;		return true;
B.addDereferenceableAttr(Bytes);		B.addDereferenceableAttr(Bytes);
continue;		continue;
}		}
case lltok::kw_dereferenceable_or_null: {		case lltok::kw_dereferenceable_or_null: {
▲ Show 20 Lines • Show All 151 Lines • ▼ Show 20 Lines	while (true) {
case lltok::kw_sspreq:		case lltok::kw_sspreq:
case lltok::kw_sspstrong:		case lltok::kw_sspstrong:
case lltok::kw_safestack:		case lltok::kw_safestack:
case lltok::kw_shadowcallstack:		case lltok::kw_shadowcallstack:
case lltok::kw_strictfp:		case lltok::kw_strictfp:
case lltok::kw_uwtable:		case lltok::kw_uwtable:
HaveError \|= Error(Lex.getLoc(), "invalid use of function-only attribute");		HaveError \|= Error(Lex.getLoc(), "invalid use of function-only attribute");
break;		break;

case lltok::kw_readnone:		case lltok::kw_readnone:
case lltok::kw_readonly:		case lltok::kw_readonly:
HaveError \|= Error(Lex.getLoc(), "invalid use of attribute on return type");		HaveError \|= Error(Lex.getLoc(), "invalid use of attribute on return type");
		break;
		case lltok::kw_preallocated:
		HaveError \|=
		Error(Lex.getLoc(),
		"invalid use of parameter-only/call site-only attribute");
		break;
}		}

Lex.Lex();		Lex.Lex();
}		}
}		}

static unsigned parseOptionalLinkageAux(lltok::Kind Kind, bool &HasLinkage) {		static unsigned parseOptionalLinkageAux(lltok::Kind Kind, bool &HasLinkage) {
HasLinkage = true;		HasLinkage = true;
▲ Show 20 Lines • Show All 695 Lines • ▼ Show 20 Lines	if (!EatIfPresent(lltok::lparen))
return false;		return false;
if (ParseType(Result))		if (ParseType(Result))
return true;		return true;
if (!EatIfPresent(lltok::rparen))		if (!EatIfPresent(lltok::rparen))
return Error(Lex.getLoc(), "expected ')'");		return Error(Lex.getLoc(), "expected ')'");
return false;		return false;
}		}

		/// ParsePreallocated
		/// ::= preallocated(<ty>)
		bool LLParser::ParsePreallocated(Type *&Result) {
		Result = nullptr;
		if (!EatIfPresent(lltok::kw_preallocated))
		return true;
		if (!EatIfPresent(lltok::lparen))
		return Error(Lex.getLoc(), "expected '('");
		if (ParseType(Result))
		return true;
		if (!EatIfPresent(lltok::rparen))
		return Error(Lex.getLoc(), "expected ')'");
		return false;
		}

/// ParseOptionalOperandBundles		/// ParseOptionalOperandBundles
/// ::= /empty/		/// ::= /empty/
/// ::= '[' OperandBundle [, OperandBundle ]* ']'		/// ::= '[' OperandBundle [, OperandBundle ]* ']'
///		///
/// OperandBundle		/// OperandBundle
/// ::= bundle-tag '(' ')'		/// ::= bundle-tag '(' ')'
/// ::= bundle-tag '(' Type Value [, Type Value ]* ')'		/// ::= bundle-tag '(' Type Value [, Type Value ]* ')'
///		///
▲ Show 20 Lines • Show All 6,427 Lines • Show Last 20 Lines

llvm/lib/AsmParser/LLToken.h

Show First 20 Lines • Show All 207 Lines • ▼ Show 20 Lines	enum Kind {
kw_noredzone,		kw_noredzone,
kw_noreturn,		kw_noreturn,
kw_nosync,		kw_nosync,
kw_nocf_check,		kw_nocf_check,
kw_nounwind,		kw_nounwind,
kw_optforfuzzing,		kw_optforfuzzing,
kw_optnone,		kw_optnone,
kw_optsize,		kw_optsize,
		kw_preallocated,
kw_readnone,		kw_readnone,
kw_readonly,		kw_readonly,
kw_returned,		kw_returned,
kw_returns_twice,		kw_returns_twice,
kw_signext,		kw_signext,
kw_speculatable,		kw_speculatable,
kw_ssp,		kw_ssp,
kw_sspreq,		kw_sspreq,
▲ Show 20 Lines • Show All 254 Lines • Show Last 20 Lines

llvm/lib/Bitcode/Reader/BitcodeReader.cpp

Show First 20 Lines • Show All 1,297 Lines • ▼ Show 20 Lines	case Attribute::ArgMemOnly:
llvm_unreachable("argmemonly attribute not supported in raw format");		llvm_unreachable("argmemonly attribute not supported in raw format");
break;		break;
case Attribute::AllocSize:		case Attribute::AllocSize:
llvm_unreachable("allocsize not supported in raw format");		llvm_unreachable("allocsize not supported in raw format");
break;		break;
case Attribute::SanitizeMemTag:		case Attribute::SanitizeMemTag:
llvm_unreachable("sanitize_memtag attribute not supported in raw format");		llvm_unreachable("sanitize_memtag attribute not supported in raw format");
break;		break;
		case Attribute::Preallocated:
		llvm_unreachable("preallocated attribute not supported in raw format");
		break;
}		}
llvm_unreachable("Unsupported attribute type");		llvm_unreachable("Unsupported attribute type");
}		}

static void addRawAttributeValue(AttrBuilder &B, uint64_t Val) {		static void addRawAttributeValue(AttrBuilder &B, uint64_t Val) {
if (!Val) return;		if (!Val) return;

for (Attribute::AttrKind I = Attribute::None; I != Attribute::EndAttrKinds;		for (Attribute::AttrKind I = Attribute::None; I != Attribute::EndAttrKinds;
I = Attribute::AttrKind(I + 1)) {		I = Attribute::AttrKind(I + 1)) {
if (I == Attribute::SanitizeMemTag \|\|		if (I == Attribute::SanitizeMemTag \|\| I == Attribute::Dereferenceable \|\|
I == Attribute::Dereferenceable \|\|		I == Attribute::DereferenceableOrNull \|\| I == Attribute::ArgMemOnly \|\|
I == Attribute::DereferenceableOrNull \|\|		I == Attribute::AllocSize \|\| I == Attribute::NoSync \|\|
I == Attribute::ArgMemOnly \|\|		I == Attribute::Preallocated)
I == Attribute::AllocSize \|\|
I == Attribute::NoSync)
continue;		continue;
if (uint64_t A = (Val & getRawAttributeMask(I))) {		if (uint64_t A = (Val & getRawAttributeMask(I))) {
if (I == Attribute::Alignment)		if (I == Attribute::Alignment)
B.addAlignmentAttr(1ULL << ((A >> 16) - 1));		B.addAlignmentAttr(1ULL << ((A >> 16) - 1));
else if (I == Attribute::StackAlignment)		else if (I == Attribute::StackAlignment)
B.addStackAlignmentAttr(1ULL << ((A >> 26)-1));		B.addStackAlignmentAttr(1ULL << ((A >> 26)-1));
else		else
B.addAttribute(I);		B.addAttribute(I);
▲ Show 20 Lines • Show All 210 Lines • ▼ Show 20 Lines	static Attribute::AttrKind getAttrFromCode(uint64_t Code) {
case bitc::ATTR_KIND_WRITEONLY:		case bitc::ATTR_KIND_WRITEONLY:
return Attribute::WriteOnly;		return Attribute::WriteOnly;
case bitc::ATTR_KIND_Z_EXT:		case bitc::ATTR_KIND_Z_EXT:
return Attribute::ZExt;		return Attribute::ZExt;
case bitc::ATTR_KIND_IMMARG:		case bitc::ATTR_KIND_IMMARG:
return Attribute::ImmArg;		return Attribute::ImmArg;
case bitc::ATTR_KIND_SANITIZE_MEMTAG:		case bitc::ATTR_KIND_SANITIZE_MEMTAG:
return Attribute::SanitizeMemTag;		return Attribute::SanitizeMemTag;
		case bitc::ATTR_KIND_PREALLOCATED:
		return Attribute::Preallocated;
}		}
}		}

Error BitcodeReader::parseAlignmentValue(uint64_t Exponent,		Error BitcodeReader::parseAlignmentValue(uint64_t Exponent,
MaybeAlign &Alignment) {		MaybeAlign &Alignment) {
// Note: Alignment in bitcode files is incremented by 1, so that zero		// Note: Alignment in bitcode files is incremented by 1, so that zero
// can be used for default alignment.		// can be used for default alignment.
if (Exponent > Value::MaxAlignmentExponent + 1)		if (Exponent > Value::MaxAlignmentExponent + 1)
▲ Show 20 Lines • Show All 99 Lines • ▼ Show 20 Lines	case bitc::PARAMATTR_GRP_CODE_ENTRY: { // ENTRY: [grpid, idx, a0, a1, ...]
B.addAttribute(KindStr.str(), ValStr.str());		B.addAttribute(KindStr.str(), ValStr.str());
} else {		} else {
assert((Record[i] == 5 \|\| Record[i] == 6) &&		assert((Record[i] == 5 \|\| Record[i] == 6) &&
"Invalid attribute group entry");		"Invalid attribute group entry");
bool HasType = Record[i] == 6;		bool HasType = Record[i] == 6;
Attribute::AttrKind Kind;		Attribute::AttrKind Kind;
if (Error Err = parseAttrKind(Record[++i], &Kind))		if (Error Err = parseAttrKind(Record[++i], &Kind))
return Err;		return Err;
if (Kind == Attribute::ByVal)		if (Kind == Attribute::ByVal) {
B.addByValAttr(HasType ? getTypeByID(Record[++i]) : nullptr);		B.addByValAttr(HasType ? getTypeByID(Record[++i]) : nullptr);
		} else if (Kind == Attribute::Preallocated) {
		B.addPreallocatedAttr(getTypeByID(Record[++i]));
		}
}		}
}		}

UpgradeFramePointerAttributes(B);		UpgradeFramePointerAttributes(B);
MAttributeGroups[GrpID] = AttributeList::get(Context, Idx, B);		MAttributeGroups[GrpID] = AttributeList::get(Context, Idx, B);
break;		break;
}		}
}		}
▲ Show 20 Lines • Show All 5,080 Lines • Show Last 20 Lines

llvm/lib/Bitcode/Writer/BitcodeWriter.cpp

Show First 20 Lines • Show All 719 Lines • ▼ Show 20 Lines	static uint64_t getAttrKindEncoding(Attribute::AttrKind Kind) {
case Attribute::WriteOnly:		case Attribute::WriteOnly:
return bitc::ATTR_KIND_WRITEONLY;		return bitc::ATTR_KIND_WRITEONLY;
case Attribute::ZExt:		case Attribute::ZExt:
return bitc::ATTR_KIND_Z_EXT;		return bitc::ATTR_KIND_Z_EXT;
case Attribute::ImmArg:		case Attribute::ImmArg:
return bitc::ATTR_KIND_IMMARG;		return bitc::ATTR_KIND_IMMARG;
case Attribute::SanitizeMemTag:		case Attribute::SanitizeMemTag:
return bitc::ATTR_KIND_SANITIZE_MEMTAG;		return bitc::ATTR_KIND_SANITIZE_MEMTAG;
		case Attribute::Preallocated:
		return bitc::ATTR_KIND_PREALLOCATED;
case Attribute::EndAttrKinds:		case Attribute::EndAttrKinds:
llvm_unreachable("Can not encode end-attribute kinds marker.");		llvm_unreachable("Can not encode end-attribute kinds marker.");
case Attribute::None:		case Attribute::None:
llvm_unreachable("Can not encode none-attribute.");		llvm_unreachable("Can not encode none-attribute.");
case Attribute::EmptyKey:		case Attribute::EmptyKey:
case Attribute::TombstoneKey:		case Attribute::TombstoneKey:
llvm_unreachable("Trying to encode EmptyKey/TombstoneKey");		llvm_unreachable("Trying to encode EmptyKey/TombstoneKey");
}		}
▲ Show 20 Lines • Show All 4,071 Lines • Show Last 20 Lines

llvm/lib/IR/AsmWriter.cpp

	Show First 20 Lines • Show All 4,191 Lines • ▼ Show 20 Lines
	}			}

	void AssemblyWriter::writeAttribute(const Attribute &Attr, bool InAttrGroup) {			void AssemblyWriter::writeAttribute(const Attribute &Attr, bool InAttrGroup) {
	if (!Attr.isTypeAttribute()) {			if (!Attr.isTypeAttribute()) {
	Out << Attr.getAsString(InAttrGroup);			Out << Attr.getAsString(InAttrGroup);
	return;			return;
	}			}

	assert(Attr.hasAttribute(Attribute::ByVal) && "unexpected type attr");			assert(Attr.hasAttribute(Attribute::ByVal) \|\|
				Attr.hasAttribute(Attribute::Preallocated) && "unexpected type attr");

				if (Attr.hasAttribute(Attribute::ByVal)) {
	Out << "byval";			Out << "byval";
				} else {
				Out << "preallocated";
				}

	if (Type *Ty = Attr.getValueAsType()) {			if (Type *Ty = Attr.getValueAsType()) {
	Out << '(';			Out << '(';
	TypePrinter.print(Ty, Out);			TypePrinter.print(Ty, Out);
	Out << ')';			Out << ')';
	}			}
	}			}

	void AssemblyWriter::writeAttributeSet(const AttributeSet &AttrSet,			void AssemblyWriter::writeAttributeSet(const AttributeSet &AttrSet,
	▲ Show 20 Lines • Show All 344 Lines • Show Last 20 Lines

llvm/lib/IR/AttributeImpl.h

Show First 20 Lines • Show All 214 Lines • ▼ Show 20 Lines	public:

MaybeAlign getAlignment() const;		MaybeAlign getAlignment() const;
MaybeAlign getStackAlignment() const;		MaybeAlign getStackAlignment() const;
uint64_t getDereferenceableBytes() const;		uint64_t getDereferenceableBytes() const;
uint64_t getDereferenceableOrNullBytes() const;		uint64_t getDereferenceableOrNullBytes() const;
std::pair<unsigned, Optional<unsigned>> getAllocSizeArgs() const;		std::pair<unsigned, Optional<unsigned>> getAllocSizeArgs() const;
std::string getAsString(bool InAttrGrp) const;		std::string getAsString(bool InAttrGrp) const;
Type *getByValType() const;		Type *getByValType() const;
		Type *getPreallocatedType() const;

using iterator = const Attribute *;		using iterator = const Attribute *;

iterator begin() const { return getTrailingObjects<Attribute>(); }		iterator begin() const { return getTrailingObjects<Attribute>(); }
iterator end() const { return begin() + NumAttrs; }		iterator end() const { return begin() + NumAttrs; }

void Profile(FoldingSetNodeID &ID) const {		void Profile(FoldingSetNodeID &ID) const {
Profile(ID, makeArrayRef(begin(), end()));		Profile(ID, makeArrayRef(begin(), end()));
▲ Show 20 Lines • Show All 61 Lines • Show Last 20 Lines

llvm/lib/IR/Attributes.cpp

Show First 20 Lines • Show All 163 Lines • ▼ Show 20 Lines	Attribute Attribute::getWithDereferenceableOrNullBytes(LLVMContext &Context,
assert(Bytes && "Bytes must be non-zero.");		assert(Bytes && "Bytes must be non-zero.");
return get(Context, DereferenceableOrNull, Bytes);		return get(Context, DereferenceableOrNull, Bytes);
}		}

Attribute Attribute::getWithByValType(LLVMContext &Context, Type *Ty) {		Attribute Attribute::getWithByValType(LLVMContext &Context, Type *Ty) {
return get(Context, ByVal, Ty);		return get(Context, ByVal, Ty);
}		}

		Attribute Attribute::getWithPreallocatedType(LLVMContext &Context, Type *Ty) {
		return get(Context, Preallocated, Ty);
		}

Attribute		Attribute
Attribute::getWithAllocSizeArgs(LLVMContext &Context, unsigned ElemSizeArg,		Attribute::getWithAllocSizeArgs(LLVMContext &Context, unsigned ElemSizeArg,
const Optional<unsigned> &NumElemsArg) {		const Optional<unsigned> &NumElemsArg) {
assert(!(ElemSizeArg == 0 && NumElemsArg && *NumElemsArg == 0) &&		assert(!(ElemSizeArg == 0 && NumElemsArg && *NumElemsArg == 0) &&
"Invalid allocsize arguments -- given allocsize(0, 0)");		"Invalid allocsize arguments -- given allocsize(0, 0)");
return get(Context, AllocSize, packAllocSizeArgs(ElemSizeArg, NumElemsArg));		return get(Context, AllocSize, packAllocSizeArgs(ElemSizeArg, NumElemsArg));
}		}

▲ Show 20 Lines • Show All 261 Lines • ▼ Show 20 Lines	if (Type *Ty = getValueAsType()) {
Result += '(';		Result += '(';
Ty->print(OS, false, true);		Ty->print(OS, false, true);
OS.flush();		OS.flush();
Result += ')';		Result += ')';
}		}
return Result;		return Result;
}		}

		if (hasAttribute(Attribute::Preallocated)) {
		std::string Result;
		Result += "preallocated";
		raw_string_ostream OS(Result);
		Result += '(';
		getValueAsType()->print(OS, false, true);
		OS.flush();
		Result += ')';
		return Result;
		}

// FIXME: These should be output like this:		// FIXME: These should be output like this:
//		//
// align=4		// align=4
// alignstack=8		// alignstack=8
//		//
if (hasAttribute(Attribute::Alignment)) {		if (hasAttribute(Attribute::Alignment)) {
std::string Result;		std::string Result;
Result += "align";		Result += "align";
▲ Show 20 Lines • Show All 269 Lines • ▼ Show 20 Lines
uint64_t AttributeSet::getDereferenceableOrNullBytes() const {		uint64_t AttributeSet::getDereferenceableOrNullBytes() const {
return SetNode ? SetNode->getDereferenceableOrNullBytes() : 0;		return SetNode ? SetNode->getDereferenceableOrNullBytes() : 0;
}		}

Type *AttributeSet::getByValType() const {		Type *AttributeSet::getByValType() const {
return SetNode ? SetNode->getByValType() : nullptr;		return SetNode ? SetNode->getByValType() : nullptr;
}		}

		Type *AttributeSet::getPreallocatedType() const {
		return SetNode ? SetNode->getPreallocatedType() : nullptr;
		}

std::pair<unsigned, Optional<unsigned>> AttributeSet::getAllocSizeArgs() const {		std::pair<unsigned, Optional<unsigned>> AttributeSet::getAllocSizeArgs() const {
return SetNode ? SetNode->getAllocSizeArgs()		return SetNode ? SetNode->getAllocSizeArgs()
: std::pair<unsigned, Optional<unsigned>>(0, 0);		: std::pair<unsigned, Optional<unsigned>>(0, 0);
}		}

std::string AttributeSet::getAsString(bool InAttrGrp) const {		std::string AttributeSet::getAsString(bool InAttrGrp) const {
return SetNode ? SetNode->getAsString(InAttrGrp) : "";		return SetNode ? SetNode->getAsString(InAttrGrp) : "";
}		}
▲ Show 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	for (Attribute::AttrKind Kind = Attribute::None;
if (!B.contains(Kind))		if (!B.contains(Kind))
continue;		continue;

Attribute Attr;		Attribute Attr;
switch (Kind) {		switch (Kind) {
case Attribute::ByVal:		case Attribute::ByVal:
Attr = Attribute::getWithByValType(C, B.getByValType());		Attr = Attribute::getWithByValType(C, B.getByValType());
break;		break;
		case Attribute::Preallocated:
		Attr = Attribute::getWithPreallocatedType(C, B.getPreallocatedType());
		break;
case Attribute::Alignment:		case Attribute::Alignment:
assert(B.getAlignment() && "Alignment must be set");		assert(B.getAlignment() && "Alignment must be set");
Attr = Attribute::getWithAlignment(C, *B.getAlignment());		Attr = Attribute::getWithAlignment(C, *B.getAlignment());
break;		break;
case Attribute::StackAlignment:		case Attribute::StackAlignment:
assert(B.getStackAlignment() && "StackAlignment must be set");		assert(B.getStackAlignment() && "StackAlignment must be set");
Attr = Attribute::getWithStackAlignment(C, *B.getStackAlignment());		Attr = Attribute::getWithStackAlignment(C, *B.getStackAlignment());
break;		break;
▲ Show 20 Lines • Show All 67 Lines • ▼ Show 20 Lines
}		}

Type *AttributeSetNode::getByValType() const {		Type *AttributeSetNode::getByValType() const {
if (auto A = findEnumAttribute(Attribute::ByVal))		if (auto A = findEnumAttribute(Attribute::ByVal))
return A->getValueAsType();		return A->getValueAsType();
return 0;		return 0;
}		}

		Type *AttributeSetNode::getPreallocatedType() const {
		for (const auto &I : *this)
		if (I.hasAttribute(Attribute::Preallocated))
		return I.getValueAsType();
		return 0;
		}

uint64_t AttributeSetNode::getDereferenceableBytes() const {		uint64_t AttributeSetNode::getDereferenceableBytes() const {
if (auto A = findEnumAttribute(Attribute::Dereferenceable))		if (auto A = findEnumAttribute(Attribute::Dereferenceable))
return A->getDereferenceableBytes();		return A->getDereferenceableBytes();
return 0;		return 0;
}		}

uint64_t AttributeSetNode::getDereferenceableOrNullBytes() const {		uint64_t AttributeSetNode::getDereferenceableOrNullBytes() const {
if (auto A = findEnumAttribute(Attribute::DereferenceableOrNull))		if (auto A = findEnumAttribute(Attribute::DereferenceableOrNull))
▲ Show 20 Lines • Show All 567 Lines • ▼ Show 20 Lines
void AttrBuilder::clear() {		void AttrBuilder::clear() {
Attrs.reset();		Attrs.reset();
TargetDepAttrs.clear();		TargetDepAttrs.clear();
Alignment.reset();		Alignment.reset();
StackAlignment.reset();		StackAlignment.reset();
DerefBytes = DerefOrNullBytes = 0;		DerefBytes = DerefOrNullBytes = 0;
AllocSizeArgs = 0;		AllocSizeArgs = 0;
ByValType = nullptr;		ByValType = nullptr;
		PreallocatedType = nullptr;
}		}

AttrBuilder &AttrBuilder::addAttribute(Attribute::AttrKind Val) {		AttrBuilder &AttrBuilder::addAttribute(Attribute::AttrKind Val) {
assert((unsigned)Val < Attribute::EndAttrKinds && "Attribute out of range!");		assert((unsigned)Val < Attribute::EndAttrKinds && "Attribute out of range!");
assert(!Attribute::doesAttrKindHaveArgument(Val) &&		assert(!Attribute::doesAttrKindHaveArgument(Val) &&
"Adding integer attribute without adding a value!");		"Adding integer attribute without adding a value!");
Attrs[Val] = true;		Attrs[Val] = true;
return *this;		return *this;
Show All 9 Lines	AttrBuilder &AttrBuilder::addAttribute(Attribute Attr) {
Attrs[Kind] = true;		Attrs[Kind] = true;

if (Kind == Attribute::Alignment)		if (Kind == Attribute::Alignment)
Alignment = Attr.getAlignment();		Alignment = Attr.getAlignment();
else if (Kind == Attribute::StackAlignment)		else if (Kind == Attribute::StackAlignment)
StackAlignment = Attr.getStackAlignment();		StackAlignment = Attr.getStackAlignment();
else if (Kind == Attribute::ByVal)		else if (Kind == Attribute::ByVal)
ByValType = Attr.getValueAsType();		ByValType = Attr.getValueAsType();
		else if (Kind == Attribute::Preallocated)
		PreallocatedType = Attr.getValueAsType();
else if (Kind == Attribute::Dereferenceable)		else if (Kind == Attribute::Dereferenceable)
DerefBytes = Attr.getDereferenceableBytes();		DerefBytes = Attr.getDereferenceableBytes();
else if (Kind == Attribute::DereferenceableOrNull)		else if (Kind == Attribute::DereferenceableOrNull)
DerefOrNullBytes = Attr.getDereferenceableOrNullBytes();		DerefOrNullBytes = Attr.getDereferenceableOrNullBytes();
else if (Kind == Attribute::AllocSize)		else if (Kind == Attribute::AllocSize)
AllocSizeArgs = Attr.getValueAsInt();		AllocSizeArgs = Attr.getValueAsInt();
return *this;		return *this;
}		}

AttrBuilder &AttrBuilder::addAttribute(StringRef A, StringRef V) {		AttrBuilder &AttrBuilder::addAttribute(StringRef A, StringRef V) {
TargetDepAttrs[std::string(A)] = std::string(V);		TargetDepAttrs[std::string(A)] = std::string(V);
return *this;		return *this;
}		}

AttrBuilder &AttrBuilder::removeAttribute(Attribute::AttrKind Val) {		AttrBuilder &AttrBuilder::removeAttribute(Attribute::AttrKind Val) {
assert((unsigned)Val < Attribute::EndAttrKinds && "Attribute out of range!");		assert((unsigned)Val < Attribute::EndAttrKinds && "Attribute out of range!");
Attrs[Val] = false;		Attrs[Val] = false;

if (Val == Attribute::Alignment)		if (Val == Attribute::Alignment)
Alignment.reset();		Alignment.reset();
else if (Val == Attribute::StackAlignment)		else if (Val == Attribute::StackAlignment)
StackAlignment.reset();		StackAlignment.reset();
else if (Val == Attribute::ByVal)		else if (Val == Attribute::ByVal)
ByValType = nullptr;		ByValType = nullptr;
		else if (Val == Attribute::Preallocated)
		PreallocatedType = nullptr;
else if (Val == Attribute::Dereferenceable)		else if (Val == Attribute::Dereferenceable)
DerefBytes = 0;		DerefBytes = 0;
else if (Val == Attribute::DereferenceableOrNull)		else if (Val == Attribute::DereferenceableOrNull)
DerefOrNullBytes = 0;		DerefOrNullBytes = 0;
else if (Val == Attribute::AllocSize)		else if (Val == Attribute::AllocSize)
AllocSizeArgs = 0;		AllocSizeArgs = 0;

return *this;		return *this;
▲ Show 20 Lines • Show All 72 Lines • ▼ Show 20 Lines
}		}

AttrBuilder &AttrBuilder::addByValAttr(Type *Ty) {		AttrBuilder &AttrBuilder::addByValAttr(Type *Ty) {
Attrs[Attribute::ByVal] = true;		Attrs[Attribute::ByVal] = true;
ByValType = Ty;		ByValType = Ty;
return *this;		return *this;
}		}

		AttrBuilder &AttrBuilder::addPreallocatedAttr(Type *Ty) {
		Attrs[Attribute::Preallocated] = true;
		PreallocatedType = Ty;
		return *this;
		}

AttrBuilder &AttrBuilder::merge(const AttrBuilder &B) {		AttrBuilder &AttrBuilder::merge(const AttrBuilder &B) {
// FIXME: What if both have alignments, but they don't match?!		// FIXME: What if both have alignments, but they don't match?!
if (!Alignment)		if (!Alignment)
Alignment = B.Alignment;		Alignment = B.Alignment;

if (!StackAlignment)		if (!StackAlignment)
StackAlignment = B.StackAlignment;		StackAlignment = B.StackAlignment;

if (!DerefBytes)		if (!DerefBytes)
DerefBytes = B.DerefBytes;		DerefBytes = B.DerefBytes;

if (!DerefOrNullBytes)		if (!DerefOrNullBytes)
DerefOrNullBytes = B.DerefOrNullBytes;		DerefOrNullBytes = B.DerefOrNullBytes;

if (!AllocSizeArgs)		if (!AllocSizeArgs)
AllocSizeArgs = B.AllocSizeArgs;		AllocSizeArgs = B.AllocSizeArgs;

if (!ByValType)		if (!ByValType)
ByValType = B.ByValType;		ByValType = B.ByValType;

		if (!PreallocatedType)
		PreallocatedType = B.PreallocatedType;

Attrs \|= B.Attrs;		Attrs \|= B.Attrs;

for (auto I : B.td_attrs())		for (auto I : B.td_attrs())
TargetDepAttrs[I.first] = I.second;		TargetDepAttrs[I.first] = I.second;

return *this;		return *this;
}		}

Show All 12 Lines	if (B.DerefOrNullBytes)
DerefOrNullBytes = 0;		DerefOrNullBytes = 0;

if (B.AllocSizeArgs)		if (B.AllocSizeArgs)
AllocSizeArgs = 0;		AllocSizeArgs = 0;

if (B.ByValType)		if (B.ByValType)
ByValType = nullptr;		ByValType = nullptr;

		if (B.PreallocatedType)
		PreallocatedType = nullptr;

Attrs &= ~B.Attrs;		Attrs &= ~B.Attrs;

for (auto I : B.td_attrs())		for (auto I : B.td_attrs())
TargetDepAttrs.erase(I.first);		TargetDepAttrs.erase(I.first);

return *this;		return *this;
}		}

▲ Show 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	if (Attrs != B.Attrs)
return false;		return false;

for (td_const_iterator I = TargetDepAttrs.begin(),		for (td_const_iterator I = TargetDepAttrs.begin(),
E = TargetDepAttrs.end(); I != E; ++I)		E = TargetDepAttrs.end(); I != E; ++I)
if (B.TargetDepAttrs.find(I->first) == B.TargetDepAttrs.end())		if (B.TargetDepAttrs.find(I->first) == B.TargetDepAttrs.end())
return false;		return false;

return Alignment == B.Alignment && StackAlignment == B.StackAlignment &&		return Alignment == B.Alignment && StackAlignment == B.StackAlignment &&
DerefBytes == B.DerefBytes && ByValType == B.ByValType;		DerefBytes == B.DerefBytes && ByValType == B.ByValType &&
		PreallocatedType == B.PreallocatedType;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AttributeFuncs Function Defintions		// AttributeFuncs Function Defintions
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// Which attributes cannot be applied to a type.		/// Which attributes cannot be applied to a type.
AttrBuilder AttributeFuncs::typeIncompatible(Type *Ty) {		AttrBuilder AttributeFuncs::typeIncompatible(Type *Ty) {
AttrBuilder Incompatible;		AttrBuilder Incompatible;

if (!Ty->isIntegerTy())		if (!Ty->isIntegerTy())
// Attribute that only apply to integers.		// Attribute that only apply to integers.
Incompatible.addAttribute(Attribute::SExt)		Incompatible.addAttribute(Attribute::SExt)
.addAttribute(Attribute::ZExt);		.addAttribute(Attribute::ZExt);

if (!Ty->isPointerTy())		if (!Ty->isPointerTy())
// Attribute that only apply to pointers.		// Attribute that only apply to pointers.
Incompatible.addAttribute(Attribute::ByVal)		Incompatible.addAttribute(Attribute::Nest)
.addAttribute(Attribute::Nest)
.addAttribute(Attribute::NoAlias)		.addAttribute(Attribute::NoAlias)
.addAttribute(Attribute::NoCapture)		.addAttribute(Attribute::NoCapture)
.addAttribute(Attribute::NonNull)		.addAttribute(Attribute::NonNull)
.addDereferenceableAttr(1) // the int here is ignored		.addDereferenceableAttr(1) // the int here is ignored
.addDereferenceableOrNullAttr(1) // the int here is ignored		.addDereferenceableOrNullAttr(1) // the int here is ignored
.addAttribute(Attribute::ReadNone)		.addAttribute(Attribute::ReadNone)
.addAttribute(Attribute::ReadOnly)		.addAttribute(Attribute::ReadOnly)
.addAttribute(Attribute::StructRet)		.addAttribute(Attribute::StructRet)
.addAttribute(Attribute::InAlloca);		.addAttribute(Attribute::InAlloca)
		.addPreallocatedAttr(Ty)
		.addByValAttr(Ty);

return Incompatible;		return Incompatible;
}		}

template<typename AttrClass>		template<typename AttrClass>
static bool isEqual(const Function &Caller, const Function &Callee) {		static bool isEqual(const Function &Caller, const Function &Callee) {
return Caller.getFnAttribute(AttrClass::getKind()) ==		return Caller.getFnAttribute(AttrClass::getKind()) ==
Callee.getFnAttribute(AttrClass::getKind());		Callee.getFnAttribute(AttrClass::getKind());
▲ Show 20 Lines • Show All 176 Lines • Show Last 20 Lines

llvm/lib/IR/LLVMContext.cpp

Show First 20 Lines • Show All 62 Lines • ▼ Show 20 Lines	assert(GCTransitionEntry->second == LLVMContext::OB_gc_transition &&
"gc-transition operand bundle id drifted!");		"gc-transition operand bundle id drifted!");
(void)GCTransitionEntry;		(void)GCTransitionEntry;

auto *CFGuardTargetEntry = pImpl->getOrInsertBundleTag("cfguardtarget");		auto *CFGuardTargetEntry = pImpl->getOrInsertBundleTag("cfguardtarget");
assert(CFGuardTargetEntry->second == LLVMContext::OB_cfguardtarget &&		assert(CFGuardTargetEntry->second == LLVMContext::OB_cfguardtarget &&
"cfguardtarget operand bundle id drifted!");		"cfguardtarget operand bundle id drifted!");
(void)CFGuardTargetEntry;		(void)CFGuardTargetEntry;

		auto *PreallocatedEntry = pImpl->getOrInsertBundleTag("preallocated");
		assert(PreallocatedEntry->second == LLVMContext::OB_preallocated &&
		"preallocated operand bundle id drifted!");
		(void)PreallocatedEntry;

SyncScope::ID SingleThreadSSID =		SyncScope::ID SingleThreadSSID =
pImpl->getOrInsertSyncScopeID("singlethread");		pImpl->getOrInsertSyncScopeID("singlethread");
assert(SingleThreadSSID == SyncScope::SingleThread &&		assert(SingleThreadSSID == SyncScope::SingleThread &&
"singlethread synchronization scope ID drifted!");		"singlethread synchronization scope ID drifted!");
(void)SingleThreadSSID;		(void)SingleThreadSSID;

SyncScope::ID SystemSSID =		SyncScope::ID SystemSSID =
pImpl->getOrInsertSyncScopeID("");		pImpl->getOrInsertSyncScopeID("");
▲ Show 20 Lines • Show All 265 Lines • Show Last 20 Lines

llvm/lib/IR/Verifier.cpp

Show First 20 Lines • Show All 1,557 Lines • ▼ Show 20 Lines	static bool isFuncOnlyAttr(Attribute::AttrKind Kind) {
}		}
return false;		return false;
}		}

/// Return true if this is a function attribute that can also appear on		/// Return true if this is a function attribute that can also appear on
/// arguments.		/// arguments.
static bool isFuncOrArgAttr(Attribute::AttrKind Kind) {		static bool isFuncOrArgAttr(Attribute::AttrKind Kind) {
return Kind == Attribute::ReadOnly \|\| Kind == Attribute::WriteOnly \|\|		return Kind == Attribute::ReadOnly \|\| Kind == Attribute::WriteOnly \|\|
Kind == Attribute::ReadNone \|\| Kind == Attribute::NoFree;		Kind == Attribute::ReadNone \|\| Kind == Attribute::NoFree \|\|
		Kind == Attribute::Preallocated;
}		}

void Verifier::verifyAttributeTypes(AttributeSet Attrs, bool IsFunction,		void Verifier::verifyAttributeTypes(AttributeSet Attrs, bool IsFunction,
const Value *V) {		const Value *V) {
for (Attribute A : Attrs) {		for (Attribute A : Attrs) {
if (A.isStringAttribute())		if (A.isStringAttribute())
continue;		continue;

Show All 34 Lines	Assert(Attrs.getNumAttributes() == 1,
"Attribute 'immarg' is incompatible with other attributes", V);		"Attribute 'immarg' is incompatible with other attributes", V);
}		}

// Check for mutually incompatible attributes. Only inreg is compatible with		// Check for mutually incompatible attributes. Only inreg is compatible with
// sret.		// sret.
unsigned AttrCount = 0;		unsigned AttrCount = 0;
AttrCount += Attrs.hasAttribute(Attribute::ByVal);		AttrCount += Attrs.hasAttribute(Attribute::ByVal);
AttrCount += Attrs.hasAttribute(Attribute::InAlloca);		AttrCount += Attrs.hasAttribute(Attribute::InAlloca);
		AttrCount += Attrs.hasAttribute(Attribute::Preallocated);
AttrCount += Attrs.hasAttribute(Attribute::StructRet) \|\|		AttrCount += Attrs.hasAttribute(Attribute::StructRet) \|\|
Attrs.hasAttribute(Attribute::InReg);		Attrs.hasAttribute(Attribute::InReg);
AttrCount += Attrs.hasAttribute(Attribute::Nest);		AttrCount += Attrs.hasAttribute(Attribute::Nest);
Assert(AttrCount <= 1, "Attributes 'byval', 'inalloca', 'inreg', 'nest', "		Assert(AttrCount <= 1,
		"Attributes 'byval', 'inalloca', 'preallocated', 'inreg', 'nest', "
"and 'sret' are incompatible!",		"and 'sret' are incompatible!",
V);		V);

Assert(!(Attrs.hasAttribute(Attribute::InAlloca) &&		Assert(!(Attrs.hasAttribute(Attribute::InAlloca) &&
Attrs.hasAttribute(Attribute::ReadOnly)),		Attrs.hasAttribute(Attribute::ReadOnly)),
"Attributes "		"Attributes "
"'inalloca and readonly' are incompatible!",		"'inalloca and readonly' are incompatible!",
V);		V);

Show All 33 Lines	Assert(!(Attrs.hasAttribute(Attribute::NoInline) &&
"'noinline and alwaysinline' are incompatible!",		"'noinline and alwaysinline' are incompatible!",
V);		V);

if (Attrs.hasAttribute(Attribute::ByVal) && Attrs.getByValType()) {		if (Attrs.hasAttribute(Attribute::ByVal) && Attrs.getByValType()) {
Assert(Attrs.getByValType() == cast<PointerType>(Ty)->getElementType(),		Assert(Attrs.getByValType() == cast<PointerType>(Ty)->getElementType(),
"Attribute 'byval' type does not match parameter!", V);		"Attribute 'byval' type does not match parameter!", V);
}		}

		if (Attrs.hasAttribute(Attribute::Preallocated)) {
		Assert(Attrs.getPreallocatedType() ==
		efriedmaUnsubmitted Done Reply Inline Actions Do you need to check `Attrs.getPreallocatedType()->isSized()`, or something like that? Or is that checked elsewhere? efriedma: Do you need to check `Attrs.getPreallocatedType()->isSized()`, or something like that? Or is…
		aeubanksAuthorUnsubmitted Done Reply Inline Actions Piggybacked off existing checks below, thanks for catching. aeubanks: Piggybacked off existing checks below, thanks for catching.
		cast<PointerType>(Ty)->getElementType(),
		"Attribute 'preallocated' type does not match parameter!", V);
		}

AttrBuilder IncompatibleAttrs = AttributeFuncs::typeIncompatible(Ty);		AttrBuilder IncompatibleAttrs = AttributeFuncs::typeIncompatible(Ty);
Assert(!AttrBuilder(Attrs).overlaps(IncompatibleAttrs),		Assert(!AttrBuilder(Attrs).overlaps(IncompatibleAttrs),
"Wrong types for attribute: " +		"Wrong types for attribute: " +
AttributeSet::get(Context, IncompatibleAttrs).getAsString(),		AttributeSet::get(Context, IncompatibleAttrs).getAsString(),
V);		V);

if (PointerType *PTy = dyn_cast<PointerType>(Ty)) {		if (PointerType *PTy = dyn_cast<PointerType>(Ty)) {
SmallPtrSet<Type*, 4> Visited;		SmallPtrSet<Type*, 4> Visited;
if (!PTy->getElementType()->isSized(&Visited)) {		if (!PTy->getElementType()->isSized(&Visited)) {
Assert(!Attrs.hasAttribute(Attribute::ByVal) &&		Assert(!Attrs.hasAttribute(Attribute::ByVal) &&
!Attrs.hasAttribute(Attribute::InAlloca),		!Attrs.hasAttribute(Attribute::InAlloca) &&
"Attributes 'byval' and 'inalloca' do not support unsized types!",		!Attrs.hasAttribute(Attribute::Preallocated),
		"Attributes 'byval', 'inalloca', and 'preallocated' do not "
		"support unsized types!",
V);		V);
}		}
if (!isa<PointerType>(PTy->getElementType()))		if (!isa<PointerType>(PTy->getElementType()))
Assert(!Attrs.hasAttribute(Attribute::SwiftError),		Assert(!Attrs.hasAttribute(Attribute::SwiftError),
"Attribute 'swifterror' only applies to parameters "		"Attribute 'swifterror' only applies to parameters "
"with pointer to pointer type!",		"with pointer to pointer type!",
V);		V);
} else {		} else {
Show All 24 Lines	void Verifier::verifyFunctionAttrs(FunctionType *FT, AttributeList Attrs,
AttributeSet RetAttrs = Attrs.getRetAttributes();		AttributeSet RetAttrs = Attrs.getRetAttributes();
Assert((!RetAttrs.hasAttribute(Attribute::ByVal) &&		Assert((!RetAttrs.hasAttribute(Attribute::ByVal) &&
!RetAttrs.hasAttribute(Attribute::Nest) &&		!RetAttrs.hasAttribute(Attribute::Nest) &&
!RetAttrs.hasAttribute(Attribute::StructRet) &&		!RetAttrs.hasAttribute(Attribute::StructRet) &&
!RetAttrs.hasAttribute(Attribute::NoCapture) &&		!RetAttrs.hasAttribute(Attribute::NoCapture) &&
!RetAttrs.hasAttribute(Attribute::NoFree) &&		!RetAttrs.hasAttribute(Attribute::NoFree) &&
!RetAttrs.hasAttribute(Attribute::Returned) &&		!RetAttrs.hasAttribute(Attribute::Returned) &&
!RetAttrs.hasAttribute(Attribute::InAlloca) &&		!RetAttrs.hasAttribute(Attribute::InAlloca) &&
		!RetAttrs.hasAttribute(Attribute::Preallocated) &&
!RetAttrs.hasAttribute(Attribute::SwiftSelf) &&		!RetAttrs.hasAttribute(Attribute::SwiftSelf) &&
!RetAttrs.hasAttribute(Attribute::SwiftError)),		!RetAttrs.hasAttribute(Attribute::SwiftError)),
"Attributes 'byval', 'inalloca', 'nest', 'sret', 'nocapture', 'nofree'"		"Attributes 'byval', 'inalloca', 'preallocated', 'nest', 'sret', "
		"'nocapture', 'nofree', "
		efriedmaUnsubmitted Not Done Reply Inline Actions Missing comma efriedma: Missing comma
		aeubanksAuthorUnsubmitted Done Reply Inline Actions Done. aeubanks: Done.
"'returned', 'swiftself', and 'swifterror' do not apply to return "		"'returned', 'swiftself', and 'swifterror' do not apply to return "
"values!",		"values!",
V);		V);
Assert((!RetAttrs.hasAttribute(Attribute::ReadOnly) &&		Assert((!RetAttrs.hasAttribute(Attribute::ReadOnly) &&
!RetAttrs.hasAttribute(Attribute::WriteOnly) &&		!RetAttrs.hasAttribute(Attribute::WriteOnly) &&
!RetAttrs.hasAttribute(Attribute::ReadNone)),		!RetAttrs.hasAttribute(Attribute::ReadNone)),
"Attribute '" + RetAttrs.getAsString() +		"Attribute '" + RetAttrs.getAsString() +
"' does not apply to function returns",		"' does not apply to function returns",
▲ Show 20 Lines • Show All 1,165 Lines • ▼ Show 20 Lines	void Verifier::visitCallBase(CallBase &Call) {

if (Attrs.hasAttribute(AttributeList::FunctionIndex, Attribute::Speculatable)) {		if (Attrs.hasAttribute(AttributeList::FunctionIndex, Attribute::Speculatable)) {
// Don't allow speculatable on call sites, unless the underlying function		// Don't allow speculatable on call sites, unless the underlying function
// declaration is also speculatable.		// declaration is also speculatable.
Assert(Callee && Callee->isSpeculatable(),		Assert(Callee && Callee->isSpeculatable(),
"speculatable attribute may not apply to call sites", Call);		"speculatable attribute may not apply to call sites", Call);
}		}

		if (Attrs.hasAttribute(AttributeList::FunctionIndex,
		Attribute::Preallocated)) {
		Assert(Call.getCalledFunction()->getIntrinsicID() ==
		Intrinsic::call_preallocated_arg,
		"preallocated as a call site attribute can only be on "
		"llvm.call.preallocated.arg");
		}

// Verify call attributes.		// Verify call attributes.
verifyFunctionAttrs(FTy, Attrs, &Call, IsIntrinsic);		verifyFunctionAttrs(FTy, Attrs, &Call, IsIntrinsic);

// Conservatively check the inalloca argument.		// Conservatively check the inalloca argument.
// We have a bug if we can find that there is an underlying alloca without		// We have a bug if we can find that there is an underlying alloca without
// inalloca.		// inalloca.
if (Call.hasInAllocaArgument()) {		if (Call.hasInAllocaArgument()) {
Value *InAllocaArg = Call.getArgOperand(FTy->getNumParams() - 1);		Value *InAllocaArg = Call.getArgOperand(FTy->getNumParams() - 1);
Show All 30 Lines	if (Attrs.hasParamAttribute(i, Attribute::ImmArg)) {
Call.getArgOperand(i), Call);		Call.getArgOperand(i), Call);
}		}

if (Call.paramHasAttr(i, Attribute::ImmArg)) {		if (Call.paramHasAttr(i, Attribute::ImmArg)) {
Value *ArgVal = Call.getArgOperand(i);		Value *ArgVal = Call.getArgOperand(i);
Assert(isa<ConstantInt>(ArgVal) \|\| isa<ConstantFP>(ArgVal),		Assert(isa<ConstantInt>(ArgVal) \|\| isa<ConstantFP>(ArgVal),
"immarg operand has non-immediate parameter", ArgVal, Call);		"immarg operand has non-immediate parameter", ArgVal, Call);
}		}

		if (Call.paramHasAttr(i, Attribute::Preallocated)) {
		Value *ArgVal = Call.getArgOperand(i);
		Assert(Call.countOperandBundlesOfType(LLVMContext::OB_preallocated) != 0,
		"preallocated operand requires a preallocated bundle", ArgVal,
		Call);
		}
}		}

if (FTy->isVarArg()) {		if (FTy->isVarArg()) {
// FIXME? is 'nest' even legal here?		// FIXME? is 'nest' even legal here?
bool SawNest = false;		bool SawNest = false;
bool SawReturned = false;		bool SawReturned = false;

for (unsigned Idx = 0; Idx < FTy->getNumParams(); ++Idx) {		for (unsigned Idx = 0; Idx < FTy->getNumParams(); ++Idx) {
▲ Show 20 Lines • Show All 54 Lines • ▼ Show 20 Lines	if (!Call.getCalledFunction())
Assert(!FTy->getReturnType()->isTokenTy(),		Assert(!FTy->getReturnType()->isTokenTy(),
"Return type cannot be token for indirect call!");		"Return type cannot be token for indirect call!");

if (Function *F = Call.getCalledFunction())		if (Function *F = Call.getCalledFunction())
if (Intrinsic::ID ID = (Intrinsic::ID)F->getIntrinsicID())		if (Intrinsic::ID ID = (Intrinsic::ID)F->getIntrinsicID())
visitIntrinsicCall(ID, Call);		visitIntrinsicCall(ID, Call);

// Verify that a callsite has at most one "deopt", at most one "funclet", at		// Verify that a callsite has at most one "deopt", at most one "funclet", at
// most one "gc-transition", and at most one "cfguardtarget" operand bundle.		// most one "gc-transition", at most one "cfguardtarget",
		// and at most one "preallocated" operand bundle.
bool FoundDeoptBundle = false, FoundFuncletBundle = false,		bool FoundDeoptBundle = false, FoundFuncletBundle = false,
FoundGCTransitionBundle = false, FoundCFGuardTargetBundle = false;		FoundGCTransitionBundle = false, FoundCFGuardTargetBundle = false,
		FoundPreallocatedBundle = false;
for (unsigned i = 0, e = Call.getNumOperandBundles(); i < e; ++i) {		for (unsigned i = 0, e = Call.getNumOperandBundles(); i < e; ++i) {
OperandBundleUse BU = Call.getOperandBundleAt(i);		OperandBundleUse BU = Call.getOperandBundleAt(i);
uint32_t Tag = BU.getTagID();		uint32_t Tag = BU.getTagID();
if (Tag == LLVMContext::OB_deopt) {		if (Tag == LLVMContext::OB_deopt) {
Assert(!FoundDeoptBundle, "Multiple deopt operand bundles", Call);		Assert(!FoundDeoptBundle, "Multiple deopt operand bundles", Call);
FoundDeoptBundle = true;		FoundDeoptBundle = true;
} else if (Tag == LLVMContext::OB_gc_transition) {		} else if (Tag == LLVMContext::OB_gc_transition) {
Assert(!FoundGCTransitionBundle, "Multiple gc-transition operand bundles",		Assert(!FoundGCTransitionBundle, "Multiple gc-transition operand bundles",
Call);		Call);
FoundGCTransitionBundle = true;		FoundGCTransitionBundle = true;
} else if (Tag == LLVMContext::OB_funclet) {		} else if (Tag == LLVMContext::OB_funclet) {
Assert(!FoundFuncletBundle, "Multiple funclet operand bundles", Call);		Assert(!FoundFuncletBundle, "Multiple funclet operand bundles", Call);
FoundFuncletBundle = true;		FoundFuncletBundle = true;
Assert(BU.Inputs.size() == 1,		Assert(BU.Inputs.size() == 1,
"Expected exactly one funclet bundle operand", Call);		"Expected exactly one funclet bundle operand", Call);
Assert(isa<FuncletPadInst>(BU.Inputs.front()),		Assert(isa<FuncletPadInst>(BU.Inputs.front()),
"Funclet bundle operands should correspond to a FuncletPadInst",		"Funclet bundle operands should correspond to a FuncletPadInst",
Call);		Call);
} else if (Tag == LLVMContext::OB_cfguardtarget) {		} else if (Tag == LLVMContext::OB_cfguardtarget) {
Assert(!FoundCFGuardTargetBundle,		Assert(!FoundCFGuardTargetBundle,
"Multiple CFGuardTarget operand bundles", Call);		"Multiple CFGuardTarget operand bundles", Call);
FoundCFGuardTargetBundle = true;		FoundCFGuardTargetBundle = true;
Assert(BU.Inputs.size() == 1,		Assert(BU.Inputs.size() == 1,
"Expected exactly one cfguardtarget bundle operand", Call);		"Expected exactly one cfguardtarget bundle operand", Call);
		} else if (Tag == LLVMContext::OB_preallocated) {
		Assert(!FoundPreallocatedBundle, "Multiple preallocated operand bundles",
		Call);
		FoundPreallocatedBundle = true;
		Assert(BU.Inputs.size() == 1,
		"Expected exactly one preallocated bundle operand", Call);
		rnkUnsubmitted Done Reply Inline Actions You should add some tests for these checks. rnk: You should add some tests for these checks.
		auto Input = dyn_cast<IntrinsicInst>(BU.Inputs.front());
		Assert(Input &&
		Input->getIntrinsicID() == Intrinsic::call_preallocated_setup,
		"\"preallocated\" argument must be a token from "
		"llvm.call.preallocated.setup",
		rnkUnsubmitted Done Reply Inline Actions The verifier shouldn't reject unknown bundles, the idea is to allow other bundles as an extension point. rnk: The verifier shouldn't reject unknown bundles, the idea is to allow other bundles as an…
		aeubanksAuthorUnsubmitted Done Reply Inline Actions Done. I had (and still have) no idea how "callsetup" was getting read as a LLVMContext::OB_callsetup so I wanted to make sure that this branch was actually taken. aeubanks: Done. I had (and still have) no idea how "callsetup" was getting read as a LLVMContext…
		Call);
}		}
}		}

// Verify that each inlinable callsite of a debug-info-bearing function in a		// Verify that each inlinable callsite of a debug-info-bearing function in a
// debug-info-bearing function has a debug location attached to it. Failure to		// debug-info-bearing function has a debug location attached to it. Failure to
// do so causes assertion failures when the inliner sets up inline scope info.		// do so causes assertion failures when the inliner sets up inline scope info.
if (Call.getFunction()->getSubprogram() && Call.getCalledFunction() &&		if (Call.getFunction()->getSubprogram() && Call.getCalledFunction() &&
Call.getCalledFunction()->getSubprogram())		Call.getCalledFunction()->getSubprogram())
Show All 14 Lines	static bool isTypeCongruent(Type L, Type R) {
PointerType *PR = dyn_cast<PointerType>(R);		PointerType *PR = dyn_cast<PointerType>(R);
if (!PL \|\| !PR)		if (!PL \|\| !PR)
return false;		return false;
return PL->getAddressSpace() == PR->getAddressSpace();		return PL->getAddressSpace() == PR->getAddressSpace();
}		}

static AttrBuilder getParameterABIAttributes(int I, AttributeList Attrs) {		static AttrBuilder getParameterABIAttributes(int I, AttributeList Attrs) {
static const Attribute::AttrKind ABIAttrs[] = {		static const Attribute::AttrKind ABIAttrs[] = {
Attribute::StructRet, Attribute::ByVal, Attribute::InAlloca,		Attribute::StructRet, Attribute::ByVal, Attribute::InAlloca,
Attribute::InReg, Attribute::SwiftSelf, Attribute::SwiftError};		Attribute::InReg, Attribute::SwiftSelf, Attribute::SwiftError,
		Attribute::Preallocated};
AttrBuilder Copy;		AttrBuilder Copy;
for (auto AK : ABIAttrs) {		for (auto AK : ABIAttrs) {
if (Attrs.hasParamAttribute(I, AK))		if (Attrs.hasParamAttribute(I, AK))
Copy.addAttribute(AK);		Copy.addAttribute(AK);
}		}
// `align` is ABI-affecting only in combination with `byval`.		// `align` is ABI-affecting only in combination with `byval`.
if (Attrs.hasParamAttribute(I, Attribute::Alignment) &&		if (Attrs.hasParamAttribute(I, Attribute::Alignment) &&
Attrs.hasParamAttribute(I, Attribute::ByVal))		Attrs.hasParamAttribute(I, Attribute::ByVal))
Show All 25 Lines	void Verifier::verifyMustTailCall(CallInst &CI) {
Assert(isTypeCongruent(CallerTy->getReturnType(), CalleeTy->getReturnType()),		Assert(isTypeCongruent(CallerTy->getReturnType(), CalleeTy->getReturnType()),
"cannot guarantee tail call due to mismatched return types", &CI);		"cannot guarantee tail call due to mismatched return types", &CI);

// - The calling conventions of the caller and callee must match.		// - The calling conventions of the caller and callee must match.
Assert(F->getCallingConv() == CI.getCallingConv(),		Assert(F->getCallingConv() == CI.getCallingConv(),
"cannot guarantee tail call due to mismatched calling conv", &CI);		"cannot guarantee tail call due to mismatched calling conv", &CI);

// - All ABI-impacting function attributes, such as sret, byval, inreg,		// - All ABI-impacting function attributes, such as sret, byval, inreg,
// returned, and inalloca, must match.		// returned, preallocated, and inalloca, must match.
AttributeList CallerAttrs = F->getAttributes();		AttributeList CallerAttrs = F->getAttributes();
AttributeList CalleeAttrs = CI.getAttributes();		AttributeList CalleeAttrs = CI.getAttributes();
for (int I = 0, E = CallerTy->getNumParams(); I != E; ++I) {		for (int I = 0, E = CallerTy->getNumParams(); I != E; ++I) {
AttrBuilder CallerABIAttrs = getParameterABIAttributes(I, CallerAttrs);		AttrBuilder CallerABIAttrs = getParameterABIAttributes(I, CallerAttrs);
AttrBuilder CalleeABIAttrs = getParameterABIAttributes(I, CalleeAttrs);		AttrBuilder CalleeABIAttrs = getParameterABIAttributes(I, CalleeAttrs);
Assert(CallerABIAttrs == CalleeABIAttrs,		Assert(CallerABIAttrs == CalleeABIAttrs,
"cannot guarantee tail call due to mismatched ABI impacting "		"cannot guarantee tail call due to mismatched ABI impacting "
"function attributes",		"function attributes",
▲ Show 20 Lines • Show All 1,295 Lines • ▼ Show 20 Lines	Assert(IsValidAlignment(DstAlignment),
"incorrect alignment of the destination argument", Call);		"incorrect alignment of the destination argument", Call);
if (const auto *AMT = dyn_cast<AtomicMemTransferInst>(AMI)) {		if (const auto *AMT = dyn_cast<AtomicMemTransferInst>(AMI)) {
uint64_t SrcAlignment = AMT->getSourceAlignment();		uint64_t SrcAlignment = AMT->getSourceAlignment();
Assert(IsValidAlignment(SrcAlignment),		Assert(IsValidAlignment(SrcAlignment),
"incorrect alignment of the source argument", Call);		"incorrect alignment of the source argument", Call);
}		}
break;		break;
}		}
		case Intrinsic::call_preallocated_setup: {
		auto *NumArgs = dyn_cast<ConstantInt>(Call.getArgOperand(0));
		Assert(NumArgs != nullptr,
		"llvm.call.preallocated.setup argument must be a constant");
		bool FoundCall = false;
		for (User *U : Call.users()) {
		auto *UseCall = dyn_cast<CallBase>(U);
		Assert(UseCall != nullptr,
		"Uses of llvm.call.preallocated.setup must be calls");
		const Function *Fn = UseCall->getCalledFunction();
		if (Fn->getIntrinsicID() == Intrinsic::call_preallocated_arg) {
		auto *AllocArgIndex = dyn_cast<ConstantInt>(UseCall->getArgOperand(1));
		rnkUnsubmitted Done Reply Inline Actions This is the right idea, but I'd make APInt locals for the getValue() results to make it shorter. rnk: This is the right idea, but I'd make APInt locals for the getValue() results to make it shorter.
		Assert(AllocArgIndex != nullptr,
		"llvm.call.preallocated.alloc arg index must be a constant");
		auto AllocArgIndexInt = AllocArgIndex->getValue();
		Assert(AllocArgIndexInt.sge(0) &&
		AllocArgIndexInt.slt(NumArgs->getValue()),
		"llvm.call.preallocated.alloc arg index must be between 0 and "
		"corresponding "
		"llvm.call.preallocated.setup's argument count");
		rnkUnsubmitted Done Reply Inline Actions I guess I'd try to do this with less conditionals: auto CallSetupBundle = ... Assert(CallSetupBundle, "using call site should have a call.setup bundle"); Assert(CallSetupBundle->Inputs.front().get() == &Call, ...); rnk: I guess I'd try to do this with less conditionals: auto CallSetupBundle = ... Assert…
		} else {
		Assert(!FoundCall, "Can have at most one call corresponding to a "
		"llvm.call.preallocated.setup");
		FoundCall = true;
		size_t NumPreallocatedArgs = 0;
		for (auto &Arg : Fn->args()) {
		if (Arg.hasAttribute(Attribute::Preallocated)) {
		++NumPreallocatedArgs;
		}
		}
		Assert(NumArgs->equalsInt(NumPreallocatedArgs),
		"llvm.call.preallocated.setup arg size must be equal to number "
		"of arguments "
		"at call site");
		// getOperandBundle() cannot be called if more than one of the operand
		// bundle exists. There is already a check elsewhere for this, so skip
		// here if we see more than one.
		if (UseCall->countOperandBundlesOfType(LLVMContext::OB_preallocated) >
		1) {
		return;
		}
		auto PreallocatedBundle =
		efriedmaUnsubmitted Done Reply Inline Actions "Prelalocated"? efriedma: "Prelalocated"?
		aeubanksAuthorUnsubmitted Done Reply Inline Actions Done. aeubanks: Done.
		UseCall->getOperandBundle(LLVMContext::OB_preallocated);
		Assert(PreallocatedBundle,
		"Use of llvm.call.preallocated.setup outside intrinsics "
		"must be in \"preallocated\" operand bundle");
		Assert(PreallocatedBundle->Inputs.front().get() == &Call,
		"preallocated bundle must have token from corresponding "
		"llvm.call.preallocated.setup");
		}
		}
		break;
		}
		case Intrinsic::call_preallocated_arg: {
		auto *Token = dyn_cast<CallBase>(Call.getArgOperand(0));
		Assert(Token && Token->getCalledFunction()->getIntrinsicID() ==
		Intrinsic::call_preallocated_setup,
		"llvm.call.preallocated.arg token argument must be a "
		"llvm.call.preallocated.setup");
		Assert(Call.hasFnAttr(Attribute::Preallocated),
		"llvm.call.preallocated.arg must be called with a \"preallocated\" "
		"call site attribute");
		break;
		}
case Intrinsic::gcroot:		case Intrinsic::gcroot:
		efriedmaUnsubmitted Done Reply Inline Actions Do you need to check that there aren't any calls to llvm.call.preallocated.arg with the wrong token? efriedma: Do you need to check that there aren't any calls to llvm.call.preallocated.arg with the wrong…
		aeubanksAuthorUnsubmitted Done Reply Inline Actions Good catch, I missed that. Done and added test case. aeubanks: Good catch, I missed that. Done and added test case.
case Intrinsic::gcwrite:		case Intrinsic::gcwrite:
case Intrinsic::gcread:		case Intrinsic::gcread:
if (ID == Intrinsic::gcroot) {		if (ID == Intrinsic::gcroot) {
AllocaInst *AI =		AllocaInst *AI =
dyn_cast<AllocaInst>(Call.getArgOperand(0)->stripPointerCasts());		dyn_cast<AllocaInst>(Call.getArgOperand(0)->stripPointerCasts());
Assert(AI, "llvm.gcroot parameter #1 must be an alloca.", Call);		Assert(AI, "llvm.gcroot parameter #1 must be an alloca.", Call);
Assert(isa<Constant>(Call.getArgOperand(1)),		Assert(isa<Constant>(Call.getArgOperand(1)),
"llvm.gcroot parameter #2 must be a constant.", Call);		"llvm.gcroot parameter #2 must be a constant.", Call);
▲ Show 20 Lines • Show All 1,202 Lines • Show Last 20 Lines

llvm/lib/Transforms/Utils/CodeExtractor.cpp

Show First 20 Lines • Show All 872 Lines • ▼ Show 20 Lines	if (Attr.isStringAttribute()) {
case Attribute::Nest:		case Attribute::Nest:
case Attribute::NoAlias:		case Attribute::NoAlias:
case Attribute::NoBuiltin:		case Attribute::NoBuiltin:
case Attribute::NoCapture:		case Attribute::NoCapture:
case Attribute::NoReturn:		case Attribute::NoReturn:
case Attribute::NoSync:		case Attribute::NoSync:
case Attribute::None:		case Attribute::None:
case Attribute::NonNull:		case Attribute::NonNull:
		case Attribute::Preallocated:
case Attribute::ReadNone:		case Attribute::ReadNone:
case Attribute::ReadOnly:		case Attribute::ReadOnly:
case Attribute::Returned:		case Attribute::Returned:
case Attribute::ReturnsTwice:		case Attribute::ReturnsTwice:
case Attribute::SExt:		case Attribute::SExt:
case Attribute::Speculatable:		case Attribute::Speculatable:
case Attribute::StackAlignment:		case Attribute::StackAlignment:
case Attribute::StructRet:		case Attribute::StructRet:
▲ Show 20 Lines • Show All 868 Lines • Show Last 20 Lines

llvm/test/Assembler/invalid-byval-type3.ll

	; RUN: not llvm-as %s -o /dev/null 2>&1 \| FileCheck %s			; RUN: not llvm-as %s -o /dev/null 2>&1 \| FileCheck %s

	; CHECK: Attributes 'byval' and 'inalloca' do not support unsized types!			; CHECK: Attributes 'byval'{{.*}} do not support unsized types!
	declare void @foo(void()* byval(void()))			declare void @foo(void()* byval(void()))

llvm/test/Bitcode/attributes.ll

	Show First 20 Lines • Show All 365 Lines • ▼ Show 20 Lines
	define void @f62() nosync			define void @f62() nosync
	{			{
	ret void			ret void
	}			}

	; CHECK: define void @f63() #39			; CHECK: define void @f63() #39
	define void @f63() sanitize_memtag			define void @f63() sanitize_memtag
	{			{
	ret void;			ret void
				}

				; CHECK: define void @f64(i32* preallocated(i32) %a)
				define void @f64(i32* preallocated(i32) %a)
				{
				ret void
	}			}

	; CHECK: attributes #0 = { noreturn }			; CHECK: attributes #0 = { noreturn }
	; CHECK: attributes #1 = { nounwind }			; CHECK: attributes #1 = { nounwind }
	; CHECK: attributes #2 = { readnone }			; CHECK: attributes #2 = { readnone }
	; CHECK: attributes #3 = { readonly }			; CHECK: attributes #3 = { readonly }
	; CHECK: attributes #4 = { noinline }			; CHECK: attributes #4 = { noinline }
	; CHECK: attributes #5 = { alwaysinline }			; CHECK: attributes #5 = { alwaysinline }
	Show All 35 Lines

llvm/test/Bitcode/operand-bundles-bc-analyzer.ll

	; RUN: llvm-as < %s \| llvm-bcanalyzer -dump -disable-histogram \| FileCheck %s			; RUN: llvm-as < %s \| llvm-bcanalyzer -dump -disable-histogram \| FileCheck %s

	; CHECK: <OPERAND_BUNDLE_TAGS_BLOCK			; CHECK: <OPERAND_BUNDLE_TAGS_BLOCK
	; CHECK-NEXT: <OPERAND_BUNDLE_TAG			; CHECK-NEXT: <OPERAND_BUNDLE_TAG
	; CHECK-NEXT: <OPERAND_BUNDLE_TAG			; CHECK-NEXT: <OPERAND_BUNDLE_TAG
	; CHECK-NEXT: <OPERAND_BUNDLE_TAG			; CHECK-NEXT: <OPERAND_BUNDLE_TAG
	; CHECK-NEXT: <OPERAND_BUNDLE_TAG			; CHECK-NEXT: <OPERAND_BUNDLE_TAG
	; CHECK-NEXT: <OPERAND_BUNDLE_TAG			; CHECK-NEXT: <OPERAND_BUNDLE_TAG
	; CHECK-NEXT: <OPERAND_BUNDLE_TAG			; CHECK-NEXT: <OPERAND_BUNDLE_TAG
				; CHECK-NEXT: <OPERAND_BUNDLE_TAG
	; CHECK-NEXT: </OPERAND_BUNDLE_TAGS_BLOCK			; CHECK-NEXT: </OPERAND_BUNDLE_TAGS_BLOCK

	; CHECK: <FUNCTION_BLOCK			; CHECK: <FUNCTION_BLOCK
	; CHECK: <OPERAND_BUNDLE			; CHECK: <OPERAND_BUNDLE
	; CHECK: <OPERAND_BUNDLE			; CHECK: <OPERAND_BUNDLE
	; CHECK-NOT: <OPERAND_BUNDLE			; CHECK-NOT: <OPERAND_BUNDLE
	; CHECK: </FUNCTION_BLOCK			; CHECK: </FUNCTION_BLOCK

	Show All 11 Lines

llvm/test/Verifier/preallocated-invalid.ll

This file was added.

				; RUN: not opt -S %s -verify 2>&1 \| FileCheck %s

				declare token @llvm.call.preallocated.setup(i32)
				declare i8* @llvm.call.preallocated.arg(token, i32)

				; Fake LLVM intrinsic to return a token
				declare token @llvm.what()

				declare void @foo0()
				declare void @foo1(i32* preallocated(i32))
				declare void @foo2(i32* preallocated(i32), i32, i32 preallocated(i32))
				declare i32 @blackbox()

				; CHECK: llvm.call.preallocated.arg must be called with a "preallocated" call site attribute
				define void @preallocated_arg_missing_preallocated_attribute() {
				%cs = call token @llvm.call.preallocated.setup(i32 1)
				%x = call i8* @llvm.call.preallocated.arg(token %cs, i32 0)
				%y = bitcast i8* %x to i32*
				call void @foo1(i32* preallocated(i32) %y) ["preallocated"(token %cs)]
				ret void
				}

				; CHECK: preallocated as a call site attribute can only be on llvm.call.preallocated.arg
				define void @preallocated_call_site_attribute_not_on_arg() {
				call void @foo0() preallocated(i32)
				ret void
				}

				; CHECK: "preallocated" argument must be a token from llvm.call.preallocated.setup
				define void @preallocated_bundle_token() {
				%i = call i32 @blackbox()
				call void @foo0() ["preallocated"(i32 %i)]
				ret void
				}

				; CHECK: "preallocated" argument must be a token from llvm.call.preallocated.setup
				define void @preallocated_bundle_token_from_setup() {
				%cs = call token @llvm.what()
				call void @foo0() ["preallocated"(token %cs)]
				ret void
				}

				; CHECK: Expected exactly one preallocated bundle operand
				define void @preallocated_bundle_one_token() {
				%cs0 = call token @llvm.call.preallocated.setup(i32 0)
				%cs1 = call token @llvm.call.preallocated.setup(i32 0)
				call void @foo0() ["preallocated"(token %cs0, token %cs1)]
				ret void
				}

				; CHECK: Multiple preallocated operand bundles
				define void @preallocated_multiple_bundles() {
				%cs0 = call token @llvm.call.preallocated.setup(i32 0)
				%cs1 = call token @llvm.call.preallocated.setup(i32 0)
				call void @foo0() ["preallocated"(token %cs0), "preallocated"(token %cs1)]
				ret void
				}

				; CHECK: Can have at most one call
				define void @preallocated_one_call() {
				%cs = call token @llvm.call.preallocated.setup(i32 1)
				%x = call i8* @llvm.call.preallocated.arg(token %cs, i32 0) preallocated(i32)
				%y = bitcast i8* %x to i32*
				call void @foo1(i32* preallocated(i32) %y) ["preallocated"(token %cs)]
				call void @foo1(i32* preallocated(i32) %y) ["preallocated"(token %cs)]
				ret void
				}

				; CHECK: must be a constant
				define void @preallocated_setup_constant() {
				%ac = call i32 @blackbox()
				%cs = call token @llvm.call.preallocated.setup(i32 %ac)
				ret void
				}

				; CHECK: must be between 0 and corresponding
				define void @preallocated_setup_arg_index_in_bounds() {
				%cs = call token @llvm.call.preallocated.setup(i32 2)
				%a0 = call i8* @llvm.call.preallocated.arg(token %cs, i32 2) preallocated(i32)
				ret void
				}

				; CHECK: Attribute 'preallocated' type does not match parameter
				define void @preallocated_attribute_type_mismatch() {
				%cs = call token @llvm.call.preallocated.setup(i32 1)
				%x = call i8* @llvm.call.preallocated.arg(token %cs, i32 0) preallocated(i32)
				%y = bitcast i8* %x to i32*
				call void @foo1(i32* preallocated(i8) %y) ["preallocated"(token %cs)]
				ret void
				}

				; CHECK: preallocated operand requires a preallocated bundle
				define void @preallocated_require_bundle() {
				%cs = call token @llvm.call.preallocated.setup(i32 1)
				%x = call i8* @llvm.call.preallocated.arg(token %cs, i32 0) preallocated(i32)
				%y = bitcast i8* %x to i32*
				call void @foo1(i32* preallocated(i32) %y)
				ret void
				}

				; CHECK: arg size must be equal to number of arguments
				define void @preallocated_num_args() {
				%cs = call token @llvm.call.preallocated.setup(i32 3)
				%x = call i8* @llvm.call.preallocated.arg(token %cs, i32 0) preallocated(i32)
				%x1 = bitcast i8* %x to i32*
				%y = call i8* @llvm.call.preallocated.arg(token %cs, i32 1) preallocated(i32)
				%y1 = bitcast i8* %y to i32*
				%a = inttoptr i32 0 to i32*
				call void @foo2(i32* preallocated(i32) %x1, i32* %a, i32* preallocated(i32) %y1) ["preallocated"(token %cs)]
				ret void
				}

				; CHECK: token argument must be a llvm.call.preallocated.setup
				define void @preallocated_arg_token() {
				%t = call token @llvm.what()
				%x = call i8* @llvm.call.preallocated.arg(token %t, i32 1) preallocated(i32)
				ret void
				}

llvm/test/Verifier/preallocated-valid.ll

This file was added.

				; RUN: opt -S %s -verify

				declare token @llvm.call.preallocated.setup(i32)
				declare i8* @llvm.call.preallocated.arg(token, i32)

				declare void @foo1(i32* preallocated(i32))
				declare void @foo2(i32* preallocated(i32), i32, i32 preallocated(i32))

				define void @preallocated() {
				%cs = call token @llvm.call.preallocated.setup(i32 1)
				%x = call i8* @llvm.call.preallocated.arg(token %cs, i32 0) preallocated(i32)
				%y = bitcast i8* %x to i32*
				call void @foo1(i32* preallocated(i32) %y) ["preallocated"(token %cs)]
				ret void
				}

				define void @preallocated_setup_without_call() {
				%cs = call token @llvm.call.preallocated.setup(i32 1)
				%a0 = call i8* @llvm.call.preallocated.arg(token %cs, i32 0) preallocated(i32)
				ret void
				}

				define void @preallocated_num_args() {
				%cs = call token @llvm.call.preallocated.setup(i32 2)
				%x = call i8* @llvm.call.preallocated.arg(token %cs, i32 0) preallocated(i32)
				%x1 = bitcast i8* %x to i32*
				%y = call i8* @llvm.call.preallocated.arg(token %cs, i32 1) preallocated(i32)
				%y1 = bitcast i8* %y to i32*
				%a = inttoptr i32 0 to i32*
				call void @foo2(i32* preallocated(i32) %x1, i32* %a, i32* preallocated(i32) %y1) ["preallocated"(token %cs)]
				ret void
				}

This is an archive of the discontinued LLVM Phabricator instance.

Add IR constructs for inalloca replacement preallocated call setupClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 260498

llvm/docs/LangRef.rst

llvm/include/llvm/Bitcode/LLVMBitCodes.h

llvm/include/llvm/IR/Attributes.h

llvm/include/llvm/IR/Attributes.td

llvm/include/llvm/IR/Intrinsics.td

llvm/include/llvm/IR/LLVMContext.h

llvm/lib/AsmParser/LLLexer.cpp

llvm/lib/AsmParser/LLParser.h

llvm/lib/AsmParser/LLParser.cpp

llvm/lib/AsmParser/LLToken.h

llvm/lib/Bitcode/Reader/BitcodeReader.cpp

llvm/lib/Bitcode/Writer/BitcodeWriter.cpp

llvm/lib/IR/AsmWriter.cpp

llvm/lib/IR/AttributeImpl.h

llvm/lib/IR/Attributes.cpp

llvm/lib/IR/LLVMContext.cpp

llvm/lib/IR/Verifier.cpp

llvm/lib/Transforms/Utils/CodeExtractor.cpp

llvm/test/Assembler/invalid-byval-type3.ll

llvm/test/Bitcode/attributes.ll

llvm/test/Bitcode/operand-bundles-bc-analyzer.ll

llvm/test/Verifier/preallocated-invalid.ll

llvm/test/Verifier/preallocated-valid.ll

Add IR constructs for inalloca replacement preallocated call setup
ClosedPublic