This is an archive of the discontinued LLVM Phabricator instance.

Allow DataLayout to specify addrspace for allocas.
ClosedPublic

Authored by arsenm on Mar 16 2017, 10:54 AM.

Details

Reviewers

hfinkel
efriedma

Summary

LLVM makes several assumptions about address space 0. However,
alloca is presently constrained to always return this address space.
There's no real way to avoid using alloca, so without this
there is no way to opt out of these assumptions.

The problematic assumptions include:

That the pointer size used for the stack is the same size as the code size pointer, which is also the maximum sized pointer.

That 0 is an invalid, non-dereferenceable pointer value.

These are problems for AMDGPU because alloca is used to
implement the private address space, which uses a 32-bit
index as the pointer value. Other pointers are 64-bit
and behave more like LLVM's notion of generic address
space. By changing the address space used for allocas,
we can change our generic pointer type to be LLVM's generic
pointer type which does have similar properties.
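
To make the summary concrete, here is a minimal sketch of how a tool might read the alloca address space out of a datalayout string. It assumes the `A<n>` component spelling that LangRef uses for this setting. The function name `allocaAddrSpace` and the example layout strings are hypothetical, and this is a toy parser, not LLVM's own `DataLayout` implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Toy parser (not LLVM's DataLayout class): returns the address space named
// by an "A<n>" component of a '-'-separated datalayout string, or 0 if the
// string has no such component.
unsigned allocaAddrSpace(const std::string &Layout) {
  std::size_t Pos = 0;
  while (Pos <= Layout.size()) {
    std::size_t End = Layout.find('-', Pos);
    if (End == std::string::npos)
      End = Layout.size();
    const std::string Spec = Layout.substr(Pos, End - Pos);
    // Accept only an uppercase 'A' followed by decimal digits; lowercase 'a'
    // components (aggregate alignment) are deliberately ignored.
    if (Spec.size() > 1 && Spec[0] == 'A' &&
        Spec.find_first_not_of("0123456789", 1) == std::string::npos)
      return static_cast<unsigned>(std::stoul(Spec.substr(1)));
    Pos = End + 1;
  }
  return 0; // no "A<n>" component: allocas stay in address space 0
}
```

With an illustrative layout such as `e-p:64:64-p5:32:32-A5` the function returns 5. A layout without an `A` component yields 0, which corresponds to the existing behaviour the summary describes.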

Event Timeline

arsenm created this revision. Mar 16 2017, 10:54 AM

Herald added subscribers: tpr, wdng, jholewinski. Mar 16 2017, 10:54 AM

arsenm added a child revision: D31044: Update for alloca construction changes. Mar 16 2017, 10:56 AM

Print as , addrspace(num), but require it match the datalayout

arsenm added reviewers: efriedma, hfinkel. Mar 16 2017, 3:52 PM

efriedma added inline comments. Mar 16 2017, 4:10 PM

include/llvm/IR/IRBuilder.h
1098	I'm not sure this overload is a good idea; it saves a little typing, but it hides the reason we need the datalayout.

arsenm added inline comments. Mar 16 2017, 4:50 PM

include/llvm/IR/IRBuilder.h
1098	I'm not sure that's a problem here. This is the most common version, and the point of IRBuilder is to be the convenient way of creating IR. A DataLayout argument is less error prone than an unsigned argument, where it is easier to accidentally pass the alignment or something else. It would also be possible to hide this further by getting the parent module's DataLayout from the insertion block (which is what the CHERI patches did). However, there are already functions using DataLayout input arguments, so I figured there might be some reason to pass it in rather than doing that.

efriedma added inline comments. Mar 16 2017, 5:07 PM

include/llvm/IR/IRBuilder.h
1098	I guess. The part that concerns me is that the type of the result might not be what someone expects if they aren't paying attention, and the resulting assertion failure is a bit unfriendly... but I guess we can go with this for now, and see if it ends up being confusing in practice. Getting the DataLayout from the insertion block doesn't work for all methods on IRBuilder; an IRBuilder isn't guaranteed to always have an insertion point. This can be used, for example, if you're building a constant. I guess for alloca in particular, it would be safe to assume we have an insertion point.
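
The trade-off discussed in these inline comments can be sketched with a small stand-in model. Everything below is hypothetical (the struct names, `Builder`, and both `createAlloca` overloads are illustrations, not LLVM's real classes or signatures). It only shows why an overload that looks up the data layout through the insertion block is convenient, and why it depends on an insertion point being set.

```cpp
#include <cassert>

// Hypothetical stand-ins for illustration only; these are not LLVM's classes.
struct DataLayout { unsigned AllocaAddrSpace = 0; };
struct Module { DataLayout DL; };
struct Function { Module *Parent; };
struct BasicBlock { Function *Parent; };
struct PointerType { unsigned AddrSpace; };

class Builder {
  BasicBlock *InsertBB = nullptr; // a builder is allowed to have no insertion point

public:
  void setInsertPoint(BasicBlock *BB) { InsertBB = BB; }
  bool hasInsertPoint() const { return InsertBB != nullptr; }

  // Explicit form: the caller passes the data layout, so the source of the
  // result's address space is visible at the call site.
  PointerType createAlloca(const DataLayout &DL) const {
    return PointerType{DL.AllocaAddrSpace};
  }

  // Convenience form: walks block -> function -> module to find the data
  // layout. This only works once an insertion point has been set.
  PointerType createAlloca() const {
    assert(InsertBB && "alloca needs an insertion point");
    return createAlloca(InsertBB->Parent->Parent->DL);
  }
};
```

In this model, a builder used without an insertion point (for example, to build constants) can still exist, but only the explicit overload is usable on it.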

ping

LGTM except for a minor comment about the text in LangRef.

docs/LangRef.rst
1816	"the address space of the result for the stack" is a little unclear.

arsenm added inline comments. Mar 23 2017, 11:22 AM

docs/LangRef.rst
1816	I wasn't sure about referring to the stack at all. Should I rename everything added that says stack to say alloca instead (e.g. DL.getStackAddrSpace() -> DL.getAllocaAddrSpace())?

efriedma added inline comments. Mar 23 2017, 11:47 AM

docs/LangRef.rst
1816	Well, in many contexts, it isn't precisely clear what the stack refers to... and there's a bunch of other stuff conventionally stored on the stack which could be stored elsewhere. Maybe better to just call it the address-space of allocas, I guess? Even assuming the meaning of "the stack" is clear, the sentence is missing something: it doesn't mention allocation anywhere.

Change LangRef description, don't call stack

LGTM, but I'd like to see a second approval for an IR change like this.

What else in LLVM actually hard-codes 0 as the assumed address space? Intrinsic function declarations? Optimizations that create new globals (if there are any?) Wouldn't it be much less invasive to change those and leave the assumption that the stack is in addrspace 0?

In D31042#711596, @rjmccall wrote:

What else in LLVM actually hard-codes 0 as the assumed address space? Intrinsic function declarations? Optimizations that create new globals (if there are any?) Wouldn't it be much less invasive to change those and leave the assumption that the stack is in addrspace 0?

I'm not sure what you are suggesting. The concept of assumed address space doesn't make any sense on AMDGPU. Everything has an address space. This isn't changing how address space 0 is treated, and is specifically avoiding doing so. AMDGPU does want the assumed address space 0 properties, just for some of the other address spaces that are not the stack address space, which is one of the advantages of making this change. The problem isn't really places hard coding 0 as the address space; it's that the properties of the stack address space do not behave like LLVM's notion of the generic address space, as I mentioned in the RFC.

As far as 0 being a valid stack pointer, we currently have a workaround of not allocating user objects there. The bigger issue in my mind is that the pointer size for the stack does not match the code pointer size, which is a problematic area that keeps popping up. Another is that it is not valid to addrspacecast any other address space to the stack address space. One partial alternative would be to introduce a code pointer size to the datalayout or code address space or something along those lines. I think that would still leave a conceptual gap between AMDGPU's stack and alloca, and be more of a hassle. (For example, you still cannot addrspacecast any pointer to the address space of the stack.)

There are some intrinsics which hard code 0, some of which need to be fixed (such as lifetime intrinsics, which D31041 and D31043 take care of), but other than that, it's mostly a different question. There is a jump table optimization which creates new globals (which is disabled if BR_JT is illegal). Moving AMDGPU's flat address space to match LLVM's generic address space would also have the advantage of allowing legal addrspacecasts to address space 0 for some of these intrinsics.

In D31042#711611, @arsenm wrote:

In D31042#711596, @rjmccall wrote:

What else in LLVM actually hard-codes 0 as the assumed address space? Intrinsic function declarations? Optimizations that create new globals (if there are any?) Wouldn't it be much less invasive to change those and leave the assumption that the stack is in addrspace 0?

I'm not sure what you are suggesting. The concept of assumed address space doesn't make any sense on AMDGPU. Everything has an address space. This isn't changing how address space 0 is treated, and is specifically avoiding doing so. AMDGPU does want the assumed address space 0 properties, just for some of the other address spaces that are not the stack address space, which is one of the advantages of making this change. The problem isn't really places hard coding 0 as the address space; it's that the properties of the stack address space do not behave like LLVM's notion of the generic address space, as I mentioned in the RFC.

As far as 0 being a valid stack pointer, we currently have a workaround of not allocating user objects there. The bigger issue in my mind is that the pointer size for the stack does not match the code pointer size, which is a problematic area that keeps popping up. Another is that it is not valid to addrspacecast any other address space to the stack address space. One partial alternative would be to introduce a code pointer size to the datalayout or code address space or something along those lines. I think that would still leave a conceptual gap between AMDGPU's stack and alloca, and be more of a hassle. (For example, you still cannot addrspacecast any pointer to the address space of the stack.)

There are some intrinsics which hard code 0, some of which need to be fixed (such as lifetime intrinsics, which D31041 and D31043 take care of), but other than that, it's mostly a different question. There is a jump table optimization which creates new globals (which is disabled if BR_JT is illegal). Moving AMDGPU's flat address space to match LLVM's generic address space would also have the advantage of allowing legal addrspacecasts to address space 0 for some of these intrinsics.

You keep talking about LLVM having a "generic address space". I'm trying to figure out what you mean by that, because in LLVM, global values can be declared in an arbitrary address space, pointer values all carry an address space, instructions work on arbitrary address spaces, the memory intrinsics are generally overloaded by pointer address spaces, etc. And in general, the idea that there exists a generic address space which subsumes every other address space is just wrong. As an example, on x86-64 we use address spaces as a way of requesting segmented addressing; even if the memory there was addressable in the standard segment (which is not guaranteed), AFAIK there's no way to actually perform that conversion.

Given that, there's nothing wrong with LLVM deciding to hard-code a specific address space index as the address space of the stack. That's worth doing if it significantly simplifies the IR, e.g. by not requiring a data layout to be passed to CreateAlloca. It just means that frontends that *do* assume there's a generic address space may need to insert addrspace conversions in some cases.

it's okay to call out some specific address space, like addrspace 0, and say that it's always the address space of the stack.

I certainly hope that nothing in LLVM is ever implicitly inserting addrspace conversions or assuming anything about them without target-specific information. In general, it is not true that there exists an address space that subsumes all others.

I'd meant to edit out these last two paragraphs; alas, I'm making these comments through Phabricator, which uses a very small text edit box, and I just failed to see that there was more text below. That's why they seem to be redundant with the earlier comments, but at least they're not contradictory.

In D31042#711627, @rjmccall wrote:

You keep talking about LLVM having a "generic address space". I'm trying to figure out what you mean by that, because in LLVM, global values can be declared in an arbitrary address space, pointer values all carry an address space, instructions work on arbitrary address spaces, the memory intrinsics are generally overloaded by pointer address spaces, etc. And in general, the idea that there exists a generic address space which subsumes every other address space is just wrong. As an example, on x86-64 we use address spaces as a way of requesting segmented addressing; even if the memory there was addressable in the standard segment (which is not guaranteed), AFAIK there's no way to actually perform that conversion.

Given that, there's nothing wrong with LLVM deciding to hard-code a specific address space index as the address space of the stack. That's worth doing if it significantly simplifies the IR, e.g. by not requiring a data layout to be passed to CreateAlloca. It just means that frontends that *do* assume there's a generic address space may need to insert addrspace conversions in some cases.

it's okay to call out some specific address space, like addrspace 0, and say that it's always the address space of the stack.

I certainly hope that nothing in LLVM is ever implicitly inserting addrspace conversions or assuming anything about them without target-specific information. In general, it is not true that there exists an address space that subsumes all others.

LLVM treats 0 as the "generic" address space. This is how the LangRef refers to it. In the AMDGPU sense this is a pointer that can be used to access any memory. For AMDGPU, globals declared without an address space are invalid. All objects must have a definite address space.

There have been bugs in the past from random places inserting addrspacecasts but I don't think I've seen one recently

In D31042#711648, @arsenm wrote:

LLVM treats 0 as the "generic" address space. This is how the LangRef refers to it.

You're revising LangRef here one way or the other. I'm suggesting that a better revision would be to stop saying that address space 0 is the "generic" address space (because there is no such thing), and instead say that it is the address space of the stack. Is there anything actually preventing you from doing that?

In D31042#711661, @rjmccall wrote:

In D31042#711648, @arsenm wrote:

LLVM treats 0 as the "generic" address space. This is how the LangRef refers to it.

You're revising LangRef here one way or the other. I'm suggesting that a better revision would be to stop saying that address space 0 is the "generic" address space (because there is no such thing), and instead say that it is the address space of the stack. Is there anything actually preventing you from doing that?

Yes. The pointer size of the stack is smaller than the pointer size for code or most other memory. 0 is a valid, dereferenceable pointer for the stack address space.

In D31042#711662, @arsenm wrote:

In D31042#711661, @rjmccall wrote:

In D31042#711648, @arsenm wrote:

LLVM treats 0 as the "generic" address space. This is how the LangRef refers to it.

You're revising LangRef here one way or the other. I'm suggesting that a better revision would be to stop saying that address space 0 is the "generic" address space (because there is no such thing), and instead say that it is the address space of the stack. Is there anything actually preventing you from doing that?

Yes. The pointer size of the stack is smaller than the pointer size for code or most other memory. 0 is a valid, dereferenceable pointer for the stack address space.

I don't see why the pointer size has anything to do with it. In the absence of target-specific information, LLVM is not allowed to assume that one address space is subsumed by another address space at all. It doesn't matter if one of them is address space 0.

The special treatment of zero pointers is an interesting point, but again, I feel like that transformation could be made target-dependent *much* more easily than this. This is an incredibly invasive change.

In D31042#711771, @rjmccall wrote:

In D31042#711662, @arsenm wrote:

In D31042#711661, @rjmccall wrote:

In D31042#711648, @arsenm wrote:

LLVM treats 0 as the "generic" address space. This is how the LangRef refers to it.

You're revising LangRef here one way or the other. I'm suggesting that a better revision would be to stop saying that address space 0 is the "generic" address space (because there is no such thing), and instead say that it is the address space of the stack. Is there anything actually preventing you from doing that?

Yes. The pointer size of the stack is smaller than the pointer size for code or most other memory. 0 is a valid, dereferenceable pointer for the stack address space.

I don't see why the pointer size has anything to do with it. In the absence of target-specific information, LLVM is not allowed to assume that one address space is subsumed by another address space at all. It doesn't matter if one of them is address space 0.

The special treatment of zero pointers is an interesting point, but again, I feel like that transformation could be made target-dependent *much* more easily than this. This is an incredibly invasive change.

I think being able to opt out of any assumptions is CHERI's use case for this patch, but I don't work on that so I'm not sure.

The relationship between different address spaces is only indirectly related to adding an address space to alloca, and is more a useful side effect. I think LLVM's assumptions about pointers and address space 0 (and even more so, Clang's) are more widespread than you think. There is no concept of a code pointer size. This is one area where address space 0 is treated as a lack of an address space. The code size and stack pointer do not have the same size for AMDGPU, and it is important to be able to distinguish these. There is a lot of code that assumes the default address space size is "the one true pointer size". You can't pass around a 64-bit function pointer properly if it's required to have the same properties as a 32-bit stack pointer. It isn't really OK to start treating function pointer types differently than other types. Fixing that is an uphill battle in the frontend and backend. It is less work to align the AMDGPU generic address space notion with LLVM's "default" address space and the assumptions made about it than to fix everything else. Adding address spaces to functions would be a much more invasive change. I don't think adding this DataLayout argument is very invasive, and it was less work than I anticipated (the IRBuilder changes could also be avoided by getting the DataLayout from the insertion block's parent module). There is no big semantic change, just simple bookkeeping updates.

The pressing reason for this is C++ GPU support. Clang's handling of address spaces is more problematic. The assumptions about address space 0 are more widespread and tied to LLVM's address space 0. The work to start annotating every pointer there as required would be a much more invasive change.

The relationship between different address spaces is only indirectly related to adding an address space to alloca, and is more a useful side effect. I think LLVM's assumptions about pointers and address space 0 (and even more so, Clang's) are more widespread than you think. There is no concept of a code pointer size.

Well, address space is carried by PointerType, so it isn't true that LLVM IR has no concept of code pointer sizes. But I hadn't realized that Function doesn't have a way to actually set an address space on creation; I can see how that would make that change more involved. On the other hand, it seems like something that could easily just be added as a default argument to a couple of APIs; I don't see why *that's* seen as an invasive change but adding a new mandatory argument to the interfaces for creating one of the most common instruction classes is not.

This is one area where address space 0 is treated as a lack of an address space. The code size and stack pointer do not have the same size for AMDGPU, and it is important to be able to distinguish these.

Yes, I did understand that the first time it was explained.

There is a lot of code that assumes the default address space size is "the one true pointer size".

Are you suggesting that LLVM's address space 0 must always be at least as large as every other address space? Where is that assumption made? Again, that seems like something that should be fixed independent of what we decide here.

You can't pass around a 64-bit function pointer properly if it's required to have the same properties as a 32-bit stack pointer. It isn't really OK to start treating function pointer types differently than other types.

Of course. I am not suggesting that we should do that.

I don't think adding this DataLayout argument is very invasive, and was less work than I anticipated

That's because you aren't changing any of the frontends, most of which are not in-tree. I understand that LLVM does not promise to keep the C++ API stable, but there's surely *some* responsibility to not break every frontend for features they don't care about. At the very least, yes, you should be picking up this value from the Module instead of requiring it to be passed in.

The pressing reason for this is C++ GPU support. Clang's handling of address spaces is more problematic. The assumptions about address space 0 are more widespread and tied to LLVM's address space 0.

Clang's assumptions about address space 0 are pervasive because it represents an actual concept in the source language: C has a generic address space. My point is that LLVM generally should not.

I don't think the assumptions in Clang that the Clang AST address space 0 equals LLVM's address space 0 are nearly as pervasive as you think. At any rate, since you require stack allocations to be in a different LLVM address space from the generic address space, but (per C) taking the address of a local variable yields a pointer in the generic address space, you're going to have to teach Clang about these differences between address spaces anyway, because it'll have to insert address-space conversions in a number of places.

In D31042#711863, @rjmccall wrote:

There is a lot of code that assumes the default address space size is "the one true pointer size".

Are you suggesting that LLVM's address space 0 must always be at least as large as every other address space? Where is that assumption made? Again, that seems like something that should be fixed independent of what we decide here.

Not necessarily, it's just an occasional sticking point in some remaining code that isn't aware of multiple pointer sizes. One area was in debug info. Patches to fix that were rejected a few times in the past. If you grep for argumentless getPointerSizes in CodeGen+MC, most of them seem to be for debug info. I have another patch I've been holding on to which works around the DAG picking the wrong value type for pointers to function types. Pretty much everything in the DAG for dealing specifically with pointer types is unworkable with different sized pointers, but most of that is possible to avoid using.

You can't pass around a 64-bit function pointer properly if it's required to have the same properties as a 32-bit stack pointer. It isn't really OK to start treating function pointer types differently than other types.

Of course. I am not suggesting that we should do that.

I don't think adding this DataLayout argument is very invasive, and was less work than I anticipated

That's because you aren't changing any of the frontends, most of which are not in-tree. I understand that LLVM does not promise to keep the C++ API stable, but there's surely *some* responsibility to not break every frontend for features they don't care about. At the very least, yes, you should be picking up this value from the Module instead of requiring it to be passed in.

So your objection is just to the API change? I can change that if you insist. The actual mechanical adding of the argument isn't the source of the work. As someone who has maintained out of tree uses, this class of change is the least irritating. Most of the Clang IR gen work is in updating the downstream users that use hardcoded pointer types, which users not relying on this feature don't need to care about.

The pressing reason for this is C++ GPU support. Clang's handling of address spaces is more problematic. The assumptions about address space 0 are more widespread and tied to LLVM's address space 0.

Clang's assumptions about address space 0 are pervasive because it represents an actual concept in the source language: C has a generic address space. My point is that LLVM generally should not.

I don't think the assumptions in Clang that the Clang AST address space 0 equals LLVM's address space 0 are nearly as pervasive as you think. At any rate, since you require stack allocations to be in a different LLVM address space from the generic address space, but (per C) taking the address of a local variable yields a pointer in the generic address space, you're going to have to teach Clang about these differences between address spaces anyway, because it'll have to insert address-space conversions in a number of places.

Yes, I expect there will be many addrspacecasts inserted from the alloca, which then the InferAddressSpaces pass would hopefully optimize away.

In D31042#711913, @arsenm wrote:

In D31042#711863, @rjmccall wrote:

I don't think the assumptions in Clang that the Clang AST address space 0 equals LLVM's address space 0 are nearly as pervasive as you think. At any rate, since you require stack allocations to be in a different LLVM address space from the generic address space, but (per C) taking the address of a local variable yields a pointer in the generic address space, you're going to have to teach Clang about these differences between address spaces anyway, because it'll have to insert address-space conversions in a number of places.

Yes, I expect there will be many addrspacecasts inserted from the alloca, which then the InferAddressSpaces pass would hopefully optimize away.

Okay. So, to summarize:

In AMDGPU OpenCL, at source level, sizeof(void __private *) == sizeof(void __{global,local,constant} *) == 8. However, at an implementation level, stack addresses are actually 32-bit, and you want that to be modeled correctly in the IR. Therefore, allocas need to return a value in a 32-bit address space, which the frontend will immediately widen to a 64-bit address space in order to match the rules for __private. In order to maintain performance, you have a pass that recognizes when an address value is the result of that kind of widening and re-narrows it to the 32-bit stack address space.

Because LLVM currently makes some unfortunate assumptions about address space 0, this stack address space cannot be address space 0. These assumptions include:
(1) As far as debug info is concerned, the size of a pointer is the size of a pointer in address space 0. (Is this assumption why sizeof(void __private *) == 8 despite __private being clearly intended in the OpenCL spec to be the address space of stack pointers?)
(2) Functions are currently always defined in address space 0. A function pointer in your implementation must be in a 64-bit address space. (My understanding was that OpenCL generally forbade function pointers, so I'm not sure if this is actually important?)
(3) Some optimizations assume that 0 is an invalid address in address space 0, but in fact it's a valid stack address.
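
Assumption (1) can be illustrated with a toy model of per-address-space pointer widths. This is a sketch under stated assumptions: `PointerSizes`, its methods, and the address space number 5 are hypothetical choices for illustration, and the fallback to address space 0 for unspecified spaces is a property of this toy rather than a claim about any particular LLVM version.

```cpp
#include <cassert>
#include <map>

// Toy model (not LLVM's DataLayout): pointer width in bits per address space.
class PointerSizes {
  std::map<unsigned, unsigned> Bits;

public:
  void set(unsigned AddrSpace, unsigned Width) { Bits[AddrSpace] = Width; }

  // Address-space-aware query. In this toy, an unspecified address space
  // falls back to the width recorded for address space 0.
  unsigned bitsFor(unsigned AddrSpace) const {
    auto It = Bits.find(AddrSpace);
    if (It != Bits.end())
      return It->second;
    It = Bits.find(0);
    assert(It != Bits.end() && "address space 0 must have a width");
    return It->second;
  }

  // Argumentless query: silently answers for address space 0, which is the
  // "one true pointer size" assumption discussed in the thread.
  unsigned bits() const { return bitsFor(0); }
};
```

If address space 0 is 64-bit and a hypothetical stack address space 5 is 32-bit, `bits()` reports 64 while `bitsFor(5)` reports 32, so any consumer that only calls the argumentless form gets the wrong width for stack pointers.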

It's entirely possible that we should, indeed, change all of these assumptions. For the time being, you feel that's not a reasonable fight, and so you would like to allow IR to use an explicitly non-0 address space for the stack.

I think I'm prepared to accept your argument.

I don't think adding this DataLayout argument is very invasive, and was less work than I anticipated

That's because you aren't changing any of the frontends, most of which are not in-tree. I understand that LLVM does not promise to keep the C++ API stable, but there's surely *some* responsibility to not break every frontend for features they don't care about. At the very least, yes, you should be picking up this value from the Module instead of requiring it to be passed in.

So your objection is just to the API change? I can change that if you insist. The actual mechanical adding of the argument isn't the source of the work. As someone who has maintained out of tree uses, this class of change is the least irritating. Most of the Clang IR gen work is in updating the downstream users that use hardcoded pointer types, which users not relying on this feature don't need to care about.

Yes, I would like you to avoid changing the IRBuilder API if you can. I can see why you need to change the AllocaInst constructors, though.

Do you actually need this to be printed/parsed in .ll files? It's globally consistent across all alloca instructions, and the data layout is always parsed before anything else in the file. It's not the end of the world, just seems odd to imply that this is actually something that can be different on different instructions.

Do you actually need this to be printed/parsed in .ll files? It's globally consistent across all alloca instructions, and the data layout is always parsed before anything else in the file. It's not the end of the world, just seems odd to imply that this is actually something that can be different on different instructions.

No, strictly speaking it isn't necessary: every module has a datalayout, so we can compute the type from that. I requested it to make debugging more straightforward. Without it, there isn't any obvious indication if you dump() an instruction or look at an IR file that the address-space of the result is non-zero.
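
The debugging argument can be sketched as a toy printer plus a consistency check. Both functions are hypothetical illustrations, not LLVM's assembly writer or verifier. The sketch follows the earlier revision note ("Print as , addrspace(num), but require it match the datalayout"), and omitting the suffix for address space 0 is an assumption made here for the example.

```cpp
#include <cassert>
#include <string>

// Toy printer: renders an alloca as text and appends ", addrspace(N)" only
// when the result is not in address space 0 (an assumption of this sketch).
std::string printAlloca(const std::string &Name, const std::string &Ty,
                        unsigned AddrSpace) {
  std::string Out = "%" + Name + " = alloca " + Ty;
  if (AddrSpace != 0)
    Out += ", addrspace(" + std::to_string(AddrSpace) + ")";
  return Out;
}

// Toy check: the address space written on an alloca must agree with the one
// the module's data layout specifies for allocas.
bool allocaMatchesLayout(unsigned PrintedAddrSpace, unsigned LayoutAddrSpace) {
  return PrintedAddrSpace == LayoutAddrSpace;
}
```

The printed suffix carries no information beyond the data layout, which is why it is redundant for parsing, but it makes a non-zero result address space visible when a single instruction is dumped on its own.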

In D31042#712292, @efriedma wrote:

Do you actually need this to be printed/parsed in .ll files? It's globally consistent across all alloca instructions, and the data layout is always parsed before anything else in the file. It's not the end of the world, just seems odd to imply that this is actually something that can be different on different instructions.

No, strictly speaking it isn't necessary: every module has a datalayout, so we can compute the type from that. I requested it to make debugging more straightforward. Without it, there isn't any obvious indication if you dump() an instruction or look at an IR file that the address-space of the result is non-zero.

Well, since IR assembly is primarily a debugging / testing aid, and you won't get similar test output anyway because the address space will show up in every use of the instruction, I suppose it's fine.

In D31042#712124, @rjmccall wrote:

In D31042#711913, @arsenm wrote:

In D31042#711863, @rjmccall wrote:

I don't think the assumptions in Clang that the Clang AST address space 0 equals LLVM's address space 0 are nearly as pervasive as you think. At any rate, since you require stack allocations to be in a different LLVM address space from the generic address space, but (per C) taking the address of a local variable yields a pointer in the generic address space, you're going to have to teach Clang about these differences between address spaces anyway, because it'll have to insert address-space conversions in a number of places.

Yes, I expect there will be many addrspacecasts inserted from the alloca, which then the InferAddressSpaces pass would hopefully optimize away.

Okay. So, to summarize:

In amdgpu OpenCL, at source level, sizeof(void __private *) == sizeof(void __{global,local,constant} *) == 8. However, at an implementation level, stack addresses are actually 32-bit, and you want that to be modeled correctly in the IR. Therefore, allocas need to return a value in a 32-bit address space, which the frontend will immediately widen to a 64-bit address space in order to match the rules for __private. In order to maintain performance, you have a pass that recognizes when an address value is the result of that kind of widening and re-narrows it to the 32-bit stack address space.

This is mostly accurate except for the detail about the cast. For OpenCL 2.0 or C++ the alloca pointer will most likely need to be addrspacecasted to the OpenCL generic address space. Private pointers in memory may need to be zero extended to 64-bit in memory. For OpenCL 1.x there is no generic address space, and the old subtargets don't have the necessary hardware features to support this. This is one of the reasons I don't want to use the same workaround NVPTX uses for this problem. There would be a pointer that, if not optimized out, would require creative codegen work to artificially support it on those targets.

Because LLVM currently makes some unfortunate assumptions about address space 0, this stack address space cannot be address space 0. These assumptions include:
(1) As far as debug info is concerned, the size of a pointer is the size of a pointer in address space 0. (Is this assumption why sizeof(void __private *) == 8 despite __private being clearly intended in the OpenCL spec to be the address space of stack pointers?)

This is a point that the spec is vague on. While the spec explicitly allows different size pointers for different address spaces, there isn't really any detailed description of what this entails. This wasn't addressed at all by the SPIR spec, which just defined a single 64-bit datalayout string. sizeof really indicates the in-memory representation's size, so by using a DataLayout with 32-bit pointers with an ABI alignment of 64-bit, we get the necessary codegen properties of a 32-bit pointer while maintaining the required ABI and single sizeof().

(2) Functions are currently always defined in address space 0. A function pointer in your implementation must be in a 64-bit address space. (My understanding was that OpenCL generally forbade function pointers, so I'm not sure if this is actually important?)

It is true that OpenCL doesn't allow function pointers at the source level. I'm working on implementing calls without inlining everything now, and correctly lowering LLVM IR for them requires the DAG to pick the right MVT when emitting the global address, so it still comes up there. Function pointer support is also helpful for implementing other languages being worked on, like C++ and Python. I believe recent versions of CUDA support them as well.

(3) Some optimizations assume that 0 is an invalid address in address space 0, but in fact it's a valid stack address.

It's entirely possible that we should, indeed, change all of these assumptions. For the time being, you feel that's not a reasonable fight, and so you would like to allow IR to use an explicitly non-0 address space for the stack.

Yes. Some day I would like to be able to apply some of these assumptions to other non-0 address spaces as well (i.e. 0 is null in some address spaces, but not others) but I think that is a larger task.

I think I'm prepared to accept your argument.

I don't think adding this DataLayout argument is very invasive, and it was less work than I anticipated.

That's because you aren't changing any of the frontends, most of which are not in-tree. I understand that LLVM does not promise to keep the C++ API stable, but there's surely *some* responsibility to not break every frontend for features they don't care about. At the very least, yes, you should be picking up this value from the Module instead of requiring it to be passed in.

So your objection is just to the API change? I can change that if you insist. The actual mechanical adding of the argument isn't the source of the work. As someone who has maintained out-of-tree uses, this class of change is the least irritating. Most of the clang IR gen work is in updating the downstream users which use hardcoded pointer types, which users who aren't using this don't need to care about.

Yes, I would like you to avoid changing the IRBuilder API if you can. I can see why you need to change the AllocaInst constructors, though.

Do you actually need this to be printed/parsed in .ll files? It's globally consistent across all alloca instructions, and the data layout is always parsed before anything else in the file. It's not the end of the world, just seems odd to imply that this is actually something that can be different on different instructions.

No, this was just a request in the RFC thread to make it more clear. I can maybe envision a use case in the future for having a different address space on individual allocas, but that is not a concern I have today.

Don't change IRBuilder API

In D31042#712324, @arsenm wrote:

In D31042#712124, @rjmccall wrote:

In D31042#711913, @arsenm wrote:

In D31042#711863, @rjmccall wrote:

I don't think the assumptions in Clang that the Clang AST address space 0 equals LLVM's address space 0 are nearly as pervasive as you think. At any rate, since you require stack allocations to be in a different LLVM address space from the generic address space, but (per C) taking the address of a local variable yields a pointer in the generic address space, you're going to have to teach Clang about these differences between address spaces anyway, because it'll have to insert address-space conversions in a number of places.

Yes, I expect there will be many addrspacecasts inserted from the alloca, which then the InferAddressSpaces pass would hopefully optimize away.

Okay. So, to summarize:

In amdgpu OpenCL, at source level, sizeof(void __private *) == sizeof(void __{global,local,constant} *) == 8. However, at an implementation level, stack addresses are actually 32-bit, and you want that to be modeled correctly in the IR. Therefore, allocas need to return a value in a 32-bit address space, which the frontend will immediately widen to a 64-bit address space in order to match the rules for __private. In order to maintain performance, you have a pass that recognizes when an address value is the result of that kind of widening and re-narrows it to the 32-bit stack address space.

This is mostly accurate except for the detail about the cast. For OpenCL 2.0 or C++ the alloca pointer will most likely need to be addrspacecasted to the OpenCL generic address space.

It looks to me like the language spec says that the address of a local variable is in the __private address space, but that a pointer in any address space can be promoted into the generic address space. Clang's AST should call out that second promotion as a separate ImplicitCastExpr. Of course, you may decide that you want to peephole it into a single addrspacecast instruction.

Private pointers in memory may need to be zero extended to 64-bit in memory. For OpenCL 1.x there is no generic address space, and the old subtargets don't have the necessary hardware features to support this.

By "this", you mean an efficiently-accessible generic address space? Presumably because the different memory-access operations work on specific address spaces and so, as you say, you would need creative codegen to figure out which case the pointer originally belonged to. That makes sense.

Because LLVM currently makes some unfortunate assumptions about address space 0, this stack address space cannot be address space 0. These assumptions include:
(1) As far as debug info is concerned, the size of a pointer is the size of a pointer in address space 0. (Is this assumption why sizeof(void __private *) == 8 despite __private being clearly intended in the OpenCL spec to be the address space of stack pointers?)

This is a point that the spec is vague on. While the spec explicitly allows different size pointers for different address spaces, there isn't really any detailed description of what this entails.

I don't see what the problem is. The spec doesn't allow pointers to be converted between address spaces at all (except for the promotion into the generic address space in OpenCL 2.0). Pointers into different address spaces are just completely different types.

I mean, in general, the OpenCL spec is rather hilariously poorly-drafted, and as a compiler programmer you have to read between the lines to figure out how things are supposed to be generalized. The old 1.1 spec used to say that "variables declared as pointers are assumed to point to the __private address space if an address space qualifier is not specified", and everyone just assumes that that's meant to be a general rule for pointer types, not something that only applies to variables and only at the outermost level of pointer. Similarly, you have to read the paragraph about assignability as a general conversion rule, and you have to guess that probably when the 2.0 specification says that "[c]asting a pointer to address space A to a pointer to address space B is illegal if A and B are named address spaces and A is not the same as B", that it's okay to both cast into *and out of* the generic address space, and you have to guess at what the semantics of the latter are supposed to be. The best guidance I can offer is to instead read the Embedded C specification (WG14 N1169 might be the most recent version), which goes into address spaces somewhat more rigorously; it's what I've always used as a baseline for understanding what OpenCL is trying to do. From this point of view, the four OpenCL 1.0 address spaces are disjoint, and OpenCL 2.0's generic address space is simply one that is a superset of all the others. Embedded C allows casts between address spaces, but the cast has undefined behavior if the pointer doesn't actually point into the destination address space. Since the four named address spaces are formally disjoint, casts directly between them are always invalid, and so OpenCL's rule that they're just forbidden makes sense. Anyway, that's what I would suggest as an approach for understanding OpenCL.
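As an illustration of the reading above, the quoted 2.0 casting rule can be written as a small predicate. This is only a sketch: the `AddrSpace` enum and the `isNamed` and `isCastPermitted` names are hypothetical, and it encodes nothing beyond the one quoted sentence. The actual specification may impose further restrictions (for instance around `__constant`) that are not modeled here.

```cpp
#include <cassert>

// Hypothetical model of the OpenCL address spaces discussed in this thread.
enum class AddrSpace { Private, Global, Local, Constant, Generic };

// A "named" address space is any of the four disjoint ones, i.e. not generic.
constexpr bool isNamed(AddrSpace AS) { return AS != AddrSpace::Generic; }

// Quoted rule: casting from A to B is illegal if A and B are both named
// address spaces and A is not the same as B. Everything else is permitted,
// which includes casts into and out of the generic address space.
constexpr bool isCastPermitted(AddrSpace From, AddrSpace To) {
  return !(isNamed(From) && isNamed(To) && From != To);
}

// Compile-time spot checks of the rule as stated.
static_assert(isCastPermitted(AddrSpace::Private, AddrSpace::Generic),
              "promotion into generic is allowed");
static_assert(!isCastPermitted(AddrSpace::Global, AddrSpace::Local),
              "distinct named address spaces cannot be cast directly");
```

Under the Embedded C interpretation, a permitted cast out of the generic address space would still have undefined behavior when the pointer does not actually point into the destination address space; the predicate only captures which casts are well-formed.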

This wasn't addressed at all by the SPIR spec, which just defined a single 64-bit datalayout string. sizeof really indicates the in-memory representation's size, so by using a DataLayout with 32-bit pointers with an ABI alignment of 64-bit, we get the necessary codegen properties of a 32-bit pointer while maintaining the required ABI and single sizeof().

Well, SPIR is an abstract representation. If you were emitting SPIR, you wouldn't want any of these assumptions about stack addressing to leak into it. If you're lowering from SPIR, of course, you just have to invent an ABI rule for how your 32-bit pointers are passed around as 64-bit quantities, and then you globally rewrite the IR to implement that.

(2) Functions are currently always defined in address space 0. A function pointer in your implementation must be in a 64-bit address space. (My understanding was that OpenCL generally forbade function pointers, so I'm not sure if this is actually important?)

It is true that OpenCL doesn't allow function pointers at the source level. I'm working on implementing calls without inlining everything now, and correctly lowering LLVM IR for them requires the DAG to pick the right MVT when emitting the global address, so it still comes up there. Function pointer support is also helpful for implementing other languages being worked on, like C++ and Python. I believe recent versions of CUDA support them as well.

Well, if you don't have real function pointers, and therefore the only use of functions is in direct calls, it feels like you can always hack around pointer-size problems in your backend pretty easily. If the source language has real function pointers, it also has to decide what address space they're in — it could be a dedicated address space for functions, which would be legal under C rules, but which would take some effort to support in Clang.

(3) Some optimizations assume that 0 is an invalid address in address space 0, but in fact it's a valid stack address.

It's entirely possible that we should, indeed, change all of these assumptions. For the time being, you feel that's not a reasonable fight, and so you would like to allow IR to use an explicitly non-0 address space for the stack.

Yes. Some day I would like to be able to apply some of these assumptions to other non-0 address spaces as well (i.e. 0 is null in some address spaces, but not others) but I think that is a larger task.

Yes, that seems like a good goal. In fact, it would be nice if the assumptions about address space 0 were just a default behavior that could be overridden by the data layout.

In D31042#712484, @rjmccall wrote:

In D31042#712324, @arsenm wrote:

In D31042#712124, @rjmccall wrote:

In D31042#711913, @arsenm wrote:

In D31042#711863, @rjmccall wrote:

I don't think the assumptions in Clang that the Clang AST address space 0 equals LLVM's address space 0 are nearly as pervasive as you think. At any rate, since you require stack allocations to be in a different LLVM address space from the generic address space, but (per C) taking the address of a local variable yields a pointer in the generic address space, you're going to have to teach Clang about these differences between address spaces anyway, because it'll have to insert address-space conversions in a number of places.

Yes, I expect there will be many addrspacecasts inserted from the alloca, which then the InferAddressSpaces pass would hopefully optimize away.

Okay. So, to summarize:

In amdgpu OpenCL, at source level, sizeof(void __private *) == sizeof(void __{global,local,constant} *) == 8. However, at an implementation level, stack addresses are actually 32-bit, and you want that to be modeled correctly in the IR. Therefore, allocas need to return a value in a 32-bit address space, which the frontend will immediately widen to a 64-bit address space in order to match the rules for __private. In order to maintain performance, you have a pass that recognizes when an address value is the result of that kind of widening and re-narrows it to the 32-bit stack address space.

This is mostly accurate except for the detail about the cast. For OpenCL 2.0 or C++ the alloca pointer will most likely need to be addrspacecasted to the OpenCL generic address space.

It looks to me like the language spec says that the address of a local variable is in the __private address space, but that a pointer in any address space can be promoted into the generic address space. Clang's AST should call out that second promotion as a separate ImplicitCastExpr. Of course, you may decide that you want to peephole it into a single addrspacecast instruction.

Private pointers in memory may need to be zero extended to 64-bit in memory. For OpenCL 1.x there is no generic address space, and the old subtargets don't have the necessary hardware features to support this.

By "this", you mean an efficiently-accessible generic address space? Presumably because the different memory-access operations work on specific address spaces and so, as you say, you would need creative codegen to figure out which case the pointer originally belonged to. That makes sense.

Yes

Because LLVM currently makes some unfortunate assumptions about address space 0, this stack address space cannot be address space 0. These assumptions include:
(1) As far as debug info is concerned, the size of a pointer is the size of a pointer in address space 0. (Is this assumption why sizeof(void __private *) == 8 despite __private being clearly intended in the OpenCL spec to be the address space of stack pointers?)

This is a point that the spec is vague on. While the spec explicitly allows different size pointers for different address spaces, there isn't really any detailed description of what this entails.

I don't see what the problem is. The spec doesn't allow pointers to be converted between address spaces at all (except for the promotion into the generic address space in OpenCL 2.0). Pointers into different address spaces are just completely different types.

The problem is mostly: what happens if the host and device pointer sizes don't match? Struct layouts etc. still need to be compatible for whatever memory buffer was passed into the kernel. There isn't much practical reason to do it, but you could have a struct with private pointer members in it changing the offsets of the other items, not that you could do anything valid with the contents.
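To make the layout concern concrete, here is a rough sketch of how the offsets in a struct shift with the pointer's size and ABI alignment. The `Field`, `Layout`, `layoutStruct`, and `layoutWithPointer` names are hypothetical, and the layout rule is a simplified C-style one (each field goes at the next suitably aligned offset and occupies its size rounded up to its alignment); it is not taken from the patch.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Size and ABI alignment of one field, in bytes (hypothetical helper type).
struct Field {
  uint64_t Size;
  uint64_t Align;
};

// Result of laying out a struct: per-field offsets and the total size.
struct Layout {
  std::vector<uint64_t> Offsets;
  uint64_t Size;
};

// Rounds Value up to the next multiple of Align (Align must be nonzero).
inline uint64_t alignTo(uint64_t Value, uint64_t Align) {
  return (Value + Align - 1) / Align * Align;
}

// Simplified C-style layout: place each field at the next aligned offset,
// let it occupy its size rounded up to its alignment, and round the total
// size up to the largest field alignment.
inline Layout layoutStruct(const std::vector<Field> &Fields) {
  Layout L{{}, 0};
  uint64_t MaxAlign = 1;
  for (const Field &F : Fields) {
    uint64_t Offset = alignTo(L.Size, F.Align);
    L.Offsets.push_back(Offset);
    L.Size = Offset + alignTo(F.Size, F.Align);
    if (F.Align > MaxAlign)
      MaxAlign = F.Align;
  }
  L.Size = alignTo(L.Size, MaxAlign);
  return L;
}

// Layout of `struct { int a; T __private *p; int b; }` for a given pointer.
inline Layout layoutWithPointer(Field Ptr) {
  return layoutStruct({{4, 4}, Ptr, {4, 4}});
}
```

With a 64-bit host pointer (`{8, 8}`) the offsets are 0, 8, 16; with a plain 32-bit pointer (`{4, 4}`) they become 0, 4, 8, so `b` moves. Giving the 32-bit pointer an 8-byte ABI alignment (`{4, 8}`), as suggested earlier in the thread, restores the host-compatible offsets under this model.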

In D31042#712613, @arsenm wrote:

In D31042#712484, @rjmccall wrote:

Because LLVM currently makes some unfortunate assumptions about address space 0, this stack address space cannot be address space 0. These assumptions include:
(1) As far as debug info is concerned, the size of a pointer is the size of a pointer in address space 0. (Is this assumption why sizeof(void __private *) == 8 despite __private being clearly intended in the OpenCL spec to be the address space of stack pointers?)

This is a point that the spec is vague on. While the spec explicitly allows different size pointers for different address spaces, there isn't really any detailed description of what this entails.

I don't see what the problem is. The spec doesn't allow pointers to be converted between address spaces at all (except for the promotion into the generic address space in OpenCL 2.0). Pointers into different address spaces are just completely different types.

The problem is mostly: what happens if the host and device pointer sizes don't match? Struct layouts etc. still need to be compatible for whatever memory buffer was passed into the kernel. There isn't much practical reason to do it, but you could have a struct with private pointer members in it changing the offsets of the other items, not that you could do anything valid with the contents.

The easy lowering would be to leave a 64-bit private address space around but consider it "illegal". Address space conversions between it and the 32-bit private address space would simply be zexts and truncs. But your 8-byte-aligned pointer idea works, too, assuming it doesn't give LLVM too many fits to have an over-aligned primitive type.
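A minimal sketch of the "easy lowering", assuming the two private representations are modeled as plain integers: widening is a zero extension and narrowing is a truncation. The `Private32`, `Private64`, `widenPrivate`, and `narrowPrivate` names are made up for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for the two representations of a private pointer.
using Private32 = uint32_t; // the 32-bit stack address the hardware uses
using Private64 = uint64_t; // the 64-bit form treated as "illegal"

// 32-bit to 64-bit conversion: zero extension (zext).
constexpr Private64 widenPrivate(Private32 P) {
  return static_cast<Private64>(P);
}

// 64-bit to 32-bit conversion: truncation (trunc) to the low 32 bits.
constexpr Private32 narrowPrivate(Private64 P) {
  return static_cast<Private32>(P);
}

// Widening followed by narrowing always returns the original value.
static_assert(narrowPrivate(widenPrivate(0xFFFFFFFFu)) == 0xFFFFFFFFu,
              "zext followed by trunc round-trips");
```

The round trip only holds in the widen-then-narrow direction; narrowing a 64-bit value whose upper half is nonzero discards those bits.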

The more complex lowering would be to rewrite all the struct types to replace T addrspace(private64)* with { T addrspace(private32)*, i32 }. There are actually relatively few IR constructs that work with struct types: GEP, insertvalue/extractvalue, and constant aggregates; I think that's it. The IR type would change alignment, but only by getting weaker, and memory accesses should all have explicit alignments anyway.
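The size and alignment claim for the struct rewrite can be checked with a C++ analogy. This is a sketch under stated assumptions: `Private64Slot` and `Private32Pair` are hypothetical stand-ins for the two IR types, `uint32_t` is assumed to be 4-byte aligned (true on mainstream targets), and placing the pointer first assumes a little-endian layout.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Stand-in for `T addrspace(private64)*`: one 8-byte, 8-byte-aligned slot.
struct alignas(8) Private64Slot {
  uint64_t Bits;
};

// Stand-in for `{ T addrspace(private32)*, i32 }`: pointer plus padding word.
struct Private32Pair {
  uint32_t Ptr;
  uint32_t Pad;
};

// The rewritten type occupies the same number of bytes...
static_assert(sizeof(Private32Pair) == sizeof(Private64Slot),
              "rewrite preserves the in-memory size");
// ...but its alignment requirement only gets weaker.
static_assert(alignof(Private32Pair) < alignof(Private64Slot),
              "rewrite weakens the alignment");
static_assert(offsetof(Private32Pair, Ptr) == 0,
              "pointer sits in the low word (little-endian assumption)");

// Converts a 64-bit slot into the rewritten pair, zeroing the padding.
inline Private32Pair rewriteSlot(Private64Slot S) {
  return {static_cast<uint32_t>(S.Bits), 0};
}
```

Because only the alignment changes, and only downward, any memory access that already carries an explicit alignment stays valid after the rewrite.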

Anyway, I guess we're getting a bit afield.

John.

yaxunl added a subscriber: yaxunl. Mar 30 2017, 11:22 AM

whchung added a subscriber: whchung. Apr 2 2017, 7:49 PM

Any more comments for the patch?

Not from me.

t-tye added a subscriber: t-tye. Apr 6 2017, 8:45 AM

Is this good to commit?

Yes, I think you've addressed everything.

This revision is now accepted and ready to land. Apr 10 2017, 11:14 AM

r299888

Revision Contents

Path	Size
docs/LangRef.rst	7 lines
include/llvm/IR/DataLayout.h	3 lines
include/llvm/IR/IRBuilder.h	11 lines
include/llvm/IR/Instructions.h	15 lines
lib/AsmParser/LLParser.h	2 lines
lib/AsmParser/LLParser.cpp	53 lines
lib/Bitcode/Reader/BitcodeReader.cpp	7 lines
lib/CodeGen/AtomicExpandPass.cpp	6 lines
lib/CodeGen/SafeStack.cpp	6 lines
lib/CodeGen/ShadowStackGCLowering.cpp	3 lines
lib/CodeGen/SjLjEHPrepare.cpp	4 lines
lib/CodeGen/StackProtector.cpp	2 lines
lib/CodeGen/WinEHPrepare.cpp	6 lines
lib/IR/ [file name missing]	6 lines
lib/IR/ [file name missing]	9 lines
lib/IR/ [file name missing]	8 lines
lib/IR/ [file name missing]	41 lines
lib/IR/ [file name missing]	5 lines
lib/Target/NVPTX/NVPTXLowerArgs.cpp	3 lines
lib/Target/X86/X86WinEHState.cpp	8 lines
lib/Transforms/Coroutines/CoroElide.cpp	3 lines
lib/Transforms/Coroutines/CoroSplit.cpp	4 lines
lib/Transforms/IPO/ArgumentPromotion.cpp	5 lines
lib/Transforms/IPO/GlobalOpt.cpp	4 lines
lib/Transforms/InstCombine/InstCombineCasts.cpp	2 lines
lib/Transforms/InstCombine/InstCombineLoadStoreAlloca.cpp	3 lines
lib/Transforms/Instrumentation/AddressSanitizer.cpp	20 lines
lib/Transforms/Instrumentation/DataFlowSanitizer.cpp	18 lines
lib/Transforms/Instrumentation/MemorySanitizer.cpp	15 lines
lib/Transforms/Scalar/RewriteStatepointsForGC.cpp	4 lines
lib/Transforms/Scalar/SROA.cpp	14 lines
lib/Transforms/Utils/CodeExtractor.cpp	14 lines
lib/Transforms/Utils/DemoteRegToStack.cpp	17 lines
lib/Transforms/Utils/InlineFunction.cpp	14 lines
test/Assembler/alloca-addrspace-parse-error-0.ll	11 lines
test/Assembler/alloca-addrspace-parse-error-1.ll	12 lines
test/Assembler/alloca-addrspace0.ll	24 lines
test/Assembler/datalayout-alloca-addrspace-mismatch-0.ll	9 lines
test/Assembler/datalayout-alloca-addrspace-mismatch-1.ll	9 lines
test/Assembler/datalayout-alloca-addrspace-mismatch-2.ll	11 lines
test/Assembler/datalayout-alloca-addrspace.ll	23 lines
test/Assembler/invalid-datalayout-alloca-addrspace.ll	4 lines
test/Transforms/SROA/alloca-address-space.ll	84 lines
tools/llvm-stress/llvm-stress.cpp	4 lines
unittests/Analysis/ScalarEvolutionTest.cpp	4 lines
unittests/IR/IRBuilderTest.cpp	10 lines
unittests/Transforms/Utils/Cloning.cpp	3 lines
unittests/Transforms/Utils/MemorySSA.cpp	22 lines

Diff 92080

docs/LangRef.rst


[1,806 lines hidden; context: ``e``]
location.		location.
``S<size>``		``S<size>``
Specifies the natural alignment of the stack in bits. Alignment		Specifies the natural alignment of the stack in bits. Alignment
promotion of stack variables is limited to the natural stack		promotion of stack variables is limited to the natural stack
alignment to avoid dynamic stack realignment. The stack alignment		alignment to avoid dynamic stack realignment. The stack alignment
must be a multiple of 8-bits. If omitted, the natural stack		must be a multiple of 8-bits. If omitted, the natural stack
alignment defaults to "unspecified", which does not prevent any		alignment defaults to "unspecified", which does not prevent any
alignment promotions.		alignment promotions.
		``A<address space>``
		Specifies the address space of the result for the stack. Defaults to the
		default address space of 0.
		efriedma: "the address space of the result for the stack" is a little unclear.
		arsenm: I wasn't sure about referring to the stack at all. Should I rename everything added that says "stack" to say "alloca" (e.g. DL.getStackAddrSpace() -> DL.getAllocaAddrSpace())?
		efriedma: Well, in many contexts, it isn't precisely clear what the stack refers to... and there's a bunch of other stuff conventionally stored on the stack which could be stored elsewhere. Maybe better to just call it the address-space of allocas, I guess? Even assuming the meaning of "the stack" is clear, the sentence is missing something: it doesn't mention allocation anywhere.
``p[n]:<size>:<abi>:<pref>``		``p[n]:<size>:<abi>:<pref>``
This specifies the size of a pointer and its ``<abi>`` and		This specifies the size of a pointer and its ``<abi>`` and
``<pref>``\erred alignments for address space ``n``. All sizes are in		``<pref>``\erred alignments for address space ``n``. All sizes are in
bits. The address space, ``n``, is optional, and if not specified,		bits. The address space, ``n``, is optional, and if not specified,
denotes the default address space 0. The value of ``n`` must be		denotes the default address space 0. The value of ``n`` must be
in the range [1,2^23).		in the range [1,2^23).
``i<size>:<abi>:<pref>``		``i<size>:<abi>:<pref>``
This specifies the alignment for an integer type of a given bit		This specifies the alignment for an integer type of a given bit
[5,321 lines hidden]
'``alloca``' Instruction		'``alloca``' Instruction
^^^^^^^^^^^^^^^^^^^^^^^^		^^^^^^^^^^^^^^^^^^^^^^^^

Syntax:		Syntax:
"""""""		"""""""

::		::

<result> = alloca [inalloca] <type> [, <ty> <NumElements>] [, align <alignment>] ; yields type*:result		<result> = alloca [inalloca] <type> [, <ty> <NumElements>] [, align <alignment>] [, addrspace(<num>)] ; yields type addrspace(num)*:result

Overview:		Overview:
"""""""""		"""""""""

The '``alloca``' instruction allocates memory on the stack frame of the		The '``alloca``' instruction allocates memory on the stack frame of the
currently executing function, to be automatically released when this		currently executing function, to be automatically released when this
function returns to its caller. The object is always allocated in the		function returns to its caller. The object is always allocated in the
generic address space (address space zero).		address space for allocas indicated in the datalayout.

Arguments:		Arguments:
""""""""""		""""""""""

The '``alloca``' instruction allocates ``sizeof(<type>)*NumElements``		The '``alloca``' instruction allocates ``sizeof(<type>)*NumElements``
bytes of memory on the runtime stack, returning a pointer of the		bytes of memory on the runtime stack, returning a pointer of the
appropriate type to the program. If "NumElements" is specified, it is		appropriate type to the program. If "NumElements" is specified, it is
the number of elements allocated, otherwise "NumElements" is defaulted		the number of elements allocated, otherwise "NumElements" is defaulted
[6,038 lines hidden]
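For readers unfamiliar with the datalayout string, the new `A<address space>` component could be extracted along the following lines. This is a hypothetical standalone helper, not LLVM's parser: the `parseAllocaAddrSpace` name is invented, the string is assumed to be a `-`-separated list of specifications, and the 24-bit upper bound on the address space number is an assumption of this sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>

// Hypothetical helper (not LLVM's implementation): scans a '-'-separated
// datalayout string for an `A<n>` component. On success, stores the alloca
// address space in Out (0 when no `A` component is present) and returns
// true. Returns false when an `A` component is malformed.
inline bool parseAllocaAddrSpace(const std::string &LayoutStr, uint32_t &Out) {
  uint32_t Result = 0; // default address space when `A` is omitted
  size_t Pos = 0;
  while (Pos <= LayoutStr.size()) {
    size_t End = LayoutStr.find('-', Pos);
    if (End == std::string::npos)
      End = LayoutStr.size();
    std::string Spec = LayoutStr.substr(Pos, End - Pos);
    Pos = End + 1;
    if (Spec.empty() || Spec[0] != 'A')
      continue; // some other specification, e.g. "e" or "p:64:64"
    if (Spec.size() == 1 || Spec.size() > 9)
      return false; // require between one and eight decimal digits
    uint32_t N = 0;
    for (size_t I = 1; I < Spec.size(); ++I) {
      if (Spec[I] < '0' || Spec[I] > '9')
        return false;
      N = N * 10 + static_cast<uint32_t>(Spec[I] - '0');
    }
    if (N >= (1u << 24))
      return false; // assumed limit: address spaces fit in 24 bits
    Result = N;
  }
  Out = Result;
  return true;
}
```

For example, a string such as `"e-p:64:64-A5"` would yield address space 5 under this sketch, while `"e-p:64:64"` falls back to the default of 0.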

include/llvm/IR/DataLayout.h

[98 lines hidden]
/// The target data layout string is specified by the target - a frontend		/// The target data layout string is specified by the target - a frontend
/// generating LLVM IR is required to generate the right target data for the		/// generating LLVM IR is required to generate the right target data for the
/// target being codegen'd to.		/// target being codegen'd to.
class DataLayout {		class DataLayout {
private:		private:
/// Defaults to false.		/// Defaults to false.
bool BigEndian;		bool BigEndian;

		unsigned StackAddrSpace;
unsigned StackNaturalAlign;		unsigned StackNaturalAlign;

enum ManglingModeT {		enum ManglingModeT {
MM_None,		MM_None,
MM_ELF,		MM_ELF,
MM_MachO,		MM_MachO,
MM_WinCOFF,		MM_WinCOFF,
MM_WinCOFFX86,		MM_WinCOFFX86,
[79 lines hidden; context: public:]
void init(const Module *M);		void init(const Module *M);

DataLayout(const DataLayout &DL) : LayoutMap(nullptr) { *this = DL; }		DataLayout(const DataLayout &DL) : LayoutMap(nullptr) { *this = DL; }

DataLayout &operator=(const DataLayout &DL) {		DataLayout &operator=(const DataLayout &DL) {
clear();		clear();
StringRepresentation = DL.StringRepresentation;		StringRepresentation = DL.StringRepresentation;
BigEndian = DL.isBigEndian();		BigEndian = DL.isBigEndian();
		StackAddrSpace = DL.StackAddrSpace;
StackNaturalAlign = DL.StackNaturalAlign;		StackNaturalAlign = DL.StackNaturalAlign;
ManglingMode = DL.ManglingMode;		ManglingMode = DL.ManglingMode;
LegalIntWidths = DL.LegalIntWidths;		LegalIntWidths = DL.LegalIntWidths;
Alignments = DL.Alignments;		Alignments = DL.Alignments;
Pointers = DL.Pointers;		Pointers = DL.Pointers;
NonIntegralAddressSpaces = DL.NonIntegralAddressSpaces;		NonIntegralAddressSpaces = DL.NonIntegralAddressSpaces;
return *this;		return *this;
}		}
[39 lines hidden; context: public:]
bool isIllegalInteger(uint64_t Width) const { return !isLegalInteger(Width); }		bool isIllegalInteger(uint64_t Width) const { return !isLegalInteger(Width); }

/// Returns true if the given alignment exceeds the natural stack alignment.		/// Returns true if the given alignment exceeds the natural stack alignment.
bool exceedsNaturalStackAlignment(unsigned Align) const {		bool exceedsNaturalStackAlignment(unsigned Align) const {
return (StackNaturalAlign != 0) && (Align > StackNaturalAlign);		return (StackNaturalAlign != 0) && (Align > StackNaturalAlign);
}		}

unsigned getStackAlignment() const { return StackNaturalAlign; }		unsigned getStackAlignment() const { return StackNaturalAlign; }
		unsigned getStackAddrSpace() const { return StackAddrSpace; }

bool hasMicrosoftFastStdCallMangling() const {		bool hasMicrosoftFastStdCallMangling() const {
return ManglingMode == MM_WinCOFFX86;		return ManglingMode == MM_WinCOFFX86;
}		}

bool hasLinkerPrivateGlobalPrefix() const { return ManglingMode == MM_MachO; }		bool hasLinkerPrivateGlobalPrefix() const { return ManglingMode == MM_MachO; }

StringRef getLinkerPrivateGlobalPrefix() const {		StringRef getLinkerPrivateGlobalPrefix() const {
[311 lines hidden]

include/llvm/IR/IRBuilder.h

[1,083 lines hidden; context: if (Constant *VC = dyn_cast<Constant>(V))]
return Insert(Folder.CreateNot(VC), Name);		return Insert(Folder.CreateNot(VC), Name);
return Insert(BinaryOperator::CreateNot(V), Name);		return Insert(BinaryOperator::CreateNot(V), Name);
}		}

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Instruction creation methods: Memory Instructions		// Instruction creation methods: Memory Instructions
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

AllocaInst *CreateAlloca(Type *Ty, Value *ArraySize = nullptr,		AllocaInst *CreateAlloca(Type *Ty, unsigned AddrSpace,
const Twine &Name = "") {		Value *ArraySize = nullptr, const Twine &Name = "") {
return Insert(new AllocaInst(Ty, ArraySize), Name);		return Insert(new AllocaInst(Ty, AddrSpace, ArraySize), Name);
		}

		AllocaInst *CreateAlloca(const DataLayout &DL, Type *Ty,
		Value *ArraySize = nullptr, const Twine &Name = "") {
		efriedma: I'm not sure this overload is a good idea; it saves a little typing, but it hides the reason we need the datalayout.
		arsenm: I'm not sure that's a problem here. This is the most common version, and the point of IRBuilder is to be the convenient way of creating IR. A DataLayout argument is less error prone than an unsigned argument, where it is easier to accidentally pass the alignment or something else. It would also be possible to hide this more by getting the parent module's DataLayout from the insertion block (which is what the CHERI patches did). However, there are already functions using DataLayout input arguments, so I figured there might be some reason to pass it in rather than doing that.
		efriedma: I guess. The part that concerns me is that the type of the result might not be what someone expects if they aren't paying attention, and the resulting assertion failure is a bit unfriendly... but I guess we can go with this for now, and see if it ends up being confusing in practice. Getting the DataLayout from the insertion block doesn't work for all methods on IRBuilder; an IRBuilder isn't guaranteed to always have an insertion point. This can be used, for example, if you're building a constant. I guess for alloca in particular, it would be safe to assume we have an insertion point.
		return Insert(new AllocaInst(Ty, DL.getStackAddrSpace(), ArraySize), Name);
}		}
// \brief Provided to resolve 'CreateLoad(Ptr, "...")' correctly, instead of		// \brief Provided to resolve 'CreateLoad(Ptr, "...")' correctly, instead of
// converting the string to 'bool' for the isVolatile parameter.		// converting the string to 'bool' for the isVolatile parameter.
LoadInst *CreateLoad(Value *Ptr, const char *Name) {		LoadInst *CreateLoad(Value *Ptr, const char *Name) {
return Insert(new LoadInst(Ptr), Name);		return Insert(new LoadInst(Ptr), Name);
}		}
LoadInst *CreateLoad(Value *Ptr, const Twine &Name = "") {		LoadInst *CreateLoad(Value *Ptr, const Twine &Name = "") {
return Insert(new LoadInst(Ptr), Name);		return Insert(new LoadInst(Ptr), Name);

include/llvm/IR/Instructions.h


	protected:			protected:
	// Note: Instruction needs to be a friend here to call cloneImpl.			// Note: Instruction needs to be a friend here to call cloneImpl.
	friend class Instruction;			friend class Instruction;

	AllocaInst *cloneImpl() const;			AllocaInst *cloneImpl() const;

	public:			public:
	explicit AllocaInst(Type *Ty, Value *ArraySize = nullptr,			explicit AllocaInst(Type *Ty, unsigned AddrSpace,
				Value *ArraySize = nullptr,
	const Twine &Name = "",			const Twine &Name = "",
	Instruction *InsertBefore = nullptr);			Instruction *InsertBefore = nullptr);
	AllocaInst(Type *Ty, Value *ArraySize,			AllocaInst(Type *Ty, unsigned AddrSpace, Value *ArraySize,
	const Twine &Name, BasicBlock *InsertAtEnd);			const Twine &Name, BasicBlock *InsertAtEnd);

	AllocaInst(Type *Ty, const Twine &Name, Instruction *InsertBefore = nullptr);			AllocaInst(Type *Ty, unsigned AddrSpace,
	AllocaInst(Type *Ty, const Twine &Name, BasicBlock *InsertAtEnd);			const Twine &Name, Instruction *InsertBefore = nullptr);
				AllocaInst(Type *Ty, unsigned AddrSpace,
				const Twine &Name, BasicBlock *InsertAtEnd);

	AllocaInst(Type *Ty, Value *ArraySize, unsigned Align,			AllocaInst(Type *Ty, unsigned AddrSpace, Value *ArraySize, unsigned Align,
	const Twine &Name = "", Instruction *InsertBefore = nullptr);			const Twine &Name = "", Instruction *InsertBefore = nullptr);
	AllocaInst(Type *Ty, Value *ArraySize, unsigned Align,			AllocaInst(Type *Ty, unsigned AddrSpace, Value *ArraySize, unsigned Align,
	const Twine &Name, BasicBlock *InsertAtEnd);			const Twine &Name, BasicBlock *InsertAtEnd);

	// Out of line virtual method, so the vtable, etc. has a home.			// Out of line virtual method, so the vtable, etc. has a home.
	~AllocaInst() override;			~AllocaInst() override;

	/// Return true if there is an allocation size parameter to the allocation			/// Return true if there is an allocation size parameter to the allocation
	/// instruction that is not 1.			/// instruction that is not 1.
	bool isArrayAllocation() const;			bool isArrayAllocation() const;

lib/AsmParser/LLParser.h

	private:
bool ParseOptionalCallingConv(unsigned &CC);		bool ParseOptionalCallingConv(unsigned &CC);
bool ParseOptionalAlignment(unsigned &Alignment);		bool ParseOptionalAlignment(unsigned &Alignment);
bool ParseOptionalDerefAttrBytes(lltok::Kind AttrKind, uint64_t &Bytes);		bool ParseOptionalDerefAttrBytes(lltok::Kind AttrKind, uint64_t &Bytes);
bool ParseScopeAndOrdering(bool isAtomic, SynchronizationScope &Scope,		bool ParseScopeAndOrdering(bool isAtomic, SynchronizationScope &Scope,
AtomicOrdering &Ordering);		AtomicOrdering &Ordering);
bool ParseOrdering(AtomicOrdering &Ordering);		bool ParseOrdering(AtomicOrdering &Ordering);
bool ParseOptionalStackAlignment(unsigned &Alignment);		bool ParseOptionalStackAlignment(unsigned &Alignment);
bool ParseOptionalCommaAlign(unsigned &Alignment, bool &AteExtraComma);		bool ParseOptionalCommaAlign(unsigned &Alignment, bool &AteExtraComma);
		bool ParseOptionalCommaAddrSpace(unsigned &AddrSpace, LocTy &Loc,
		bool &AteExtraComma);
bool ParseOptionalCommaInAlloca(bool &IsInAlloca);		bool ParseOptionalCommaInAlloca(bool &IsInAlloca);
bool parseAllocSizeArguments(unsigned &ElemSizeArg,		bool parseAllocSizeArguments(unsigned &ElemSizeArg,
Optional<unsigned> &HowManyArg);		Optional<unsigned> &HowManyArg);
bool ParseIndexList(SmallVectorImpl<unsigned> &Indices,		bool ParseIndexList(SmallVectorImpl<unsigned> &Indices,
bool &AteExtraComma);		bool &AteExtraComma);
bool ParseIndexList(SmallVectorImpl<unsigned> &Indices) {		bool ParseIndexList(SmallVectorImpl<unsigned> &Indices) {
bool AteExtraComma;		bool AteExtraComma;
if (ParseIndexList(Indices, AteExtraComma)) return true;		if (ParseIndexList(Indices, AteExtraComma)) return true;

lib/AsmParser/LLParser.cpp

	if (Lex.getKind() != lltok::kw_align)
return Error(Lex.getLoc(), "expected metadata or 'align'");		return Error(Lex.getLoc(), "expected metadata or 'align'");

if (ParseOptionalAlignment(Alignment)) return true;		if (ParseOptionalAlignment(Alignment)) return true;
}		}

return false;		return false;
}		}

		/// ParseOptionalCommaAddrSpace
		/// ::=
		/// ::= ',' addrspace(1)
		///
		/// This returns with AteExtraComma set to true if it ate an excess comma at the
		/// end.
		bool LLParser::ParseOptionalCommaAddrSpace(unsigned &AddrSpace,
		LocTy &Loc,
		bool &AteExtraComma) {
		AteExtraComma = false;
		while (EatIfPresent(lltok::comma)) {
		// Metadata at the end is an early exit.
		if (Lex.getKind() == lltok::MetadataVar) {
		AteExtraComma = true;
		return false;
		}

		Loc = Lex.getLoc();
		if (Lex.getKind() != lltok::kw_addrspace)
		return Error(Lex.getLoc(), "expected metadata or 'addrspace'");

		if (ParseOptionalAddrSpace(AddrSpace))
		return true;
		}

		return false;
		}

bool LLParser::parseAllocSizeArguments(unsigned &BaseSizeArg,		bool LLParser::parseAllocSizeArguments(unsigned &BaseSizeArg,
Optional<unsigned> &HowManyArg) {		Optional<unsigned> &HowManyArg) {
Lex.Lex();		Lex.Lex();

auto StartParen = Lex.getLoc();		auto StartParen = Lex.getLoc();
if (!EatIfPresent(lltok::lparen))		if (!EatIfPresent(lltok::lparen))
return Error(StartParen, "expected '('");		return Error(StartParen, "expected '('");

▲ Show 20 Lines • Show All 4,169 Lines • ▼ Show 20 Lines
// Memory Instructions.		// Memory Instructions.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// ParseAlloc		/// ParseAlloc
/// ::= 'alloca' 'inalloca'? 'swifterror'? Type (',' TypeAndValue)?		/// ::= 'alloca' 'inalloca'? 'swifterror'? Type (',' TypeAndValue)?
/// (',' 'align' i32)?		/// (',' 'align' i32)?
int LLParser::ParseAlloc(Instruction *&Inst, PerFunctionState &PFS) {		int LLParser::ParseAlloc(Instruction *&Inst, PerFunctionState &PFS) {
Value *Size = nullptr;		Value *Size = nullptr;
LocTy SizeLoc, TyLoc;		LocTy SizeLoc, TyLoc, ASLoc;
unsigned Alignment = 0;		unsigned Alignment = 0;
		unsigned AddrSpace = 0;
Type *Ty = nullptr;		Type *Ty = nullptr;

bool IsInAlloca = EatIfPresent(lltok::kw_inalloca);		bool IsInAlloca = EatIfPresent(lltok::kw_inalloca);
bool IsSwiftError = EatIfPresent(lltok::kw_swifterror);		bool IsSwiftError = EatIfPresent(lltok::kw_swifterror);

if (ParseType(Ty, TyLoc)) return true;		if (ParseType(Ty, TyLoc)) return true;

if (Ty->isFunctionTy() || !PointerType::isValidElementType(Ty))		if (Ty->isFunctionTy() || !PointerType::isValidElementType(Ty))
return Error(TyLoc, "invalid type for alloca");		return Error(TyLoc, "invalid type for alloca");

bool AteExtraComma = false;		bool AteExtraComma = false;
if (EatIfPresent(lltok::comma)) {		if (EatIfPresent(lltok::comma)) {
if (Lex.getKind() == lltok::kw_align) {		if (Lex.getKind() == lltok::kw_align) {
if (ParseOptionalAlignment(Alignment)) return true;		if (ParseOptionalAlignment(Alignment))
		return true;
		if (ParseOptionalCommaAddrSpace(AddrSpace, ASLoc, AteExtraComma))
		return true;
		} else if (Lex.getKind() == lltok::kw_addrspace) {
		ASLoc = Lex.getLoc();
		if (ParseOptionalAddrSpace(AddrSpace))
		return true;
} else if (Lex.getKind() == lltok::MetadataVar) {		} else if (Lex.getKind() == lltok::MetadataVar) {
AteExtraComma = true;		AteExtraComma = true;
} else {		} else {
if (ParseTypeAndValue(Size, SizeLoc, PFS) ||		if (ParseTypeAndValue(Size, SizeLoc, PFS) ||
ParseOptionalCommaAlign(Alignment, AteExtraComma))		ParseOptionalCommaAlign(Alignment, AteExtraComma) ||
		(!AteExtraComma &&
		ParseOptionalCommaAddrSpace(AddrSpace, ASLoc, AteExtraComma)))
return true;		return true;
}		}
}		}

if (Size && !Size->getType()->isIntegerTy())		if (Size && !Size->getType()->isIntegerTy())
return Error(SizeLoc, "element count must have integer type");		return Error(SizeLoc, "element count must have integer type");

AllocaInst *AI = new AllocaInst(Ty, Size, Alignment);		const DataLayout &DL = M->getDataLayout();
		unsigned AS = DL.getStackAddrSpace();
		if (AS != AddrSpace) {
		// TODO: In the future it should be possible to specify addrspace per-alloca.
		return Error(ASLoc, "address space must match datalayout");
		}

		AllocaInst *AI = new AllocaInst(Ty, AS, Size, Alignment);
AI->setUsedWithInAlloca(IsInAlloca);		AI->setUsedWithInAlloca(IsInAlloca);
AI->setSwiftError(IsSwiftError);		AI->setSwiftError(IsSwiftError);
Inst = AI;		Inst = AI;
return AteExtraComma ? InstExtraComma : InstNormal;		return AteExtraComma ? InstExtraComma : InstNormal;
}		}

/// ParseLoad		/// ParseLoad
/// ::= 'load' 'volatile'? TypeAndValue (',' 'align' i32)?		/// ::= 'load' 'volatile'? TypeAndValue (',' 'align' i32)?

lib/Bitcode/Reader/BitcodeReader.cpp

	case bitc::FUNC_CODE_INST_ALLOCA: { // ALLOCA: [instty, opty, op, align]
Type *OpTy = getTypeByID(Record[1]);		Type *OpTy = getTypeByID(Record[1]);
Value *Size = getFnValueByID(Record[2], OpTy);		Value *Size = getFnValueByID(Record[2], OpTy);
unsigned Align;		unsigned Align;
if (Error Err = parseAlignmentValue(AlignRecord & ~FlagMask, Align)) {		if (Error Err = parseAlignmentValue(AlignRecord & ~FlagMask, Align)) {
return Err;		return Err;
}		}
if (!Ty || !Size)		if (!Ty || !Size)
return error("Invalid record");		return error("Invalid record");
AllocaInst *AI = new AllocaInst(Ty, Size, Align);
		// FIXME: Make this an optional field.
		const DataLayout &DL = TheModule->getDataLayout();
		unsigned AS = DL.getStackAddrSpace();

		AllocaInst *AI = new AllocaInst(Ty, AS, Size, Align);
AI->setUsedWithInAlloca(InAlloca);		AI->setUsedWithInAlloca(InAlloca);
AI->setSwiftError(SwiftError);		AI->setSwiftError(SwiftError);
I = AI;		I = AI;
InstructionList.push_back(I);		InstructionList.push_back(I);
break;		break;
}		}
case bitc::FUNC_CODE_INST_LOAD: { // LOAD: [opty, op, align, vol]		case bitc::FUNC_CODE_INST_LOAD: { // LOAD: [opty, op, align, vol]
unsigned OpNum = 0;		unsigned OpNum = 0;

lib/CodeGen/AtomicExpandPass.cpp

	bool AtomicExpand::expandAtomicOpToLibcall(

// 'ptr' argument.		// 'ptr' argument.
Value *PtrVal =		Value *PtrVal =
Builder.CreateBitCast(PointerOperand, Type::getInt8PtrTy(Ctx));		Builder.CreateBitCast(PointerOperand, Type::getInt8PtrTy(Ctx));
Args.push_back(PtrVal);		Args.push_back(PtrVal);

// 'expected' argument, if present.		// 'expected' argument, if present.
if (CASExpected) {		if (CASExpected) {
AllocaCASExpected = AllocaBuilder.CreateAlloca(CASExpected->getType());		AllocaCASExpected = AllocaBuilder.CreateAlloca(DL, CASExpected->getType());
AllocaCASExpected->setAlignment(AllocaAlignment);		AllocaCASExpected->setAlignment(AllocaAlignment);
AllocaCASExpected_i8 =		AllocaCASExpected_i8 =
Builder.CreateBitCast(AllocaCASExpected, Type::getInt8PtrTy(Ctx));		Builder.CreateBitCast(AllocaCASExpected, Type::getInt8PtrTy(Ctx));
Builder.CreateLifetimeStart(AllocaCASExpected_i8, SizeVal64);		Builder.CreateLifetimeStart(AllocaCASExpected_i8, SizeVal64);
Builder.CreateAlignedStore(CASExpected, AllocaCASExpected, AllocaAlignment);		Builder.CreateAlignedStore(CASExpected, AllocaCASExpected, AllocaAlignment);
Args.push_back(AllocaCASExpected_i8);		Args.push_back(AllocaCASExpected_i8);
}		}

// 'val' argument ('desired' for cas), if present.		// 'val' argument ('desired' for cas), if present.
if (ValueOperand) {		if (ValueOperand) {
if (UseSizedLibcall) {		if (UseSizedLibcall) {
Value *IntValue =		Value *IntValue =
Builder.CreateBitOrPointerCast(ValueOperand, SizedIntTy);		Builder.CreateBitOrPointerCast(ValueOperand, SizedIntTy);
Args.push_back(IntValue);		Args.push_back(IntValue);
} else {		} else {
AllocaValue = AllocaBuilder.CreateAlloca(ValueOperand->getType());		AllocaValue = AllocaBuilder.CreateAlloca(DL, ValueOperand->getType());
AllocaValue->setAlignment(AllocaAlignment);		AllocaValue->setAlignment(AllocaAlignment);
AllocaValue_i8 =		AllocaValue_i8 =
Builder.CreateBitCast(AllocaValue, Type::getInt8PtrTy(Ctx));		Builder.CreateBitCast(AllocaValue, Type::getInt8PtrTy(Ctx));
Builder.CreateLifetimeStart(AllocaValue_i8, SizeVal64);		Builder.CreateLifetimeStart(AllocaValue_i8, SizeVal64);
Builder.CreateAlignedStore(ValueOperand, AllocaValue, AllocaAlignment);		Builder.CreateAlignedStore(ValueOperand, AllocaValue, AllocaAlignment);
Args.push_back(AllocaValue_i8);		Args.push_back(AllocaValue_i8);
}		}
}		}

// 'ret' argument.		// 'ret' argument.
if (!CASExpected && HasResult && !UseSizedLibcall) {		if (!CASExpected && HasResult && !UseSizedLibcall) {
AllocaResult = AllocaBuilder.CreateAlloca(I->getType());		AllocaResult = AllocaBuilder.CreateAlloca(DL, I->getType());
AllocaResult->setAlignment(AllocaAlignment);		AllocaResult->setAlignment(AllocaAlignment);
AllocaResult_i8 =		AllocaResult_i8 =
Builder.CreateBitCast(AllocaResult, Type::getInt8PtrTy(Ctx));		Builder.CreateBitCast(AllocaResult, Type::getInt8PtrTy(Ctx));
Builder.CreateLifetimeStart(AllocaResult_i8, SizeVal64);		Builder.CreateLifetimeStart(AllocaResult_i8, SizeVal64);
Args.push_back(AllocaResult_i8);		Args.push_back(AllocaResult_i8);
}		}

// 'ordering' ('success_order' for cas) argument.		// 'ordering' ('success_order' for cas) argument.

lib/CodeGen/SafeStack.cpp

	public:

void getAnalysisUsage(AnalysisUsage &AU) const override {		void getAnalysisUsage(AnalysisUsage &AU) const override {
AU.addRequired<ScalarEvolutionWrapperPass>();		AU.addRequired<ScalarEvolutionWrapperPass>();
}		}

bool doInitialization(Module &M) override {		bool doInitialization(Module &M) override {
DL = &M.getDataLayout();		DL = &M.getDataLayout();

StackPtrTy = Type::getInt8PtrTy(M.getContext());		StackPtrTy = Type::getInt8PtrTy(M.getContext(), DL->getStackAddrSpace());
IntPtrTy = DL->getIntPtrType(M.getContext());		IntPtrTy = DL->getIntPtrType(M.getContext());
Int32Ty = Type::getInt32Ty(M.getContext());		Int32Ty = Type::getInt32Ty(M.getContext());
Int8Ty = Type::getInt8Ty(M.getContext());		Int8Ty = Type::getInt8Ty(M.getContext());

return false;		return false;
}		}

bool runOnFunction(Function &F) override;		bool runOnFunction(Function &F) override;
▲ Show 20 Lines • Show All 216 Lines • ▼ Show 20 Lines	SafeStack::createStackRestorePoints(IRBuilder<> &IRB, Function &F,

// FIXME: On some platforms this could be handled by the longjmp/exception		// FIXME: On some platforms this could be handled by the longjmp/exception
// runtime itself.		// runtime itself.

AllocaInst *DynamicTop = nullptr;		AllocaInst *DynamicTop = nullptr;
if (NeedDynamicTop) {		if (NeedDynamicTop) {
// If we also have dynamic alloca's, the stack pointer value changes		// If we also have dynamic alloca's, the stack pointer value changes
// throughout the function. For now we store it in an alloca.		// throughout the function. For now we store it in an alloca.
DynamicTop = IRB.CreateAlloca(StackPtrTy, /*ArraySize=*/nullptr,		DynamicTop = IRB.CreateAlloca(*DL, StackPtrTy, /*ArraySize=*/nullptr,
"unsafe_stack_dynamic_ptr");		"unsafe_stack_dynamic_ptr");
IRB.CreateStore(StaticTop, DynamicTop);		IRB.CreateStore(StaticTop, DynamicTop);
}		}

// Restore current stack pointer after longjmp/exception catch.		// Restore current stack pointer after longjmp/exception catch.
for (Instruction *I : StackRestorePoints) {		for (Instruction *I : StackRestorePoints) {
++NumUnsafeStackRestorePoints;		++NumUnsafeStackRestorePoints;

▲ Show 20 Lines • Show All 315 Lines • ▼ Show 20 Lines	bool SafeStack::runOnFunction(Function &F) {
assert(BasePointer->getType() == StackPtrTy);		assert(BasePointer->getType() == StackPtrTy);

AllocaInst *StackGuardSlot = nullptr;		AllocaInst *StackGuardSlot = nullptr;
// FIXME: implement weaker forms of stack protector.		// FIXME: implement weaker forms of stack protector.
if (F.hasFnAttribute(Attribute::StackProtect) ||		if (F.hasFnAttribute(Attribute::StackProtect) ||
F.hasFnAttribute(Attribute::StackProtectStrong) ||		F.hasFnAttribute(Attribute::StackProtectStrong) ||
F.hasFnAttribute(Attribute::StackProtectReq)) {		F.hasFnAttribute(Attribute::StackProtectReq)) {
Value *StackGuard = getStackGuard(IRB, F);		Value *StackGuard = getStackGuard(IRB, F);
StackGuardSlot = IRB.CreateAlloca(StackPtrTy, nullptr);		StackGuardSlot = IRB.CreateAlloca(*DL, StackPtrTy, nullptr);
IRB.CreateStore(StackGuard, StackGuardSlot);		IRB.CreateStore(StackGuard, StackGuardSlot);

for (ReturnInst *RI : Returns) {		for (ReturnInst *RI : Returns) {
IRBuilder<> IRBRet(RI);		IRBuilder<> IRBRet(RI);
checkStackGuard(IRBRet, F, *RI, StackGuardSlot, StackGuard);		checkStackGuard(IRBRet, F, *RI, StackGuardSlot, StackGuard);
}		}
}		}


lib/CodeGen/ShadowStackGCLowering.cpp

	bool ShadowStackGCLowering::runOnFunction(Function &F) {
// Build the constant map and figure the type of the shadow stack entry.		// Build the constant map and figure the type of the shadow stack entry.
Value *FrameMap = GetFrameMap(F);		Value *FrameMap = GetFrameMap(F);
Type *ConcreteStackEntryTy = GetConcreteStackEntryType(F);		Type *ConcreteStackEntryTy = GetConcreteStackEntryType(F);

// Build the shadow stack entry at the very start of the function.		// Build the shadow stack entry at the very start of the function.
BasicBlock::iterator IP = F.getEntryBlock().begin();		BasicBlock::iterator IP = F.getEntryBlock().begin();
IRBuilder<> AtEntry(IP->getParent(), IP);		IRBuilder<> AtEntry(IP->getParent(), IP);

		const DataLayout &DL = F.getParent()->getDataLayout();
Instruction *StackEntry =		Instruction *StackEntry =
AtEntry.CreateAlloca(ConcreteStackEntryTy, nullptr, "gc_frame");		AtEntry.CreateAlloca(DL, ConcreteStackEntryTy, nullptr, "gc_frame");

while (isa<AllocaInst>(IP))		while (isa<AllocaInst>(IP))
++IP;		++IP;
AtEntry.SetInsertPoint(IP->getParent(), IP);		AtEntry.SetInsertPoint(IP->getParent(), IP);

// Initialize the map pointer and load the current head of the shadow stack.		// Initialize the map pointer and load the current head of the shadow stack.
Instruction *CurrentHead = AtEntry.CreateLoad(Head, "gc_currhead");		Instruction *CurrentHead = AtEntry.CreateLoad(Head, "gc_currhead");
Instruction *EntryMapPtr = CreateGEP(Context, AtEntry, ConcreteStackEntryTy,		Instruction *EntryMapPtr = CreateGEP(Context, AtEntry, ConcreteStackEntryTy,

lib/CodeGen/SjLjEHPrepare.cpp

	Value *SjLjEHPrepare::setupFunctionContext(Function &F,
ArrayRef<LandingPadInst *> LPads) {		ArrayRef<LandingPadInst *> LPads) {
BasicBlock *EntryBB = &F.front();		BasicBlock *EntryBB = &F.front();

// Create an alloca for the incoming jump buffer ptr and the new jump buffer		// Create an alloca for the incoming jump buffer ptr and the new jump buffer
// that needs to be restored on all exits from the function. This is an alloca		// that needs to be restored on all exits from the function. This is an alloca
// because the value needs to be added to the global context list.		// because the value needs to be added to the global context list.
auto &DL = F.getParent()->getDataLayout();		auto &DL = F.getParent()->getDataLayout();
unsigned Align = DL.getPrefTypeAlignment(FunctionContextTy);		unsigned Align = DL.getPrefTypeAlignment(FunctionContextTy);
FuncCtx = new AllocaInst(FunctionContextTy, nullptr, Align, "fn_context",		FuncCtx = new AllocaInst(FunctionContextTy, DL.getStackAddrSpace(),
&EntryBB->front());		nullptr, Align, "fn_context", &EntryBB->front());

// Fill in the function context structure.		// Fill in the function context structure.
for (LandingPadInst *LPI : LPads) {		for (LandingPadInst *LPI : LPads) {
IRBuilder<> Builder(LPI->getParent(),		IRBuilder<> Builder(LPI->getParent(),
LPI->getParent()->getFirstInsertionPt());		LPI->getParent()->getFirstInsertionPt());

// Reference the __data field.		// Reference the __data field.
Value *FCData =		Value *FCData =

lib/CodeGen/StackProtector.cpp

	///			///
	/// Returns true if the platform/triple supports the stackprotectorcreate pseudo			/// Returns true if the platform/triple supports the stackprotectorcreate pseudo
	/// node.			/// node.
	static bool CreatePrologue(Function *F, Module *M, ReturnInst *RI,			static bool CreatePrologue(Function *F, Module *M, ReturnInst *RI,
	const TargetLoweringBase *TLI, AllocaInst *&AI) {			const TargetLoweringBase *TLI, AllocaInst *&AI) {
	bool SupportsSelectionDAGSP = false;			bool SupportsSelectionDAGSP = false;
	IRBuilder<> B(&F->getEntryBlock().front());			IRBuilder<> B(&F->getEntryBlock().front());
	PointerType *PtrTy = Type::getInt8PtrTy(RI->getContext());			PointerType *PtrTy = Type::getInt8PtrTy(RI->getContext());
	AI = B.CreateAlloca(PtrTy, nullptr, "StackGuardSlot");			AI = B.CreateAlloca(M->getDataLayout(), PtrTy, nullptr, "StackGuardSlot");

	Value *GuardSlot = getStackGuard(TLI, M, B, &SupportsSelectionDAGSP);			Value *GuardSlot = getStackGuard(TLI, M, B, &SupportsSelectionDAGSP);
	B.CreateCall(Intrinsic::getDeclaration(M, Intrinsic::stackprotector),			B.CreateCall(Intrinsic::getDeclaration(M, Intrinsic::stackprotector),
	{GuardSlot, AI});			{GuardSlot, AI});
	return SupportsSelectionDAGSP;			return SupportsSelectionDAGSP;
	}			}

	/// InsertStackProtectors - Insert code into the prologue and epilogue of the			/// InsertStackProtectors - Insert code into the prologue and epilogue of the

lib/CodeGen/WinEHPrepare.cpp

	private:
void cloneCommonBlocks(Function &F);		void cloneCommonBlocks(Function &F);
void removeImplausibleInstructions(Function &F);		void removeImplausibleInstructions(Function &F);
void cleanupPreparedFunclets(Function &F);		void cleanupPreparedFunclets(Function &F);
void verifyPreparedFunclets(Function &F);		void verifyPreparedFunclets(Function &F);

// All fields are reset by runOnFunction.		// All fields are reset by runOnFunction.
EHPersonality Personality = EHPersonality::Unknown;		EHPersonality Personality = EHPersonality::Unknown;

		const DataLayout *DL = nullptr;
DenseMap<BasicBlock *, ColorVector> BlockColors;		DenseMap<BasicBlock *, ColorVector> BlockColors;
MapVector<BasicBlock *, std::vector<BasicBlock *>> FuncletBlocks;		MapVector<BasicBlock *, std::vector<BasicBlock *>> FuncletBlocks;
};		};

} // end anonymous namespace		} // end anonymous namespace

char WinEHPrepare::ID = 0;		char WinEHPrepare::ID = 0;
INITIALIZE_TM_PASS(WinEHPrepare, "winehprepare", "Prepare Windows exceptions",		INITIALIZE_TM_PASS(WinEHPrepare, "winehprepare", "Prepare Windows exceptions",
Show All 9 Lines	bool WinEHPrepare::runOnFunction(Function &Fn) {

// Classify the personality to see what kind of preparation we need.		// Classify the personality to see what kind of preparation we need.
Personality = classifyEHPersonality(Fn.getPersonalityFn());		Personality = classifyEHPersonality(Fn.getPersonalityFn());

// Do nothing if this is not a funclet-based personality.		// Do nothing if this is not a funclet-based personality.
if (!isFuncletEHPersonality(Personality))		if (!isFuncletEHPersonality(Personality))
return false;		return false;

		DL = &Fn.getParent()->getDataLayout();
return prepareExplicitEH(Fn);		return prepareExplicitEH(Fn);
}		}

bool WinEHPrepare::doFinalization(Module &M) { return false; }		bool WinEHPrepare::doFinalization(Module &M) { return false; }

void WinEHPrepare::getAnalysisUsage(AnalysisUsage &AU) const {}		void WinEHPrepare::getAnalysisUsage(AnalysisUsage &AU) const {}

static int addUnwindMapEntry(WinEHFuncInfo &FuncInfo, int ToState,		static int addUnwindMapEntry(WinEHFuncInfo &FuncInfo, int ToState,
▲ Show 20 Lines • Show All 943 Lines • ▼ Show 20 Lines
AllocaInst *WinEHPrepare::insertPHILoads(PHINode *PN, Function &F) {		AllocaInst *WinEHPrepare::insertPHILoads(PHINode *PN, Function &F) {
BasicBlock *PHIBlock = PN->getParent();		BasicBlock *PHIBlock = PN->getParent();
AllocaInst *SpillSlot = nullptr;		AllocaInst *SpillSlot = nullptr;
Instruction *EHPad = PHIBlock->getFirstNonPHI();		Instruction *EHPad = PHIBlock->getFirstNonPHI();

if (!isa<TerminatorInst>(EHPad)) {		if (!isa<TerminatorInst>(EHPad)) {
// If the EHPad isn't a terminator, then we can insert a load in this block		// If the EHPad isn't a terminator, then we can insert a load in this block
// that will dominate all uses.		// that will dominate all uses.
SpillSlot = new AllocaInst(PN->getType(), nullptr,		SpillSlot = new AllocaInst(PN->getType(), DL->getStackAddrSpace(), nullptr,
Twine(PN->getName(), ".wineh.spillslot"),		Twine(PN->getName(), ".wineh.spillslot"),
&F.getEntryBlock().front());		&F.getEntryBlock().front());
Value *V = new LoadInst(SpillSlot, Twine(PN->getName(), ".wineh.reload"),		Value *V = new LoadInst(SpillSlot, Twine(PN->getName(), ".wineh.reload"),
&*PHIBlock->getFirstInsertionPt());		&*PHIBlock->getFirstInsertionPt());
PN->replaceAllUsesWith(V);		PN->replaceAllUsesWith(V);
return SpillSlot;		return SpillSlot;
}		}

▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines	void WinEHPrepare::insertPHIStore(
new StoreInst(PredVal, SpillSlot, PredBlock->getTerminator());		new StoreInst(PredVal, SpillSlot, PredBlock->getTerminator());
}		}

void WinEHPrepare::replaceUseWithLoad(Value *V, Use &U, AllocaInst *&SpillSlot,		void WinEHPrepare::replaceUseWithLoad(Value *V, Use &U, AllocaInst *&SpillSlot,
DenseMap<BasicBlock *, Value *> &Loads,		DenseMap<BasicBlock *, Value *> &Loads,
Function &F) {		Function &F) {
// Lazilly create the spill slot.		// Lazilly create the spill slot.
if (!SpillSlot)		if (!SpillSlot)
SpillSlot = new AllocaInst(V->getType(), nullptr,		SpillSlot = new AllocaInst(V->getType(), DL->getStackAddrSpace(), nullptr,
Twine(V->getName(), ".wineh.spillslot"),		Twine(V->getName(), ".wineh.spillslot"),
&F.getEntryBlock().front());		&F.getEntryBlock().front());

auto *UsingInst = cast<Instruction>(U.getUser());		auto *UsingInst = cast<Instruction>(U.getUser());
if (auto *UsingPHI = dyn_cast<PHINode>(UsingInst)) {		if (auto *UsingPHI = dyn_cast<PHINode>(UsingInst)) {
// If this is a PHI node, we can't insert a load of the value before		// If this is a PHI node, we can't insert a load of the value before
// the use. Instead insert the load in the predecessor block		// the use. Instead insert the load in the predecessor block
// corresponding to the incoming value.		// corresponding to the incoming value.

lib/IR/AsmWriter.cpp

	if (isa<BranchInst>(I) && cast<BranchInst>(I).isConditional()) {
if (!AI->getArraySize() || AI->isArrayAllocation() ||		if (!AI->getArraySize() || AI->isArrayAllocation() ||
!AI->getArraySize()->getType()->isIntegerTy(32)) {		!AI->getArraySize()->getType()->isIntegerTy(32)) {
Out << ", ";		Out << ", ";
writeOperand(AI->getArraySize(), true);		writeOperand(AI->getArraySize(), true);
}		}
if (AI->getAlignment()) {		if (AI->getAlignment()) {
Out << ", align " << AI->getAlignment();		Out << ", align " << AI->getAlignment();
}		}

		unsigned AddrSpace = AI->getType()->getAddressSpace();
		if (AddrSpace != 0) {
		Out << ", addrspace(" << AddrSpace << ')';
		}

} else if (isa<CastInst>(I)) {		} else if (isa<CastInst>(I)) {
if (Operand) {		if (Operand) {
Out << ' ';		Out << ' ';
writeOperand(Operand, true); // Work with broken code		writeOperand(Operand, true); // Work with broken code
}		}
Out << " to ";		Out << " to ";
TypePrinter.print(I.getType(), Out);		TypePrinter.print(I.getType(), Out);
} else if (isa<VAArgInst>(I)) {		} else if (isa<VAArgInst>(I)) {

lib/IR/Core.cpp

	LLVMValueRef LLVMBuildArrayMalloc(LLVMBuilderRef B, LLVMTypeRef Ty,
Instruction* Malloc = CallInst::CreateMalloc(unwrap(B)->GetInsertBlock(),		Instruction* Malloc = CallInst::CreateMalloc(unwrap(B)->GetInsertBlock(),
ITy, unwrap(Ty), AllocSize,		ITy, unwrap(Ty), AllocSize,
unwrap(Val), nullptr, "");		unwrap(Val), nullptr, "");
return wrap(unwrap(B)->Insert(Malloc, Twine(Name)));		return wrap(unwrap(B)->Insert(Malloc, Twine(Name)));
}		}

LLVMValueRef LLVMBuildAlloca(LLVMBuilderRef B, LLVMTypeRef Ty,		LLVMValueRef LLVMBuildAlloca(LLVMBuilderRef B, LLVMTypeRef Ty,
const char *Name) {		const char *Name) {
return wrap(unwrap(B)->CreateAlloca(unwrap(Ty), nullptr, Name));		const DataLayout &DL
		= unwrap(B)->GetInsertBlock()->getModule()->getDataLayout();
		return wrap(unwrap(B)->CreateAlloca(DL, unwrap(Ty), nullptr, Name));
}		}

LLVMValueRef LLVMBuildArrayAlloca(LLVMBuilderRef B, LLVMTypeRef Ty,		LLVMValueRef LLVMBuildArrayAlloca(LLVMBuilderRef B, LLVMTypeRef Ty,
LLVMValueRef Val, const char *Name) {		LLVMValueRef Val, const char *Name) {
return wrap(unwrap(B)->CreateAlloca(unwrap(Ty), unwrap(Val), Name));		const DataLayout &DL
		= unwrap(B)->GetInsertBlock()->getModule()->getDataLayout();

		return wrap(unwrap(B)->CreateAlloca(DL, unwrap(Ty), unwrap(Val), Name));
}		}

LLVMValueRef LLVMBuildFree(LLVMBuilderRef B, LLVMValueRef PointerVal) {		LLVMValueRef LLVMBuildFree(LLVMBuilderRef B, LLVMValueRef PointerVal) {
return wrap(unwrap(B)->Insert(		return wrap(unwrap(B)->Insert(
CallInst::CreateFree(unwrap(PointerVal), unwrap(B)->GetInsertBlock())));		CallInst::CreateFree(unwrap(PointerVal), unwrap(B)->GetInsertBlock())));
}		}

LLVMValueRef LLVMBuildLoad(LLVMBuilderRef B, LLVMValueRef PointerVal,		LLVMValueRef LLVMBuildLoad(LLVMBuilderRef B, LLVMValueRef PointerVal,

lib/IR/DataLayout.cpp

	static const LayoutAlignElem DefaultAlignments[] = {
{ AGGREGATE_ALIGN, 0, 0, 8 } // struct		{ AGGREGATE_ALIGN, 0, 0, 8 } // struct
};		};

void DataLayout::reset(StringRef Desc) {		void DataLayout::reset(StringRef Desc) {
clear();		clear();

LayoutMap = nullptr;		LayoutMap = nullptr;
BigEndian = false;		BigEndian = false;
		StackAddrSpace = 0;
StackNaturalAlign = 0;		StackNaturalAlign = 0;
ManglingMode = MM_None;		ManglingMode = MM_None;
NonIntegralAddressSpaces.clear();		NonIntegralAddressSpaces.clear();

// Default alignments		// Default alignments
for (const LayoutAlignElem &E : DefaultAlignments) {		for (const LayoutAlignElem &E : DefaultAlignments) {
setAlignment((AlignTypeEnum)E.AlignType, E.ABIAlign, E.PrefAlign,		setAlignment((AlignTypeEnum)E.AlignType, E.ABIAlign, E.PrefAlign,
E.TypeBitWidth);		E.TypeBitWidth);
▲ Show 20 Lines • Show All 162 Lines • ▼ Show 20 Lines	case 'n': // Native integer types.
break;		break;
Split = split(Rest, ':');		Split = split(Rest, ':');
}		}
break;		break;
case 'S': { // Stack natural alignment.		case 'S': { // Stack natural alignment.
StackNaturalAlign = inBytes(getInt(Tok));		StackNaturalAlign = inBytes(getInt(Tok));
break;		break;
}		}
		case 'A': { // Default stack/alloca address space.
		StackAddrSpace = getInt(Tok);
		if (!isUInt<24>(StackAddrSpace))
		report_fatal_error("Invalid address space, must be a 24bit integer");
		break;
		}
case 'm':		case 'm':
if (!Tok.empty())		if (!Tok.empty())
report_fatal_error("Unexpected trailing characters after mangling specifier in datalayout string");		report_fatal_error("Unexpected trailing characters after mangling specifier in datalayout string");
if (Rest.empty())		if (Rest.empty())
report_fatal_error("Expected mangling specifier in datalayout string");		report_fatal_error("Expected mangling specifier in datalayout string");
if (Rest.size() > 1)		if (Rest.size() > 1)
report_fatal_error("Unknown mangling specifier in datalayout string");		report_fatal_error("Unknown mangling specifier in datalayout string");
switch(Rest[0]) {		switch(Rest[0]) {
Show All 26 Lines
DataLayout::DataLayout(const Module *M) : LayoutMap(nullptr) {		DataLayout::DataLayout(const Module *M) : LayoutMap(nullptr) {
init(M);		init(M);
}		}

void DataLayout::init(const Module *M) { *this = M->getDataLayout(); }		void DataLayout::init(const Module *M) { *this = M->getDataLayout(); }

bool DataLayout::operator==(const DataLayout &Other) const {		bool DataLayout::operator==(const DataLayout &Other) const {
bool Ret = BigEndian == Other.BigEndian &&		bool Ret = BigEndian == Other.BigEndian &&
		StackAddrSpace == Other.StackAddrSpace &&
StackNaturalAlign == Other.StackNaturalAlign &&		StackNaturalAlign == Other.StackNaturalAlign &&
ManglingMode == Other.ManglingMode &&		ManglingMode == Other.ManglingMode &&
LegalIntWidths == Other.LegalIntWidths &&		LegalIntWidths == Other.LegalIntWidths &&
Alignments == Other.Alignments && Pointers == Other.Pointers;		Alignments == Other.Alignments && Pointers == Other.Pointers;
// Note: getStringRepresentation() might differs, it is not canonicalized		// Note: getStringRepresentation() might differs, it is not canonicalized
return Ret;		return Ret;
}		}

▲ Show 20 Lines • Show All 384 Lines • Show Last 20 Lines
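
The new 'A' case above reads an integer after the specifier and rejects values that do not fit in 24 bits. A minimal standalone sketch of that check follows, assuming a component of the form "A<n>". The name parseAllocaAddrSpace is hypothetical and not part of LLVM; the patch itself uses getInt(Tok), isUInt<24>, and report_fatal_error, whereas this sketch returns -1 on any error so it can run without LLVM.

```cpp
#include <cassert> // for the assertions shown in the usage note
#include <cstddef>
#include <cstdint>
#include <string>

// Hypothetical helper (not LLVM's API): parse one datalayout component of the
// form "A<n>" and return the alloca address space. Returns -1 if the
// component is malformed or the value does not fit in 24 bits.
long parseAllocaAddrSpace(const std::string &Spec) {
  // Need the 'A' specifier plus at least one digit.
  if (Spec.size() < 2 || Spec[0] != 'A')
    return -1;

  std::uint64_t Value = 0;
  for (std::size_t I = 1; I < Spec.size(); ++I) {
    const char C = Spec[I];
    if (C < '0' || C > '9')
      return -1; // non-digit character after the specifier
    Value = Value * 10 + static_cast<std::uint64_t>(C - '0');
    // Same bound as isUInt<24>: reject anything >= 2^24. Checking inside the
    // loop also keeps Value from overflowing on very long inputs.
    if (Value >= (std::uint64_t{1} << 24))
      return -1;
  }
  return static_cast<long>(Value);
}
```

Under these assumptions, `assert(parseAllocaAddrSpace("A5") == 5)` holds, while "A16777216" (2^24) and a non-'A' component such as "S32" are both rejected with -1.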

lib/IR/Instructions.cpp

Show First 20 Lines • Show All 1,193 Lines • ▼ Show 20 Lines	else {
assert(!isa<BasicBlock>(Amt) &&		assert(!isa<BasicBlock>(Amt) &&
"Passed basic block into allocation size parameter! Use other ctor");		"Passed basic block into allocation size parameter! Use other ctor");
assert(Amt->getType()->isIntegerTy() &&		assert(Amt->getType()->isIntegerTy() &&
"Allocation array size is not an integer!");		"Allocation array size is not an integer!");
}		}
return Amt;		return Amt;
}		}

AllocaInst::AllocaInst(Type *Ty, const Twine &Name, Instruction *InsertBefore)		AllocaInst::AllocaInst(Type *Ty, unsigned AddrSpace, const Twine &Name,
: AllocaInst(Ty, /*ArraySize=*/nullptr, Name, InsertBefore) {}

AllocaInst::AllocaInst(Type *Ty, const Twine &Name, BasicBlock *InsertAtEnd)
: AllocaInst(Ty, /*ArraySize=*/nullptr, Name, InsertAtEnd) {}

AllocaInst::AllocaInst(Type *Ty, Value *ArraySize, const Twine &Name,
Instruction *InsertBefore)		Instruction *InsertBefore)
: AllocaInst(Ty, ArraySize, /*Align=*/0, Name, InsertBefore) {}		: AllocaInst(Ty, AddrSpace, /*ArraySize=*/nullptr, Name, InsertBefore) {}

AllocaInst::AllocaInst(Type *Ty, Value *ArraySize, const Twine &Name,		AllocaInst::AllocaInst(Type *Ty, unsigned AddrSpace, const Twine &Name,
BasicBlock *InsertAtEnd)		BasicBlock *InsertAtEnd)
: AllocaInst(Ty, ArraySize, /*Align=*/0, Name, InsertAtEnd) {}		: AllocaInst(Ty, AddrSpace, /*ArraySize=*/nullptr, Name, InsertAtEnd) {}

AllocaInst::AllocaInst(Type *Ty, Value *ArraySize, unsigned Align,		AllocaInst::AllocaInst(Type *Ty, unsigned AddrSpace, Value *ArraySize,
const Twine &Name, Instruction *InsertBefore)		const Twine &Name, Instruction *InsertBefore)
: UnaryInstruction(PointerType::getUnqual(Ty), Alloca,		: AllocaInst(Ty, AddrSpace, ArraySize, /*Align=*/0, Name, InsertBefore) {}

		AllocaInst::AllocaInst(Type *Ty, unsigned AddrSpace, Value *ArraySize,
		const Twine &Name, BasicBlock *InsertAtEnd)
		: AllocaInst(Ty, AddrSpace, ArraySize, /*Align=*/0, Name, InsertAtEnd) {}

		AllocaInst::AllocaInst(Type *Ty, unsigned AddrSpace, Value *ArraySize,
		unsigned Align, const Twine &Name,
		Instruction *InsertBefore)
		: UnaryInstruction(PointerType::get(Ty, AddrSpace), Alloca,
getAISize(Ty->getContext(), ArraySize), InsertBefore),		getAISize(Ty->getContext(), ArraySize), InsertBefore),
AllocatedType(Ty) {		AllocatedType(Ty) {
setAlignment(Align);		setAlignment(Align);
assert(!Ty->isVoidTy() && "Cannot allocate void!");		assert(!Ty->isVoidTy() && "Cannot allocate void!");
setName(Name);		setName(Name);
}		}

AllocaInst::AllocaInst(Type *Ty, Value *ArraySize, unsigned Align,		AllocaInst::AllocaInst(Type *Ty, unsigned AddrSpace, Value *ArraySize,
const Twine &Name, BasicBlock *InsertAtEnd)		unsigned Align, const Twine &Name,
: UnaryInstruction(PointerType::getUnqual(Ty), Alloca,		BasicBlock *InsertAtEnd)
		: UnaryInstruction(PointerType::get(Ty, AddrSpace), Alloca,
getAISize(Ty->getContext(), ArraySize), InsertAtEnd),		getAISize(Ty->getContext(), ArraySize), InsertAtEnd),
AllocatedType(Ty) {		AllocatedType(Ty) {
setAlignment(Align);		setAlignment(Align);
assert(!Ty->isVoidTy() && "Cannot allocate void!");		assert(!Ty->isVoidTy() && "Cannot allocate void!");
setName(Name);		setName(Name);
}		}

// Out of line virtual method, so the vtable, etc has a home.		// Out of line virtual method, so the vtable, etc has a home.
AllocaInst::~AllocaInst() {		AllocaInst::~AllocaInst() {
▲ Show 20 Lines • Show All 2,583 Lines • ▼ Show 20 Lines
}		}

InsertValueInst *InsertValueInst::cloneImpl() const {		InsertValueInst *InsertValueInst::cloneImpl() const {
return new InsertValueInst(*this);		return new InsertValueInst(*this);
}		}

AllocaInst *AllocaInst::cloneImpl() const {		AllocaInst *AllocaInst::cloneImpl() const {
AllocaInst *Result = new AllocaInst(getAllocatedType(),		AllocaInst *Result = new AllocaInst(getAllocatedType(),
		getType()->getAddressSpace(),
(Value *)getOperand(0), getAlignment());		(Value *)getOperand(0), getAlignment());
Result->setUsedWithInAlloca(isUsedWithInAlloca());		Result->setUsedWithInAlloca(isUsedWithInAlloca());
Result->setSwiftError(isSwiftError());		Result->setSwiftError(isSwiftError());
return Result;		return Result;
}		}

LoadInst *LoadInst::cloneImpl() const {		LoadInst *LoadInst::cloneImpl() const {
return new LoadInst(getOperand(0), Twine(), isVolatile(),		return new LoadInst(getOperand(0), Twine(), isVolatile(),
▲ Show 20 Lines • Show All 161 Lines • Show Last 20 Lines

lib/IR/Verifier.cpp

Show First 20 Lines • Show All 3,160 Lines • ▼ Show 20 Lines	for (const User *U : SwiftErrorVal->users()) {
if (auto II = dyn_cast<InvokeInst>(U))		if (auto II = dyn_cast<InvokeInst>(U))
verifySwiftErrorCallSite(const_cast<InvokeInst*>(II), SwiftErrorVal);		verifySwiftErrorCallSite(const_cast<InvokeInst*>(II), SwiftErrorVal);
}		}
}		}

void Verifier::visitAllocaInst(AllocaInst &AI) {		void Verifier::visitAllocaInst(AllocaInst &AI) {
SmallPtrSet<Type*, 4> Visited;		SmallPtrSet<Type*, 4> Visited;
PointerType *PTy = AI.getType();		PointerType *PTy = AI.getType();
Assert(PTy->getAddressSpace() == 0,		// TODO: Relax this restriction?
"Allocation instruction pointer not in the generic address space!",		Assert(PTy->getAddressSpace() == DL.getStackAddrSpace(),
		"Allocation instruction pointer not in the stack address space!",
&AI);		&AI);
Assert(AI.getAllocatedType()->isSized(&Visited),		Assert(AI.getAllocatedType()->isSized(&Visited),
"Cannot allocate unsized type", &AI);		"Cannot allocate unsized type", &AI);
Assert(AI.getArraySize()->getType()->isIntegerTy(),		Assert(AI.getArraySize()->getType()->isIntegerTy(),
"Alloca array size must have integer type", &AI);		"Alloca array size must have integer type", &AI);
Assert(AI.getAlignment() <= Value::MaximumAlignment,		Assert(AI.getAlignment() <= Value::MaximumAlignment,
"huge alignment values are unsupported", &AI);		"huge alignment values are unsupported", &AI);

▲ Show 20 Lines • Show All 1,725 Lines • Show Last 20 Lines

lib/Target/NVPTX/NVPTXLowerArgs.cpp

	Show First 20 Lines • Show All 153 Lines • ▼ Show 20 Lines
	void NVPTXLowerArgs::handleByValParam(Argument *Arg) {			void NVPTXLowerArgs::handleByValParam(Argument *Arg) {
	Function *Func = Arg->getParent();			Function *Func = Arg->getParent();
	Instruction *FirstInst = &(Func->getEntryBlock().front());			Instruction *FirstInst = &(Func->getEntryBlock().front());
	PointerType *PType = dyn_cast<PointerType>(Arg->getType());			PointerType *PType = dyn_cast<PointerType>(Arg->getType());

	assert(PType && "Expecting pointer type in handleByValParam");			assert(PType && "Expecting pointer type in handleByValParam");

	Type *StructType = PType->getElementType();			Type *StructType = PType->getElementType();
	AllocaInst *AllocA = new AllocaInst(StructType, Arg->getName(), FirstInst);			unsigned AS = Func->getParent()->getDataLayout().getStackAddrSpace();
				AllocaInst *AllocA = new AllocaInst(StructType, AS, Arg->getName(), FirstInst);
	// Set the alignment to alignment of the byval parameter. This is because,			// Set the alignment to alignment of the byval parameter. This is because,
	// later load/stores assume that alignment, and we are going to replace			// later load/stores assume that alignment, and we are going to replace
	// the use of the byval parameter with this alloca instruction.			// the use of the byval parameter with this alloca instruction.
	AllocA->setAlignment(Func->getParamAlignment(Arg->getArgNo() + 1));			AllocA->setAlignment(Func->getParamAlignment(Arg->getArgNo() + 1));
	Arg->replaceAllUsesWith(AllocA);			Arg->replaceAllUsesWith(AllocA);

	Value *ArgInParam = new AddrSpaceCastInst(			Value *ArgInParam = new AddrSpaceCastInst(
	Arg, PointerType::get(StructType, ADDRESS_SPACE_PARAM), Arg->getName(),			Arg, PointerType::get(StructType, ADDRESS_SPACE_PARAM), Arg->getName(),
	▲ Show 20 Lines • Show All 83 Lines • Show Last 20 Lines

lib/Target/X86/X86WinEHState.cpp

Show First 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	private:

// Module-level type getters.		// Module-level type getters.
Type *getEHLinkRegistrationType();		Type *getEHLinkRegistrationType();
Type *getSEHRegistrationType();		Type *getSEHRegistrationType();
Type *getCXXEHRegistrationType();		Type *getCXXEHRegistrationType();

// Per-module data.		// Per-module data.
Module *TheModule = nullptr;		Module *TheModule = nullptr;
		const DataLayout *DL = nullptr;
StructType *EHLinkRegistrationTy = nullptr;		StructType *EHLinkRegistrationTy = nullptr;
StructType *CXXEHRegistrationTy = nullptr;		StructType *CXXEHRegistrationTy = nullptr;
StructType *SEHRegistrationTy = nullptr;		StructType *SEHRegistrationTy = nullptr;
Constant *SetJmp3 = nullptr;		Constant *SetJmp3 = nullptr;
Constant *CxxLongjmpUnwind = nullptr;		Constant *CxxLongjmpUnwind = nullptr;

// Per-function state		// Per-function state
EHPersonality Personality = EHPersonality::Unknown;		EHPersonality Personality = EHPersonality::Unknown;
Show All 22 Lines

char WinEHStatePass::ID = 0;		char WinEHStatePass::ID = 0;

INITIALIZE_PASS(WinEHStatePass, "x86-winehstate",		INITIALIZE_PASS(WinEHStatePass, "x86-winehstate",
"Insert stores for EH state numbers", false, false)		"Insert stores for EH state numbers", false, false)

bool WinEHStatePass::doInitialization(Module &M) {		bool WinEHStatePass::doInitialization(Module &M) {
TheModule = &M;		TheModule = &M;
		DL = &M.getDataLayout();
return false;		return false;
}		}

bool WinEHStatePass::doFinalization(Module &M) {		bool WinEHStatePass::doFinalization(Module &M) {
assert(TheModule == &M);		assert(TheModule == &M);
TheModule = nullptr;		TheModule = nullptr;
EHLinkRegistrationTy = nullptr;		EHLinkRegistrationTy = nullptr;
CXXEHRegistrationTy = nullptr;		CXXEHRegistrationTy = nullptr;
▲ Show 20 Lines • Show All 143 Lines • ▼ Show 20 Lines	void WinEHStatePass::emitExceptionRegistrationRecord(Function *F) {

IRBuilder<> Builder(&F->getEntryBlock(), F->getEntryBlock().begin());		IRBuilder<> Builder(&F->getEntryBlock(), F->getEntryBlock().begin());
Type *Int8PtrType = Builder.getInt8PtrTy();		Type *Int8PtrType = Builder.getInt8PtrTy();
Type *Int32Ty = Builder.getInt32Ty();		Type *Int32Ty = Builder.getInt32Ty();
Type *VoidTy = Builder.getVoidTy();		Type *VoidTy = Builder.getVoidTy();

if (Personality == EHPersonality::MSVC_CXX) {		if (Personality == EHPersonality::MSVC_CXX) {
RegNodeTy = getCXXEHRegistrationType();		RegNodeTy = getCXXEHRegistrationType();
RegNode = Builder.CreateAlloca(RegNodeTy);		RegNode = Builder.CreateAlloca(*DL, RegNodeTy);
// SavedESP = llvm.stacksave()		// SavedESP = llvm.stacksave()
Value *SP = Builder.CreateCall(		Value *SP = Builder.CreateCall(
Intrinsic::getDeclaration(TheModule, Intrinsic::stacksave), {});		Intrinsic::getDeclaration(TheModule, Intrinsic::stacksave), {});
Builder.CreateStore(SP, Builder.CreateStructGEP(RegNodeTy, RegNode, 0));		Builder.CreateStore(SP, Builder.CreateStructGEP(RegNodeTy, RegNode, 0));
// TryLevel = -1		// TryLevel = -1
StateFieldIndex = 2;		StateFieldIndex = 2;
ParentBaseState = -1;		ParentBaseState = -1;
insertStateNumberStore(&*Builder.GetInsertPoint(), ParentBaseState);		insertStateNumberStore(&*Builder.GetInsertPoint(), ParentBaseState);
Show All 10 Lines	void WinEHStatePass::emitExceptionRegistrationRecord(Function *F) {
} else if (Personality == EHPersonality::MSVC_X86SEH) {		} else if (Personality == EHPersonality::MSVC_X86SEH) {
// If _except_handler4 is in use, some additional guard checks and prologue		// If _except_handler4 is in use, some additional guard checks and prologue
// stuff is required.		// stuff is required.
StringRef PersonalityName = PersonalityFn->getName();		StringRef PersonalityName = PersonalityFn->getName();
UseStackGuard = (PersonalityName == "_except_handler4");		UseStackGuard = (PersonalityName == "_except_handler4");

// Allocate local structures.		// Allocate local structures.
RegNodeTy = getSEHRegistrationType();		RegNodeTy = getSEHRegistrationType();
RegNode = Builder.CreateAlloca(RegNodeTy);		RegNode = Builder.CreateAlloca(*DL, RegNodeTy);
if (UseStackGuard)		if (UseStackGuard)
EHGuardNode = Builder.CreateAlloca(Int32Ty);		EHGuardNode = Builder.CreateAlloca(*DL, Int32Ty);

// SavedESP = llvm.stacksave()		// SavedESP = llvm.stacksave()
Value *SP = Builder.CreateCall(		Value *SP = Builder.CreateCall(
Intrinsic::getDeclaration(TheModule, Intrinsic::stacksave), {});		Intrinsic::getDeclaration(TheModule, Intrinsic::stacksave), {});
Builder.CreateStore(SP, Builder.CreateStructGEP(RegNodeTy, RegNode, 0));		Builder.CreateStore(SP, Builder.CreateStructGEP(RegNodeTy, RegNode, 0));
// TryLevel = -2 / -1		// TryLevel = -2 / -1
StateFieldIndex = 4;		StateFieldIndex = 4;
ParentBaseState = UseStackGuard ? -2 : -1;		ParentBaseState = UseStackGuard ? -2 : -1;
▲ Show 20 Lines • Show All 471 Lines • Show Last 20 Lines

lib/Transforms/Coroutines/CoroElide.cpp

Show First 20 Lines • Show All 121 Lines • ▼ Show 20 Lines	for (auto *CA : CoroAllocs) {
CA->replaceAllUsesWith(False);		CA->replaceAllUsesWith(False);
CA->eraseFromParent();		CA->eraseFromParent();
}		}

// FIXME: Design how to transmit alignment information for every alloca that		// FIXME: Design how to transmit alignment information for every alloca that
// is spilled into the coroutine frame and recreate the alignment information		// is spilled into the coroutine frame and recreate the alignment information
// here. Possibly we will need to do a mini SROA here and break the coroutine		// here. Possibly we will need to do a mini SROA here and break the coroutine
// frame into individual AllocaInst recreating the original alignment.		// frame into individual AllocaInst recreating the original alignment.
auto *Frame = new AllocaInst(FrameTy, "", InsertPt);		const DataLayout &DL = F->getParent()->getDataLayout();
		auto *Frame = new AllocaInst(FrameTy, DL.getStackAddrSpace(), "", InsertPt);
auto *FrameVoidPtr =		auto *FrameVoidPtr =
new BitCastInst(Frame, Type::getInt8PtrTy(C), "vFrame", InsertPt);		new BitCastInst(Frame, Type::getInt8PtrTy(C), "vFrame", InsertPt);

for (auto *CB : CoroBegins) {		for (auto *CB : CoroBegins) {
CB->replaceAllUsesWith(FrameVoidPtr);		CB->replaceAllUsesWith(FrameVoidPtr);
CB->eraseFromParent();		CB->eraseFromParent();
}		}

▲ Show 20 Lines • Show All 179 Lines • Show Last 20 Lines

lib/Transforms/Coroutines/CoroSplit.cpp

	Show First 20 Lines • Show All 411 Lines • ▼ Show 20 Lines

	// Coroutine has no suspend points. Remove heap allocation for the coroutine			// Coroutine has no suspend points. Remove heap allocation for the coroutine
	// frame if possible.			// frame if possible.
	static void handleNoSuspendCoroutine(CoroBeginInst *CoroBegin, Type *FrameTy) {			static void handleNoSuspendCoroutine(CoroBeginInst *CoroBegin, Type *FrameTy) {
	auto *CoroId = CoroBegin->getId();			auto *CoroId = CoroBegin->getId();
	auto *AllocInst = CoroId->getCoroAlloc();			auto *AllocInst = CoroId->getCoroAlloc();
	coro::replaceCoroFree(CoroId, /*Elide=*/AllocInst != nullptr);			coro::replaceCoroFree(CoroId, /*Elide=*/AllocInst != nullptr);
	if (AllocInst) {			if (AllocInst) {
				const DataLayout &DL = AllocInst->getModule()->getDataLayout();

	IRBuilder<> Builder(AllocInst);			IRBuilder<> Builder(AllocInst);
	// FIXME: Need to handle overaligned members.			// FIXME: Need to handle overaligned members.
	auto *Frame = Builder.CreateAlloca(FrameTy);			auto *Frame = Builder.CreateAlloca(DL, FrameTy);
	auto *VFrame = Builder.CreateBitCast(Frame, Builder.getInt8PtrTy());			auto *VFrame = Builder.CreateBitCast(Frame, Builder.getInt8PtrTy());
	AllocInst->replaceAllUsesWith(Builder.getFalse());			AllocInst->replaceAllUsesWith(Builder.getFalse());
	AllocInst->eraseFromParent();			AllocInst->eraseFromParent();
	CoroBegin->replaceAllUsesWith(VFrame);			CoroBegin->replaceAllUsesWith(VFrame);
	} else {			} else {
	CoroBegin->replaceAllUsesWith(CoroBegin->getMem());			CoroBegin->replaceAllUsesWith(CoroBegin->getMem());
	}			}
	CoroBegin->eraseFromParent();			CoroBegin->eraseFromParent();
	▲ Show 20 Lines • Show All 244 Lines • Show Last 20 Lines

lib/Transforms/IPO/ArgumentPromotion.cpp

Show First 20 Lines • Show All 343 Lines • ▼ Show 20 Lines	if (!Call->use_empty()) {
New->takeName(Call);		New->takeName(Call);
}		}

// Finally, remove the old call from the program, reducing the use-count of		// Finally, remove the old call from the program, reducing the use-count of
// F.		// F.
Call->eraseFromParent();		Call->eraseFromParent();
}		}

		const DataLayout &DL = F->getParent()->getDataLayout();

// Since we have now created the new function, splice the body of the old		// Since we have now created the new function, splice the body of the old
// function right into the new function, leaving the old rotting hulk of the		// function right into the new function, leaving the old rotting hulk of the
// function empty.		// function empty.
NF->getBasicBlockList().splice(NF->begin(), F->getBasicBlockList());		NF->getBasicBlockList().splice(NF->begin(), F->getBasicBlockList());

// Loop over the argument list, transferring uses of the old arguments over to		// Loop over the argument list, transferring uses of the old arguments over to
// the new arguments, also transferring over the names as well.		// the new arguments, also transferring over the names as well.
//		//
Show All 11 Lines	for (Function::arg_iterator I = F->arg_begin(), E = F->arg_end(),

if (ByValArgsToTransform.count(&*I)) {		if (ByValArgsToTransform.count(&*I)) {
// In the callee, we create an alloca, and store each of the new incoming		// In the callee, we create an alloca, and store each of the new incoming
// arguments into the alloca.		// arguments into the alloca.
Instruction *InsertPt = &NF->begin()->front();		Instruction *InsertPt = &NF->begin()->front();

// Just add all the struct element types.		// Just add all the struct element types.
Type *AgTy = cast<PointerType>(I->getType())->getElementType();		Type *AgTy = cast<PointerType>(I->getType())->getElementType();
Value *TheAlloca = new AllocaInst(AgTy, nullptr, "", InsertPt);		Value *TheAlloca = new AllocaInst(AgTy, DL.getStackAddrSpace(), nullptr,
		"", InsertPt);
StructType *STy = cast<StructType>(AgTy);		StructType *STy = cast<StructType>(AgTy);
Value *Idxs[2] = {ConstantInt::get(Type::getInt32Ty(F->getContext()), 0),		Value *Idxs[2] = {ConstantInt::get(Type::getInt32Ty(F->getContext()), 0),
nullptr};		nullptr};

for (unsigned i = 0, e = STy->getNumElements(); i != e; ++i) {		for (unsigned i = 0, e = STy->getNumElements(); i != e; ++i) {
Idxs[1] = ConstantInt::get(Type::getInt32Ty(F->getContext()), i);		Idxs[1] = ConstantInt::get(Type::getInt32Ty(F->getContext()), i);
Value *Idx = GetElementPtrInst::Create(		Value *Idx = GetElementPtrInst::Create(
AgTy, TheAlloca, Idxs, TheAlloca->getName() + "." + Twine(i),		AgTy, TheAlloca, Idxs, TheAlloca->getName() + "." + Twine(i),
▲ Show 20 Lines • Show All 698 Lines • Show Last 20 Lines

lib/Transforms/IPO/GlobalOpt.cpp

Show First 20 Lines • Show All 1,813 Lines • ▼ Show 20 Lines	if (!GS.HasMultipleAccessingFunctions &&
GS.AccessingFunction &&		GS.AccessingFunction &&
GV->getValueType()->isSingleValueType() &&		GV->getValueType()->isSingleValueType() &&
GV->getType()->getAddressSpace() == 0 &&		GV->getType()->getAddressSpace() == 0 &&
!GV->isExternallyInitialized() &&		!GV->isExternallyInitialized() &&
allNonInstructionUsersCanBeMadeInstructions(GV) &&		allNonInstructionUsersCanBeMadeInstructions(GV) &&
GS.AccessingFunction->doesNotRecurse() &&		GS.AccessingFunction->doesNotRecurse() &&
isPointerValueDeadOnEntryToFunction(GS.AccessingFunction, GV,		isPointerValueDeadOnEntryToFunction(GS.AccessingFunction, GV,
LookupDomTree)) {		LookupDomTree)) {
		const DataLayout &DL = GV->getParent()->getDataLayout();

DEBUG(dbgs() << "LOCALIZING GLOBAL: " << *GV << "\n");		DEBUG(dbgs() << "LOCALIZING GLOBAL: " << *GV << "\n");
Instruction &FirstI = const_cast<Instruction&>(*GS.AccessingFunction		Instruction &FirstI = const_cast<Instruction&>(*GS.AccessingFunction
->getEntryBlock().begin());		->getEntryBlock().begin());
Type *ElemTy = GV->getValueType();		Type *ElemTy = GV->getValueType();
// FIXME: Pass Global's alignment when globals have alignment		// FIXME: Pass Global's alignment when globals have alignment
AllocaInst *Alloca = new AllocaInst(ElemTy, nullptr,		AllocaInst *Alloca = new AllocaInst(ElemTy, DL.getStackAddrSpace(), nullptr,
GV->getName(), &FirstI);		GV->getName(), &FirstI);
if (!isa<UndefValue>(GV->getInitializer()))		if (!isa<UndefValue>(GV->getInitializer()))
new StoreInst(GV->getInitializer(), Alloca, &FirstI);		new StoreInst(GV->getInitializer(), Alloca, &FirstI);

makeAllConstantUsesInstructions(GV);		makeAllConstantUsesInstructions(GV);

GV->replaceAllUsesWith(Alloca);		GV->replaceAllUsesWith(Alloca);
GV->eraseFromParent();		GV->eraseFromParent();
▲ Show 20 Lines • Show All 766 Lines • Show Last 20 Lines

lib/Transforms/InstCombine/InstCombineCasts.cpp

Show First 20 Lines • Show All 132 Lines • ▼ Show 20 Lines	Instruction *InstCombiner::PromoteCastOfAllocation(BitCastInst &CI,
}		}

if (uint64_t Offset = (AllocElTySize*ArrayOffset)/CastElTySize) {		if (uint64_t Offset = (AllocElTySize*ArrayOffset)/CastElTySize) {
Value *Off = ConstantInt::get(AI.getArraySize()->getType(),		Value *Off = ConstantInt::get(AI.getArraySize()->getType(),
Offset, true);		Offset, true);
Amt = AllocaBuilder.CreateAdd(Amt, Off);		Amt = AllocaBuilder.CreateAdd(Amt, Off);
}		}

AllocaInst *New = AllocaBuilder.CreateAlloca(CastElTy, Amt);		AllocaInst *New = AllocaBuilder.CreateAlloca(DL, CastElTy, Amt);
New->setAlignment(AI.getAlignment());		New->setAlignment(AI.getAlignment());
New->takeName(&AI);		New->takeName(&AI);
New->setUsedWithInAlloca(AI.isUsedWithInAlloca());		New->setUsedWithInAlloca(AI.isUsedWithInAlloca());

// If the allocation has multiple real uses, insert a cast and change all		// If the allocation has multiple real uses, insert a cast and change all
// things that used it to use the new cast. This will also hack on CI, but it		// things that used it to use the new cast. This will also hack on CI, but it
// will die soon.		// will die soon.
if (!AI.hasOneUse()) {		if (!AI.hasOneUse()) {
▲ Show 20 Lines • Show All 2,061 Lines • Show Last 20 Lines

lib/Transforms/InstCombine/InstCombineLoadStoreAlloca.cpp

Show First 20 Lines • Show All 179 Lines • ▼ Show 20 Lines	if (!AI.isArrayAllocation()) {
Value *V = IC.Builder->getInt32(1);		Value *V = IC.Builder->getInt32(1);
AI.setOperand(0, V);		AI.setOperand(0, V);
return &AI;		return &AI;
}		}

// Convert: alloca Ty, C - where C is a constant != 1 into: alloca [C x Ty], 1		// Convert: alloca Ty, C - where C is a constant != 1 into: alloca [C x Ty], 1
if (const ConstantInt *C = dyn_cast<ConstantInt>(AI.getArraySize())) {		if (const ConstantInt *C = dyn_cast<ConstantInt>(AI.getArraySize())) {
Type *NewTy = ArrayType::get(AI.getAllocatedType(), C->getZExtValue());		Type *NewTy = ArrayType::get(AI.getAllocatedType(), C->getZExtValue());
AllocaInst *New = IC.Builder->CreateAlloca(NewTy, nullptr, AI.getName());		AllocaInst *New = IC.Builder->CreateAlloca(IC.getDataLayout(), NewTy,
		nullptr, AI.getName());
New->setAlignment(AI.getAlignment());		New->setAlignment(AI.getAlignment());

// Scan to the end of the allocation instructions, to skip over a block of		// Scan to the end of the allocation instructions, to skip over a block of
// allocas if possible...also skip interleaved debug info		// allocas if possible...also skip interleaved debug info
//		//
BasicBlock::iterator It(New);		BasicBlock::iterator It(New);
while (isa<AllocaInst>(It) || isa<DbgInfoIntrinsic>(It))		while (isa<AllocaInst>(It) || isa<DbgInfoIntrinsic>(It))
++It;		++It;
▲ Show 20 Lines • Show All 1,361 Lines • Show Last 20 Lines

lib/Transforms/Instrumentation/AddressSanitizer.cpp

Show First 20 Lines • Show All 497 Lines • ▼ Show 20 Lines	struct AddressSanitizer : public FunctionPass {
uint64_t getAllocaSizeInBytes(const AllocaInst &AI) const {		uint64_t getAllocaSizeInBytes(const AllocaInst &AI) const {
uint64_t ArraySize = 1;		uint64_t ArraySize = 1;
if (AI.isArrayAllocation()) {		if (AI.isArrayAllocation()) {
const ConstantInt *CI = dyn_cast<ConstantInt>(AI.getArraySize());		const ConstantInt *CI = dyn_cast<ConstantInt>(AI.getArraySize());
assert(CI && "non-constant array size");		assert(CI && "non-constant array size");
ArraySize = CI->getZExtValue();		ArraySize = CI->getZExtValue();
}		}
Type *Ty = AI.getAllocatedType();		Type *Ty = AI.getAllocatedType();
uint64_t SizeInBytes =		uint64_t SizeInBytes = DL->getTypeAllocSize(Ty);
AI.getModule()->getDataLayout().getTypeAllocSize(Ty);
return SizeInBytes * ArraySize;		return SizeInBytes * ArraySize;
}		}
/// Check if we want (and can) handle this alloca.		/// Check if we want (and can) handle this alloca.
bool isInterestingAlloca(const AllocaInst &AI);		bool isInterestingAlloca(const AllocaInst &AI);

/// If it is an interesting memory access, return the PointerOperand		/// If it is an interesting memory access, return the PointerOperand
/// and set IsWrite/Alignment. Otherwise return nullptr.		/// and set IsWrite/Alignment. Otherwise return nullptr.
/// MaybeMask is an output parameter for the mask Value, if we're looking at a		/// MaybeMask is an output parameter for the mask Value, if we're looking at a
▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	struct FunctionStateRAII {
}		}
~FunctionStateRAII() {		~FunctionStateRAII() {
Pass->LocalDynamicShadow = nullptr;		Pass->LocalDynamicShadow = nullptr;
Pass->ProcessedAllocas.clear();		Pass->ProcessedAllocas.clear();
}		}
};		};

LLVMContext *C;		LLVMContext *C;
		const DataLayout *DL;
Triple TargetTriple;		Triple TargetTriple;
int LongSize;		int LongSize;
bool CompileKernel;		bool CompileKernel;
bool Recover;		bool Recover;
bool UseAfterScope;		bool UseAfterScope;
Type *IntptrTy;		Type *IntptrTy;
ShadowMapping Mapping;		ShadowMapping Mapping;
DominatorTree *DT;		DominatorTree *DT;
▲ Show 20 Lines • Show All 55 Lines • ▼ Show 20 Lines	size_t MinRedzoneSizeForGlobal() const {
return RedzoneSizeForScale(Mapping.Scale);		return RedzoneSizeForScale(Mapping.Scale);
}		}

GlobalsMetadata GlobalsMD;		GlobalsMetadata GlobalsMD;
bool CompileKernel;		bool CompileKernel;
bool Recover;		bool Recover;
Type *IntptrTy;		Type *IntptrTy;
LLVMContext *C;		LLVMContext *C;
		const DataLayout *DL;
Triple TargetTriple;		Triple TargetTriple;
ShadowMapping Mapping;		ShadowMapping Mapping;
Function *AsanPoisonGlobals;		Function *AsanPoisonGlobals;
Function *AsanUnpoisonGlobals;		Function *AsanUnpoisonGlobals;
Function *AsanRegisterGlobals;		Function *AsanRegisterGlobals;
Function *AsanUnregisterGlobals;		Function *AsanUnregisterGlobals;
Function *AsanRegisterImageGlobals;		Function *AsanRegisterImageGlobals;
Function *AsanUnregisterImageGlobals;		Function *AsanUnregisterImageGlobals;
▲ Show 20 Lines • Show All 1,275 Lines • ▼ Show 20 Lines	if (HasDynamicallyInitializedGlobals)
createInitializerPoisonCalls(M, ModuleName);		createInitializerPoisonCalls(M, ModuleName);

DEBUG(dbgs() << M);		DEBUG(dbgs() << M);
return true;		return true;
}		}

bool AddressSanitizerModule::runOnModule(Module &M) {		bool AddressSanitizerModule::runOnModule(Module &M) {
C = &(M.getContext());		C = &(M.getContext());
		DL = &M.getDataLayout();
int LongSize = M.getDataLayout().getPointerSizeInBits();		int LongSize = M.getDataLayout().getPointerSizeInBits();
IntptrTy = Type::getIntNTy(*C, LongSize);		IntptrTy = Type::getIntNTy(*C, LongSize);
TargetTriple = Triple(M.getTargetTriple());		TargetTriple = Triple(M.getTargetTriple());
Mapping = getShadowMapping(TargetTriple, LongSize, CompileKernel);		Mapping = getShadowMapping(TargetTriple, LongSize, CompileKernel);
initializeCallbacks(M);		initializeCallbacks(M);

bool Changed = false;		bool Changed = false;

▲ Show 20 Lines • Show All 69 Lines • ▼ Show 20 Lines

// virtual		// virtual
bool AddressSanitizer::doInitialization(Module &M) {		bool AddressSanitizer::doInitialization(Module &M) {
// Initialize the private fields. No one has accessed them before.		// Initialize the private fields. No one has accessed them before.

GlobalsMD.init(M);		GlobalsMD.init(M);

C = &(M.getContext());		C = &(M.getContext());
		DL = &M.getDataLayout();
LongSize = M.getDataLayout().getPointerSizeInBits();		LongSize = M.getDataLayout().getPointerSizeInBits();
IntptrTy = Type::getIntNTy(*C, LongSize);		IntptrTy = Type::getIntNTy(*C, LongSize);
TargetTriple = Triple(M.getTargetTriple());		TargetTriple = Triple(M.getTargetTriple());

if (!CompileKernel) {		if (!CompileKernel) {
std::tie(AsanCtorFunction, AsanInitFunction) =		std::tie(AsanCtorFunction, AsanInitFunction) =
createSanitizerCtorAndInitFunctions(		createSanitizerCtorAndInitFunctions(
M, kAsanModuleCtorName, kAsanInitName,		M, kAsanModuleCtorName, kAsanInitName,
▲ Show 20 Lines • Show All 357 Lines • ▼ Show 20 Lines	PHINode *FunctionStackPoisoner::createPHI(IRBuilder<> &IRB, Value *Cond,
BasicBlock *ThenBlock = ThenTerm->getParent();		BasicBlock *ThenBlock = ThenTerm->getParent();
PHI->addIncoming(ValueIfTrue, ThenBlock);		PHI->addIncoming(ValueIfTrue, ThenBlock);
return PHI;		return PHI;
}		}

Value *FunctionStackPoisoner::createAllocaForLayout(		Value *FunctionStackPoisoner::createAllocaForLayout(
IRBuilder<> &IRB, const ASanStackFrameLayout &L, bool Dynamic) {		IRBuilder<> &IRB, const ASanStackFrameLayout &L, bool Dynamic) {
AllocaInst *Alloca;		AllocaInst *Alloca;

		const DataLayout &DL = F.getParent()->getDataLayout();
if (Dynamic) {		if (Dynamic) {
Alloca = IRB.CreateAlloca(IRB.getInt8Ty(),		Alloca = IRB.CreateAlloca(DL, IRB.getInt8Ty(),
ConstantInt::get(IRB.getInt64Ty(), L.FrameSize),		ConstantInt::get(IRB.getInt64Ty(), L.FrameSize),
"MyAlloca");		"MyAlloca");
} else {		} else {
Alloca = IRB.CreateAlloca(ArrayType::get(IRB.getInt8Ty(), L.FrameSize),		Alloca = IRB.CreateAlloca(DL, ArrayType::get(IRB.getInt8Ty(), L.FrameSize),
nullptr, "MyAlloca");		nullptr, "MyAlloca");
assert(Alloca->isStaticAlloca());		assert(Alloca->isStaticAlloca());
}		}
assert((ClRealignStack & (ClRealignStack - 1)) == 0);		assert((ClRealignStack & (ClRealignStack - 1)) == 0);
size_t FrameAlignment = std::max(L.FrameAlignment, (size_t)ClRealignStack);		size_t FrameAlignment = std::max(L.FrameAlignment, (size_t)ClRealignStack);
Alloca->setAlignment(FrameAlignment);		Alloca->setAlignment(FrameAlignment);
return IRB.CreatePointerCast(Alloca, IntptrTy);		return IRB.CreatePointerCast(Alloca, IntptrTy);
}		}

void FunctionStackPoisoner::createDynamicAllocasInitStorage() {		void FunctionStackPoisoner::createDynamicAllocasInitStorage() {
BasicBlock &FirstBB = *F.begin();		BasicBlock &FirstBB = *F.begin();
		const DataLayout &DL = F.getParent()->getDataLayout();
IRBuilder<> IRB(dyn_cast<Instruction>(FirstBB.begin()));		IRBuilder<> IRB(dyn_cast<Instruction>(FirstBB.begin()));
DynamicAllocaLayout = IRB.CreateAlloca(IntptrTy, nullptr);		DynamicAllocaLayout = IRB.CreateAlloca(DL, IntptrTy, nullptr);
IRB.CreateStore(Constant::getNullValue(IntptrTy), DynamicAllocaLayout);		IRB.CreateStore(Constant::getNullValue(IntptrTy), DynamicAllocaLayout);
DynamicAllocaLayout->setAlignment(32);		DynamicAllocaLayout->setAlignment(32);
}		}

void FunctionStackPoisoner::processDynamicAllocas() {		void FunctionStackPoisoner::processDynamicAllocas() {
if (!ClInstrumentDynamicAllocas || DynamicAllocaVec.empty()) {		if (!ClInstrumentDynamicAllocas || DynamicAllocaVec.empty()) {
assert(DynamicAllocaPoisonCallVec.empty());		assert(DynamicAllocaPoisonCallVec.empty());
return;		return;
[354 lines not shown]	void FunctionStackPoisoner::handleDynamicAllocaCall(AllocaInst *AI) {
// Align is added to locate left redzone, PartialPadding for possible		// Align is added to locate left redzone, PartialPadding for possible
// partial redzone and kAllocaRzSize for right redzone respectively.		// partial redzone and kAllocaRzSize for right redzone respectively.
Value *AdditionalChunkSize = IRB.CreateAdd(		Value *AdditionalChunkSize = IRB.CreateAdd(
ConstantInt::get(IntptrTy, Align + kAllocaRzSize), PartialPadding);		ConstantInt::get(IntptrTy, Align + kAllocaRzSize), PartialPadding);

Value *NewSize = IRB.CreateAdd(OldSize, AdditionalChunkSize);		Value *NewSize = IRB.CreateAdd(OldSize, AdditionalChunkSize);

// Insert new alloca with new NewSize and Align params.		// Insert new alloca with new NewSize and Align params.
AllocaInst *NewAlloca = IRB.CreateAlloca(IRB.getInt8Ty(), NewSize);		AllocaInst *NewAlloca = IRB.CreateAlloca(IRB.getInt8Ty(),
		AI->getType()->getAddressSpace(),
		NewSize);
NewAlloca->setAlignment(Align);		NewAlloca->setAlignment(Align);

// NewAddress = Address + Align		// NewAddress = Address + Align
Value *NewAddress = IRB.CreateAdd(IRB.CreatePtrToInt(NewAlloca, IntptrTy),		Value *NewAddress = IRB.CreateAdd(IRB.CreatePtrToInt(NewAlloca, IntptrTy),
ConstantInt::get(IntptrTy, Align));		ConstantInt::get(IntptrTy, Align));

// Insert __asan_alloca_poison call for new created alloca.		// Insert __asan_alloca_poison call for new created alloca.
IRB.CreateCall(AsanAllocaPoisonFunc, {NewAddress, OldSize});		IRB.CreateCall(AsanAllocaPoisonFunc, {NewAddress, OldSize});
[30 lines not shown]

lib/Transforms/Instrumentation/DataFlowSanitizer.cpp

[325 lines not shown]	void storeShadow(Value *Addr, uint64_t Size, uint64_t Align, Value *Shadow,
Instruction *Pos);		Instruction *Pos);
};		};

class DFSanVisitor : public InstVisitor<DFSanVisitor> {		class DFSanVisitor : public InstVisitor<DFSanVisitor> {
public:		public:
DFSanFunction &DFSF;		DFSanFunction &DFSF;
DFSanVisitor(DFSanFunction &DFSF) : DFSF(DFSF) {}		DFSanVisitor(DFSanFunction &DFSF) : DFSF(DFSF) {}

		const DataLayout &getDataLayout() const {
		return DFSF.F->getParent()->getDataLayout();
		}

void visitOperandShadowInst(Instruction &I);		void visitOperandShadowInst(Instruction &I);

void visitBinaryOperator(BinaryOperator &BO);		void visitBinaryOperator(BinaryOperator &BO);
void visitCastInst(CastInst &CI);		void visitCastInst(CastInst &CI);
void visitCmpInst(CmpInst &CI);		void visitCmpInst(CmpInst &CI);
void visitGetElementPtrInst(GetElementPtrInst &GEPI);		void visitGetElementPtrInst(GetElementPtrInst &GEPI);
void visitLoadInst(LoadInst &LI);		void visitLoadInst(LoadInst &LI);
void visitStoreInst(StoreInst &SI);		void visitStoreInst(StoreInst &SI);
[965 lines not shown]	if (StoreInst *SI = dyn_cast<StoreInst>(U)) {
continue;		continue;
}		}

AllLoadsStores = false;		AllLoadsStores = false;
break;		break;
}		}
if (AllLoadsStores) {		if (AllLoadsStores) {
IRBuilder<> IRB(&I);		IRBuilder<> IRB(&I);
DFSF.AllocaShadowMap[&I] = IRB.CreateAlloca(DFSF.DFS.ShadowTy);		DFSF.AllocaShadowMap[&I] = IRB.CreateAlloca(getDataLayout(),
		DFSF.DFS.ShadowTy);
}		}
DFSF.setShadow(&I, DFSF.DFS.ZeroShadow);		DFSF.setShadow(&I, DFSF.DFS.ZeroShadow);
}		}

void DFSanVisitor::visitSelectInst(SelectInst &I) {		void DFSanVisitor::visitSelectInst(SelectInst &I) {
Value *CondShadow = DFSF.getShadow(I.getCondition());		Value *CondShadow = DFSF.getShadow(I.getCondition());
Value *TrueShadow = DFSF.getShadow(I.getTrueValue());		Value *TrueShadow = DFSF.getShadow(I.getTrueValue());
Value *FalseShadow = DFSF.getShadow(I.getFalseValue());		Value *FalseShadow = DFSF.getShadow(I.getFalseValue());
[152 lines not shown]	case DataFlowSanitizer::WK_Custom: {
i = CS.arg_begin();		i = CS.arg_begin();
for (unsigned n = FT->getNumParams(); n != 0; ++i, --n)		for (unsigned n = FT->getNumParams(); n != 0; ++i, --n)
Args.push_back(DFSF.getShadow(*i));		Args.push_back(DFSF.getShadow(*i));

if (FT->isVarArg()) {		if (FT->isVarArg()) {
auto *LabelVATy = ArrayType::get(DFSF.DFS.ShadowTy,		auto *LabelVATy = ArrayType::get(DFSF.DFS.ShadowTy,
CS.arg_size() - FT->getNumParams());		CS.arg_size() - FT->getNumParams());
auto *LabelVAAlloca = new AllocaInst(		auto *LabelVAAlloca = new AllocaInst(
LabelVATy, "labelva", &DFSF.F->getEntryBlock().front());		LabelVATy, getDataLayout().getStackAddrSpace(),
		"labelva", &DFSF.F->getEntryBlock().front());

for (unsigned n = 0; i != CS.arg_end(); ++i, ++n) {		for (unsigned n = 0; i != CS.arg_end(); ++i, ++n) {
auto LabelVAPtr = IRB.CreateStructGEP(LabelVATy, LabelVAAlloca, n);		auto LabelVAPtr = IRB.CreateStructGEP(LabelVATy, LabelVAAlloca, n);
IRB.CreateStore(DFSF.getShadow(*i), LabelVAPtr);		IRB.CreateStore(DFSF.getShadow(*i), LabelVAPtr);
}		}

Args.push_back(IRB.CreateStructGEP(LabelVATy, LabelVAAlloca, 0));		Args.push_back(IRB.CreateStructGEP(LabelVATy, LabelVAAlloca, 0));
}		}

if (!FT->getReturnType()->isVoidTy()) {		if (!FT->getReturnType()->isVoidTy()) {
if (!DFSF.LabelReturnAlloca) {		if (!DFSF.LabelReturnAlloca) {
DFSF.LabelReturnAlloca =		DFSF.LabelReturnAlloca =
new AllocaInst(DFSF.DFS.ShadowTy, "labelreturn",		new AllocaInst(DFSF.DFS.ShadowTy,
&DFSF.F->getEntryBlock().front());		getDataLayout().getStackAddrSpace(),
		"labelreturn", &DFSF.F->getEntryBlock().front());
}		}
Args.push_back(DFSF.LabelReturnAlloca);		Args.push_back(DFSF.LabelReturnAlloca);
}		}

for (i = CS.arg_begin() + FT->getNumParams(); i != CS.arg_end(); ++i)		for (i = CS.arg_begin() + FT->getNumParams(); i != CS.arg_end(); ++i)
Args.push_back(*i);		Args.push_back(*i);

CallInst *CustomCI = IRB.CreateCall(CustomF, Args);		CallInst *CustomCI = IRB.CreateCall(CustomF, Args);
[62 lines not shown]	if (DFSF.DFS.getInstrumentedABI() == DataFlowSanitizer::IA_Args) {
i = CS.arg_begin();		i = CS.arg_begin();
for (unsigned n = FT->getNumParams(); n != 0; ++i, --n)		for (unsigned n = FT->getNumParams(); n != 0; ++i, --n)
Args.push_back(DFSF.getShadow(*i));		Args.push_back(DFSF.getShadow(*i));

if (FT->isVarArg()) {		if (FT->isVarArg()) {
unsigned VarArgSize = CS.arg_size() - FT->getNumParams();		unsigned VarArgSize = CS.arg_size() - FT->getNumParams();
ArrayType *VarArgArrayTy = ArrayType::get(DFSF.DFS.ShadowTy, VarArgSize);		ArrayType *VarArgArrayTy = ArrayType::get(DFSF.DFS.ShadowTy, VarArgSize);
AllocaInst *VarArgShadow =		AllocaInst *VarArgShadow =
new AllocaInst(VarArgArrayTy, "", &DFSF.F->getEntryBlock().front());		new AllocaInst(VarArgArrayTy, getDataLayout().getStackAddrSpace(),
		"", &DFSF.F->getEntryBlock().front());
Args.push_back(IRB.CreateConstGEP2_32(VarArgArrayTy, VarArgShadow, 0, 0));		Args.push_back(IRB.CreateConstGEP2_32(VarArgArrayTy, VarArgShadow, 0, 0));
for (unsigned n = 0; i != e; ++i, ++n) {		for (unsigned n = 0; i != e; ++i, ++n) {
IRB.CreateStore(		IRB.CreateStore(
DFSF.getShadow(*i),		DFSF.getShadow(*i),
IRB.CreateConstGEP2_32(VarArgArrayTy, VarArgShadow, 0, n));		IRB.CreateConstGEP2_32(VarArgArrayTy, VarArgShadow, 0, n));
Args.push_back(*i);		Args.push_back(*i);
}		}
}		}
[44 lines not shown]

lib/Transforms/Instrumentation/MemorySanitizer.cpp

[3,056 lines not shown]	void visitVACopyInst(VACopyInst &I) override {

// Unpoison the whole __va_list_tag.		// Unpoison the whole __va_list_tag.
// FIXME: magic ABI constants.		// FIXME: magic ABI constants.
IRB.CreateMemSet(ShadowPtr, Constant::getNullValue(IRB.getInt8Ty()),		IRB.CreateMemSet(ShadowPtr, Constant::getNullValue(IRB.getInt8Ty()),
/* size */24, /* alignment */8, false);		/* size */24, /* alignment */8, false);
}		}

void finalizeInstrumentation() override {		void finalizeInstrumentation() override {
		const DataLayout &DL = F.getParent()->getDataLayout();

assert(!VAArgOverflowSize && !VAArgTLSCopy &&		assert(!VAArgOverflowSize && !VAArgTLSCopy &&
"finalizeInstrumentation called twice");		"finalizeInstrumentation called twice");
if (!VAStartInstrumentationList.empty()) {		if (!VAStartInstrumentationList.empty()) {
// If there is a va_start in this function, make a backup copy of		// If there is a va_start in this function, make a backup copy of
// va_arg_tls somewhere in the function entry block.		// va_arg_tls somewhere in the function entry block.
IRBuilder<> IRB(F.getEntryBlock().getFirstNonPHI());		IRBuilder<> IRB(F.getEntryBlock().getFirstNonPHI());
VAArgOverflowSize = IRB.CreateLoad(MS.VAArgOverflowSizeTLS);		VAArgOverflowSize = IRB.CreateLoad(MS.VAArgOverflowSizeTLS);
Value *CopySize =		Value *CopySize =
IRB.CreateAdd(ConstantInt::get(MS.IntptrTy, AMD64FpEndOffset),		IRB.CreateAdd(ConstantInt::get(MS.IntptrTy, AMD64FpEndOffset),
VAArgOverflowSize);		VAArgOverflowSize);
VAArgTLSCopy = IRB.CreateAlloca(Type::getInt8Ty(*MS.C), CopySize);		VAArgTLSCopy = IRB.CreateAlloca(DL, Type::getInt8Ty(*MS.C), CopySize);
IRB.CreateMemCpy(VAArgTLSCopy, MS.VAArgTLS, CopySize, 8);		IRB.CreateMemCpy(VAArgTLSCopy, MS.VAArgTLS, CopySize, 8);
}		}

// Instrument va_start.		// Instrument va_start.
// Copy va_list shadow from the backup copy of the TLS contents.		// Copy va_list shadow from the backup copy of the TLS contents.
for (size_t i = 0, n = VAStartInstrumentationList.size(); i < n; i++) {		for (size_t i = 0, n = VAStartInstrumentationList.size(); i < n; i++) {
CallInst *OrigInst = VAStartInstrumentationList[i];		CallInst *OrigInst = VAStartInstrumentationList[i];
IRBuilder<> IRB(OrigInst->getNextNode());		IRBuilder<> IRB(OrigInst->getNextNode());
[100 lines not shown]	void finalizeInstrumentation() override {
assert(!VAArgSize && !VAArgTLSCopy &&		assert(!VAArgSize && !VAArgTLSCopy &&
"finalizeInstrumentation called twice");		"finalizeInstrumentation called twice");
IRBuilder<> IRB(F.getEntryBlock().getFirstNonPHI());		IRBuilder<> IRB(F.getEntryBlock().getFirstNonPHI());
VAArgSize = IRB.CreateLoad(MS.VAArgOverflowSizeTLS);		VAArgSize = IRB.CreateLoad(MS.VAArgOverflowSizeTLS);
Value *CopySize = IRB.CreateAdd(ConstantInt::get(MS.IntptrTy, 0),		Value *CopySize = IRB.CreateAdd(ConstantInt::get(MS.IntptrTy, 0),
VAArgSize);		VAArgSize);

if (!VAStartInstrumentationList.empty()) {		if (!VAStartInstrumentationList.empty()) {
		const DataLayout &DL = F.getParent()->getDataLayout();

// If there is a va_start in this function, make a backup copy of		// If there is a va_start in this function, make a backup copy of
// va_arg_tls somewhere in the function entry block.		// va_arg_tls somewhere in the function entry block.
VAArgTLSCopy = IRB.CreateAlloca(Type::getInt8Ty(*MS.C), CopySize);		VAArgTLSCopy = IRB.CreateAlloca(DL, Type::getInt8Ty(*MS.C), CopySize);
IRB.CreateMemCpy(VAArgTLSCopy, MS.VAArgTLS, CopySize, 8);		IRB.CreateMemCpy(VAArgTLSCopy, MS.VAArgTLS, CopySize, 8);
}		}

// Instrument va_start.		// Instrument va_start.
// Copy va_list shadow from the backup copy of the TLS contents.		// Copy va_list shadow from the backup copy of the TLS contents.
for (size_t i = 0, n = VAStartInstrumentationList.size(); i < n; i++) {		for (size_t i = 0, n = VAStartInstrumentationList.size(); i < n; i++) {
CallInst *OrigInst = VAStartInstrumentationList[i];		CallInst *OrigInst = VAStartInstrumentationList[i];
IRBuilder<> IRB(OrigInst->getNextNode());		IRBuilder<> IRB(OrigInst->getNextNode());
[154 lines not shown]	Value* getVAField32(IRBuilder<> &IRB, Value *VAListTag, int offset) {
Value *SaveArea32 = IRB.CreateLoad(SaveAreaPtr);		Value *SaveArea32 = IRB.CreateLoad(SaveAreaPtr);
return IRB.CreateSExt(SaveArea32, MS.IntptrTy);		return IRB.CreateSExt(SaveArea32, MS.IntptrTy);
}		}

void finalizeInstrumentation() override {		void finalizeInstrumentation() override {
assert(!VAArgOverflowSize && !VAArgTLSCopy &&		assert(!VAArgOverflowSize && !VAArgTLSCopy &&
"finalizeInstrumentation called twice");		"finalizeInstrumentation called twice");
if (!VAStartInstrumentationList.empty()) {		if (!VAStartInstrumentationList.empty()) {
		const DataLayout &DL = F.getParent()->getDataLayout();
// If there is a va_start in this function, make a backup copy of		// If there is a va_start in this function, make a backup copy of
// va_arg_tls somewhere in the function entry block.		// va_arg_tls somewhere in the function entry block.
IRBuilder<> IRB(F.getEntryBlock().getFirstNonPHI());		IRBuilder<> IRB(F.getEntryBlock().getFirstNonPHI());
VAArgOverflowSize = IRB.CreateLoad(MS.VAArgOverflowSizeTLS);		VAArgOverflowSize = IRB.CreateLoad(MS.VAArgOverflowSizeTLS);
Value *CopySize =		Value *CopySize =
IRB.CreateAdd(ConstantInt::get(MS.IntptrTy, AArch64VAEndOffset),		IRB.CreateAdd(ConstantInt::get(MS.IntptrTy, AArch64VAEndOffset),
VAArgOverflowSize);		VAArgOverflowSize);
VAArgTLSCopy = IRB.CreateAlloca(Type::getInt8Ty(*MS.C), CopySize);		VAArgTLSCopy = IRB.CreateAlloca(DL, Type::getInt8Ty(*MS.C), CopySize);
IRB.CreateMemCpy(VAArgTLSCopy, MS.VAArgTLS, CopySize, 8);		IRB.CreateMemCpy(VAArgTLSCopy, MS.VAArgTLS, CopySize, 8);
}		}

Value *GrArgSize = ConstantInt::get(MS.IntptrTy, kAArch64GrArgSize);		Value *GrArgSize = ConstantInt::get(MS.IntptrTy, kAArch64GrArgSize);
Value *VrArgSize = ConstantInt::get(MS.IntptrTy, kAArch64VrArgSize);		Value *VrArgSize = ConstantInt::get(MS.IntptrTy, kAArch64VrArgSize);

// Instrument va_start, copy va_list shadow from the backup copy of		// Instrument va_start, copy va_list shadow from the backup copy of
// the TLS contents.		// the TLS contents.
[206 lines not shown]	void finalizeInstrumentation() override {
assert(!VAArgSize && !VAArgTLSCopy &&		assert(!VAArgSize && !VAArgTLSCopy &&
"finalizeInstrumentation called twice");		"finalizeInstrumentation called twice");
IRBuilder<> IRB(F.getEntryBlock().getFirstNonPHI());		IRBuilder<> IRB(F.getEntryBlock().getFirstNonPHI());
VAArgSize = IRB.CreateLoad(MS.VAArgOverflowSizeTLS);		VAArgSize = IRB.CreateLoad(MS.VAArgOverflowSizeTLS);
Value *CopySize = IRB.CreateAdd(ConstantInt::get(MS.IntptrTy, 0),		Value *CopySize = IRB.CreateAdd(ConstantInt::get(MS.IntptrTy, 0),
VAArgSize);		VAArgSize);

if (!VAStartInstrumentationList.empty()) {		if (!VAStartInstrumentationList.empty()) {
		const DataLayout &DL = F.getParent()->getDataLayout();

// If there is a va_start in this function, make a backup copy of		// If there is a va_start in this function, make a backup copy of
// va_arg_tls somewhere in the function entry block.		// va_arg_tls somewhere in the function entry block.
VAArgTLSCopy = IRB.CreateAlloca(Type::getInt8Ty(*MS.C), CopySize);		VAArgTLSCopy = IRB.CreateAlloca(DL, Type::getInt8Ty(*MS.C), CopySize);
IRB.CreateMemCpy(VAArgTLSCopy, MS.VAArgTLS, CopySize, 8);		IRB.CreateMemCpy(VAArgTLSCopy, MS.VAArgTLS, CopySize, 8);
}		}

// Instrument va_start.		// Instrument va_start.
// Copy va_list shadow from the backup copy of the TLS contents.		// Copy va_list shadow from the backup copy of the TLS contents.
for (size_t i = 0, n = VAStartInstrumentationList.size(); i < n; i++) {		for (size_t i = 0, n = VAStartInstrumentationList.size(); i < n; i++) {
CallInst *OrigInst = VAStartInstrumentationList[i];		CallInst *OrigInst = VAStartInstrumentationList[i];
IRBuilder<> IRB(OrigInst->getNextNode());		IRBuilder<> IRB(OrigInst->getNextNode());
[62 lines not shown]

lib/Transforms/Scalar/RewriteStatepointsForGC.cpp

[1,609 lines not shown]	#endif
DenseMap<Value *, Value *> AllocaMap;		DenseMap<Value *, Value *> AllocaMap;
SmallVector<AllocaInst *, 200> PromotableAllocas;		SmallVector<AllocaInst *, 200> PromotableAllocas;
// Used later to check that we have enough allocas to store all values		// Used later to check that we have enough allocas to store all values
std::size_t NumRematerializedValues = 0;		std::size_t NumRematerializedValues = 0;
PromotableAllocas.reserve(Live.size());		PromotableAllocas.reserve(Live.size());

// Emit alloca for "LiveValue" and record it in "allocaMap" and		// Emit alloca for "LiveValue" and record it in "allocaMap" and
// "PromotableAllocas"		// "PromotableAllocas"
		const DataLayout &DL = F.getParent()->getDataLayout();
auto emitAllocaFor = [&](Value *LiveValue) {		auto emitAllocaFor = [&](Value *LiveValue) {
AllocaInst *Alloca = new AllocaInst(LiveValue->getType(), "",		AllocaInst *Alloca = new AllocaInst(LiveValue->getType(),
		DL.getStackAddrSpace(), "",
F.getEntryBlock().getFirstNonPHI());		F.getEntryBlock().getFirstNonPHI());
AllocaMap[LiveValue] = Alloca;		AllocaMap[LiveValue] = Alloca;
PromotableAllocas.push_back(Alloca);		PromotableAllocas.push_back(Alloca);
};		};

// Emit alloca for each live gc pointer		// Emit alloca for each live gc pointer
for (Value *V : Live)		for (Value *V : Live)
emitAllocaFor(V);		emitAllocaFor(V);
[1,074 lines not shown]

lib/Transforms/Scalar/SROA.cpp

[2,289 lines not shown]	if (LastSROAPrefix != StringRef::npos) {
OldName = OldName.substr(OffsetEnd + 1);		OldName = OldName.substr(OffsetEnd + 1);
}		}
}		}
// Strip any SROA suffixes as well.		// Strip any SROA suffixes as well.
OldName = OldName.substr(0, OldName.find(".sroa_"));		OldName = OldName.substr(0, OldName.find(".sroa_"));
#endif		#endif

return getAdjustedPtr(IRB, DL, &NewAI,		return getAdjustedPtr(IRB, DL, &NewAI,
APInt(DL.getPointerSizeInBits(), Offset), PointerTy,		APInt(DL.getPointerTypeSizeInBits(PointerTy), Offset),
		PointerTy,
#ifndef NDEBUG		#ifndef NDEBUG
Twine(OldName) + "."		Twine(OldName) + "."
#else		#else
Twine()		Twine()
#endif		#endif
);		);
}		}

[58 lines not shown]	Value *rewriteIntegerLoad(LoadInst &LI) {
return V;		return V;
}		}

bool visitLoadInst(LoadInst &LI) {		bool visitLoadInst(LoadInst &LI) {
DEBUG(dbgs() << " original: " << LI << "\n");		DEBUG(dbgs() << " original: " << LI << "\n");
Value *OldOp = LI.getOperand(0);		Value *OldOp = LI.getOperand(0);
assert(OldOp == OldPtr);		assert(OldOp == OldPtr);

		unsigned AS = LI.getPointerAddressSpace();

Type *TargetTy = IsSplit ? Type::getIntNTy(LI.getContext(), SliceSize * 8)		Type *TargetTy = IsSplit ? Type::getIntNTy(LI.getContext(), SliceSize * 8)
: LI.getType();		: LI.getType();
const bool IsLoadPastEnd = DL.getTypeStoreSize(TargetTy) > SliceSize;		const bool IsLoadPastEnd = DL.getTypeStoreSize(TargetTy) > SliceSize;
bool IsPtrAdjusted = false;		bool IsPtrAdjusted = false;
Value *V;		Value *V;
if (VecTy) {		if (VecTy) {
V = rewriteVectorizedLoadInst();		V = rewriteVectorizedLoadInst();
} else if (IntTy && LI.getType()->isIntegerTy()) {		} else if (IntTy && LI.getType()->isIntegerTy()) {
[16 lines not shown]	if (VecTy) {
if (auto *TITy = dyn_cast<IntegerType>(TargetTy))		if (auto *TITy = dyn_cast<IntegerType>(TargetTy))
if (AITy->getBitWidth() < TITy->getBitWidth()) {		if (AITy->getBitWidth() < TITy->getBitWidth()) {
V = IRB.CreateZExt(V, TITy, "load.ext");		V = IRB.CreateZExt(V, TITy, "load.ext");
if (DL.isBigEndian())		if (DL.isBigEndian())
V = IRB.CreateShl(V, TITy->getBitWidth() - AITy->getBitWidth(),		V = IRB.CreateShl(V, TITy->getBitWidth() - AITy->getBitWidth(),
"endian_shift");		"endian_shift");
}		}
} else {		} else {
Type *LTy = TargetTy->getPointerTo();		Type *LTy = TargetTy->getPointerTo(AS);
LoadInst *NewLI = IRB.CreateAlignedLoad(getNewAllocaSlicePtr(IRB, LTy),		LoadInst *NewLI = IRB.CreateAlignedLoad(getNewAllocaSlicePtr(IRB, LTy),
getSliceAlign(TargetTy),		getSliceAlign(TargetTy),
LI.isVolatile(), LI.getName());		LI.isVolatile(), LI.getName());
if (LI.isVolatile())		if (LI.isVolatile())
NewLI->setAtomic(LI.getOrdering(), LI.getSynchScope());		NewLI->setAtomic(LI.getOrdering(), LI.getSynchScope());

V = NewLI;		V = NewLI;
IsPtrAdjusted = true;		IsPtrAdjusted = true;
[11 lines not shown]	if (IsSplit) {
"Non-byte-multiple bit width");		"Non-byte-multiple bit width");
// Move the insertion point just past the load so that we can refer to it.		// Move the insertion point just past the load so that we can refer to it.
IRB.SetInsertPoint(&*std::next(BasicBlock::iterator(&LI)));		IRB.SetInsertPoint(&*std::next(BasicBlock::iterator(&LI)));
// Create a placeholder value with the same type as LI to use as the		// Create a placeholder value with the same type as LI to use as the
// basis for the new value. This allows us to replace the uses of LI with		// basis for the new value. This allows us to replace the uses of LI with
// the computed value, and then replace the placeholder with LI, leaving		// the computed value, and then replace the placeholder with LI, leaving
// LI only used for this computation.		// LI only used for this computation.
Value *Placeholder =		Value *Placeholder =
new LoadInst(UndefValue::get(LI.getType()->getPointerTo()));		new LoadInst(UndefValue::get(LI.getType()->getPointerTo(AS)));
V = insertInteger(DL, IRB, Placeholder, V, NewBeginOffset - BeginOffset,		V = insertInteger(DL, IRB, Placeholder, V, NewBeginOffset - BeginOffset,
"insert");		"insert");
LI.replaceAllUsesWith(V);		LI.replaceAllUsesWith(V);
Placeholder->replaceAllUsesWith(&LI);		Placeholder->replaceAllUsesWith(&LI);
delete Placeholder;		delete Placeholder;
} else {		} else {
LI.replaceAllUsesWith(V);		LI.replaceAllUsesWith(V);
}		}
[96 lines not shown]	if (NewBeginOffset == NewAllocaBeginOffset &&
"endian_shift");		"endian_shift");
V = IRB.CreateTrunc(V, AITy, "load.trunc");		V = IRB.CreateTrunc(V, AITy, "load.trunc");
}		}

V = convertValue(DL, IRB, V, NewAllocaTy);		V = convertValue(DL, IRB, V, NewAllocaTy);
NewSI = IRB.CreateAlignedStore(V, &NewAI, NewAI.getAlignment(),		NewSI = IRB.CreateAlignedStore(V, &NewAI, NewAI.getAlignment(),
SI.isVolatile());		SI.isVolatile());
} else {		} else {
Value *NewPtr = getNewAllocaSlicePtr(IRB, V->getType()->getPointerTo());		unsigned AS = SI.getPointerAddressSpace();
		Value *NewPtr = getNewAllocaSlicePtr(IRB, V->getType()->getPointerTo(AS));
NewSI = IRB.CreateAlignedStore(V, NewPtr, getSliceAlign(V->getType()),		NewSI = IRB.CreateAlignedStore(V, NewPtr, getSliceAlign(V->getType()),
SI.isVolatile());		SI.isVolatile());
}		}
NewSI->copyMetadata(SI, LLVMContext::MD_mem_parallel_loop_access);		NewSI->copyMetadata(SI, LLVMContext::MD_mem_parallel_loop_access);
if (SI.isVolatile())		if (SI.isVolatile())
NewSI->setAtomic(SI.getOrdering(), SI.getSynchScope());		NewSI->setAtomic(SI.getOrdering(), SI.getSynchScope());
Pass.DeadInsts.insert(&SI);		Pass.DeadInsts.insert(&SI);
deleteIfTriviallyDead(OldOp);		deleteIfTriviallyDead(OldOp);
[1,298 lines not shown]	if (!Alignment) {
Alignment = DL.getABITypeAlignment(AI.getAllocatedType());		Alignment = DL.getABITypeAlignment(AI.getAllocatedType());
}		}
Alignment = MinAlign(Alignment, P.beginOffset());		Alignment = MinAlign(Alignment, P.beginOffset());
// If we will get at least this much alignment from the type alone, leave		// If we will get at least this much alignment from the type alone, leave
// the alloca's alignment unconstrained.		// the alloca's alignment unconstrained.
if (Alignment <= DL.getABITypeAlignment(SliceTy))		if (Alignment <= DL.getABITypeAlignment(SliceTy))
Alignment = 0;		Alignment = 0;
NewAI = new AllocaInst(		NewAI = new AllocaInst(
SliceTy, nullptr, Alignment,		SliceTy, AI.getType()->getAddressSpace(), nullptr, Alignment,
AI.getName() + ".sroa." + Twine(P.begin() - AS.begin()), &AI);		AI.getName() + ".sroa." + Twine(P.begin() - AS.begin()), &AI);
++NumNewAllocas;		++NumNewAllocas;
}		}

DEBUG(dbgs() << "Rewriting alloca partition "		DEBUG(dbgs() << "Rewriting alloca partition "
<< "[" << P.beginOffset() << "," << P.endOffset()		<< "[" << P.beginOffset() << "," << P.endOffset()
<< ") to: " << *NewAI << "\n");		<< ") to: " << *NewAI << "\n");

[421 lines not shown]

lib/Transforms/Utils/CodeExtractor.cpp

	[434 lines not shown]
	/// the call instruction, splitting any PHI nodes in the header block as			/// the call instruction, splitting any PHI nodes in the header block as
	/// necessary.			/// necessary.
	void CodeExtractor::			void CodeExtractor::
	emitCallAndSwitchStatement(Function *newFunction, BasicBlock *codeReplacer,			emitCallAndSwitchStatement(Function *newFunction, BasicBlock *codeReplacer,
	ValueSet &inputs, ValueSet &outputs) {			ValueSet &inputs, ValueSet &outputs) {
	// Emit a call to the new function, passing in: *pointer to struct (if			// Emit a call to the new function, passing in: *pointer to struct (if
	// aggregating parameters), or plan inputs and allocated memory for outputs			// aggregating parameters), or plan inputs and allocated memory for outputs
	std::vector<Value*> params, StructValues, ReloadOutputs, Reloads;			std::vector<Value*> params, StructValues, ReloadOutputs, Reloads;

	LLVMContext &Context = newFunction->getContext();			Module *M = newFunction->getParent();
				LLVMContext &Context = M->getContext();
				const DataLayout &DL = M->getDataLayout();

	// Add inputs as params, or to be filled into the struct			// Add inputs as params, or to be filled into the struct
	for (Value *input : inputs)			for (Value *input : inputs)
	if (AggregateArgs)			if (AggregateArgs)
	StructValues.push_back(input);			StructValues.push_back(input);
	else			else
	params.push_back(input);			params.push_back(input);

	// Create allocas for the outputs			// Create allocas for the outputs
	for (Value *output : outputs) {			for (Value *output : outputs) {
	if (AggregateArgs) {			if (AggregateArgs) {
	StructValues.push_back(output);			StructValues.push_back(output);
	} else {			} else {
	AllocaInst *alloca =			AllocaInst *alloca =
	new AllocaInst(output->getType(), nullptr, output->getName() + ".loc",			new AllocaInst(output->getType(), DL.getStackAddrSpace(),
				nullptr, output->getName() + ".loc",
	&codeReplacer->getParent()->front().front());			&codeReplacer->getParent()->front().front());
	ReloadOutputs.push_back(alloca);			ReloadOutputs.push_back(alloca);
	params.push_back(alloca);			params.push_back(alloca);
	}			}
	}			}

	StructType *StructArgTy = nullptr;			StructType *StructArgTy = nullptr;
	AllocaInst *Struct = nullptr;			AllocaInst *Struct = nullptr;
	if (AggregateArgs && (inputs.size() + outputs.size() > 0)) {			if (AggregateArgs && (inputs.size() + outputs.size() > 0)) {
	std::vector<Type*> ArgTypes;			std::vector<Type*> ArgTypes;
	for (ValueSet::iterator v = StructValues.begin(),			for (ValueSet::iterator v = StructValues.begin(),
	ve = StructValues.end(); v != ve; ++v)			ve = StructValues.end(); v != ve; ++v)
	ArgTypes.push_back((*v)->getType());			ArgTypes.push_back((*v)->getType());

	// Allocate a struct at the beginning of this function			// Allocate a struct at the beginning of this function
	StructArgTy = StructType::get(newFunction->getContext(), ArgTypes);			StructArgTy = StructType::get(newFunction->getContext(), ArgTypes);
	Struct = new AllocaInst(StructArgTy, nullptr, "structArg",			Struct = new AllocaInst(StructArgTy, DL.getStackAddrSpace(), nullptr,
				"structArg",
	&codeReplacer->getParent()->front().front());			&codeReplacer->getParent()->front().front());
	params.push_back(Struct);			params.push_back(Struct);

	for (unsigned i = 0, e = inputs.size(); i != e; ++i) {			for (unsigned i = 0, e = inputs.size(); i != e; ++i) {
	Value *Idx[2];			Value *Idx[2];
	Idx[0] = Constant::getNullValue(Type::getInt32Ty(Context));			Idx[0] = Constant::getNullValue(Type::getInt32Ty(Context));
	Idx[1] = ConstantInt::get(Type::getInt32Ty(Context), i);			Idx[1] = ConstantInt::get(Type::getInt32Ty(Context), i);
	GetElementPtrInst *GEP = GetElementPtrInst::Create(			GetElementPtrInst *GEP = GetElementPtrInst::Create(
	[391 lines not shown]

lib/Transforms/Utils/DemoteRegToStack.cpp

	[22 lines not shown]
	/// the alloca inserted to create a stack slot for I.			/// the alloca inserted to create a stack slot for I.
	AllocaInst *llvm::DemoteRegToStack(Instruction &I, bool VolatileLoads,			AllocaInst *llvm::DemoteRegToStack(Instruction &I, bool VolatileLoads,
	Instruction *AllocaPoint) {			Instruction *AllocaPoint) {
	if (I.use_empty()) {			if (I.use_empty()) {
	I.eraseFromParent();			I.eraseFromParent();
	return nullptr;			return nullptr;
	}			}

				Function *F = I.getParent()->getParent();
				const DataLayout &DL = F->getParent()->getDataLayout();

	// Create a stack slot to hold the value.			// Create a stack slot to hold the value.
	AllocaInst *Slot;			AllocaInst *Slot;
	if (AllocaPoint) {			if (AllocaPoint) {
	Slot = new AllocaInst(I.getType(), nullptr,			Slot = new AllocaInst(I.getType(), DL.getStackAddrSpace(), nullptr,
	I.getName()+".reg2mem", AllocaPoint);			I.getName()+".reg2mem", AllocaPoint);
	} else {			} else {
	Function *F = I.getParent()->getParent();			Slot = new AllocaInst(I.getType(), DL.getStackAddrSpace(), nullptr,
	Slot = new AllocaInst(I.getType(), nullptr, I.getName() + ".reg2mem",			I.getName() + ".reg2mem", &F->getEntryBlock().front());
	&F->getEntryBlock().front());
	}			}

	// We cannot demote invoke instructions to the stack if their normal edge			// We cannot demote invoke instructions to the stack if their normal edge
	// is critical. Therefore, split the critical edge and create a basic block			// is critical. Therefore, split the critical edge and create a basic block
	// into which the store can be inserted.			// into which the store can be inserted.
	if (InvokeInst *II = dyn_cast<InvokeInst>(&I)) {			if (InvokeInst *II = dyn_cast<InvokeInst>(&I)) {
	if (!II->getNormalDest()->getSinglePredecessor()) {			if (!II->getNormalDest()->getSinglePredecessor()) {
	unsigned SuccNum = GetSuccessorNumber(II->getParent(), II->getNormalDest());			unsigned SuccNum = GetSuccessorNumber(II->getParent(), II->getNormalDest());
	▲ Show 20 Lines • Show All 57 Lines • ▼ Show 20 Lines
	/// node and replaces it with a slot in the stack frame allocated via alloca.			/// node and replaces it with a slot in the stack frame allocated via alloca.
	/// The PHI node is deleted. It returns the pointer to the alloca inserted.			/// The PHI node is deleted. It returns the pointer to the alloca inserted.
	AllocaInst *llvm::DemotePHIToStack(PHINode *P, Instruction *AllocaPoint) {			AllocaInst *llvm::DemotePHIToStack(PHINode *P, Instruction *AllocaPoint) {
	if (P->use_empty()) {			if (P->use_empty()) {
	P->eraseFromParent();			P->eraseFromParent();
	return nullptr;			return nullptr;
	}			}

				const DataLayout &DL = P->getModule()->getDataLayout();

	// Create a stack slot to hold the value.			// Create a stack slot to hold the value.
	AllocaInst *Slot;			AllocaInst *Slot;
	if (AllocaPoint) {			if (AllocaPoint) {
	Slot = new AllocaInst(P->getType(), nullptr,			Slot = new AllocaInst(P->getType(), DL.getStackAddrSpace(), nullptr,
	P->getName()+".reg2mem", AllocaPoint);			P->getName()+".reg2mem", AllocaPoint);
	} else {			} else {
	Function *F = P->getParent()->getParent();			Function *F = P->getParent()->getParent();
	Slot = new AllocaInst(P->getType(), nullptr, P->getName() + ".reg2mem",			Slot = new AllocaInst(P->getType(), DL.getStackAddrSpace(), nullptr,
				P->getName() + ".reg2mem",
	&F->getEntryBlock().front());			&F->getEntryBlock().front());
	}			}

	// Iterate over each operand inserting a store in each predecessor.			// Iterate over each operand inserting a store in each predecessor.
	for (unsigned i = 0, e = P->getNumIncomingValues(); i < e; ++i) {			for (unsigned i = 0, e = P->getNumIncomingValues(); i < e; ++i) {
	if (InvokeInst *II = dyn_cast<InvokeInst>(P->getIncomingValue(i))) {			if (InvokeInst *II = dyn_cast<InvokeInst>(P->getIncomingValue(i))) {
	assert(II->getParent() != P->getIncomingBlock(i) &&			assert(II->getParent() != P->getIncomingBlock(i) &&
	"Invoke edge not supported yet"); (void)II;			"Invoke edge not supported yet"); (void)II;
	Show All 18 Lines

lib/Transforms/Utils/InlineFunction.cpp

	Show First 20 Lines • Show All 1,218 Lines • ▼ Show 20 Lines
	static Value *HandleByValArgument(Value *Arg, Instruction *TheCall,			static Value *HandleByValArgument(Value *Arg, Instruction *TheCall,
	const Function *CalledFunc,			const Function *CalledFunc,
	InlineFunctionInfo &IFI,			InlineFunctionInfo &IFI,
	unsigned ByValAlignment) {			unsigned ByValAlignment) {
	PointerType *ArgTy = cast<PointerType>(Arg->getType());			PointerType *ArgTy = cast<PointerType>(Arg->getType());
	Type *AggTy = ArgTy->getElementType();			Type *AggTy = ArgTy->getElementType();

	Function *Caller = TheCall->getFunction();			Function *Caller = TheCall->getFunction();
				const DataLayout &DL = Caller->getParent()->getDataLayout();

	// If the called function is readonly, then it could not mutate the caller's			// If the called function is readonly, then it could not mutate the caller's
	// copy of the byval'd memory. In this case, it is safe to elide the copy and			// copy of the byval'd memory. In this case, it is safe to elide the copy and
	// temporary.			// temporary.
	if (CalledFunc->onlyReadsMemory()) {			if (CalledFunc->onlyReadsMemory()) {
	// If the byval argument has a specified alignment that is greater than the			// If the byval argument has a specified alignment that is greater than the
	// passed in pointer, then we either have to round up the input pointer or			// passed in pointer, then we either have to round up the input pointer or
	// give up on this transformation.			// give up on this transformation.
	if (ByValAlignment <= 1) // 0 = unspecified, 1 = no particular alignment.			if (ByValAlignment <= 1) // 0 = unspecified, 1 = no particular alignment.
	return Arg;			return Arg;

	AssumptionCache *AC =			AssumptionCache *AC =
	IFI.GetAssumptionCache ? &(*IFI.GetAssumptionCache)(*Caller) : nullptr;			IFI.GetAssumptionCache ? &(*IFI.GetAssumptionCache)(*Caller) : nullptr;
	const DataLayout &DL = Caller->getParent()->getDataLayout();

	// If the pointer is already known to be sufficiently aligned, or if we can			// If the pointer is already known to be sufficiently aligned, or if we can
	// round it up to a larger alignment, then we don't need a temporary.			// round it up to a larger alignment, then we don't need a temporary.
	if (getOrEnforceKnownAlignment(Arg, ByValAlignment, DL, TheCall, AC) >=			if (getOrEnforceKnownAlignment(Arg, ByValAlignment, DL, TheCall, AC) >=
	ByValAlignment)			ByValAlignment)
	return Arg;			return Arg;

	// Otherwise, we have to make a memcpy to get a safe alignment. This is bad			// Otherwise, we have to make a memcpy to get a safe alignment. This is bad
	// for code quality, but rarely happens and is required for correctness.			// for code quality, but rarely happens and is required for correctness.
	}			}

	// Create the alloca. If we have DataLayout, use nice alignment.			// Create the alloca. If we have DataLayout, use nice alignment.
	unsigned Align =			unsigned Align = DL.getPrefTypeAlignment(AggTy);
	Caller->getParent()->getDataLayout().getPrefTypeAlignment(AggTy);

	// If the byval had an alignment specified, we must use at least that			// If the byval had an alignment specified, we must use at least that
	// alignment, as it is required by the byval argument (and uses of the			// alignment, as it is required by the byval argument (and uses of the
	// pointer inside the callee).			// pointer inside the callee).
	Align = std::max(Align, ByValAlignment);			Align = std::max(Align, ByValAlignment);

	Value *NewAlloca = new AllocaInst(AggTy, nullptr, Align, Arg->getName(),			Value *NewAlloca = new AllocaInst(AggTy, DL.getStackAddrSpace(),
				nullptr, Align, Arg->getName(),
	&*Caller->begin()->begin());			&*Caller->begin()->begin());
	IFI.StaticAllocas.push_back(cast<AllocaInst>(NewAlloca));			IFI.StaticAllocas.push_back(cast<AllocaInst>(NewAlloca));

	// Uses of the argument in the function should use our new alloca			// Uses of the argument in the function should use our new alloca
	// instead.			// instead.
	return NewAlloca;			return NewAlloca;
	}			}

	// Check whether this Value is used by a lifetime intrinsic.			// Check whether this Value is used by a lifetime intrinsic.
	static bool isUsedByLifetimeMarker(Value *V) {			static bool isUsedByLifetimeMarker(Value *V) {
	for (User *U : V->users()) {			for (User *U : V->users()) {
	▲ Show 20 Lines • Show All 1,012 Lines • Show Last 20 Lines

test/Assembler/alloca-addrspace-parse-error-0.ll

This file was added.

				; RUN: not llvm-as < %s 2>&1 | FileCheck %s

				target datalayout = "A1"

				; CHECK: :8:3: error: expected metadata after comma
				define void @use_alloca() {
				%alloca = alloca i32, addrspace(1),
				ret void
				}

				!0 = !{}

test/Assembler/alloca-addrspace-parse-error-1.ll

This file was added.

				; RUN: not llvm-as < %s 2>&1 | FileCheck %s

				target datalayout = "A1"

				; addrspace and align in wrong order
				; CHECK: :8:39: error: expected metadata after comma
				define void @use_alloca() {
				%alloca = alloca i32, addrspace(1), align 4
				ret void
				}

				!0 = !{}

test/Assembler/alloca-addrspace0.ll

This file was added.

				; RUN: llvm-as < %s | llvm-dis | FileCheck %s

				target datalayout = "A0"
				; CHECK: target datalayout = "A0"


				; CHECK: %alloca_scalar_no_align = alloca i32
				; CHECK-NEXT: %alloca_scalar_align4 = alloca i32, align 4
				; CHECK-NEXT: %alloca_scalar_no_align_metadata = alloca i32, !foo !0
				; CHECK-NEXT: %alloca_scalar_align4_metadata = alloca i32, align 4, !foo !0
				; CHECK-NEXT: %alloca_inalloca_scalar_no_align = alloca inalloca i32
				; CHECK-NEXT: %alloca_inalloca_scalar_align4_metadata = alloca inalloca i32, align 4, !foo !0
				define void @use_alloca() {
				%alloca_scalar_no_align = alloca i32, addrspace(0)
				%alloca_scalar_align4 = alloca i32, align 4, addrspace(0)
				%alloca_scalar_no_align_metadata = alloca i32, addrspace(0), !foo !0
				%alloca_scalar_align4_metadata = alloca i32, align 4, addrspace(0), !foo !0
				%alloca_inalloca_scalar_no_align = alloca inalloca i32, addrspace(0)
				%alloca_inalloca_scalar_align4_metadata = alloca inalloca i32, align 4, addrspace(0), !foo !0

				ret void
				}

				!0 = !{}

test/Assembler/datalayout-alloca-addrspace-mismatch-0.ll

This file was added.

				; RUN: not llvm-as < %s 2>&1 | FileCheck %s

				target datalayout = "A1"

				; CHECK: :7:41: error: address space must match datalayout
				define void @use_alloca() {
				%alloca_scalar_no_align = alloca i32, addrspace(2)
				ret void
				}

test/Assembler/datalayout-alloca-addrspace-mismatch-1.ll

This file was added.

				; RUN: not llvm-as < %s 2>&1 | FileCheck %s

				target datalayout = "A1"

				; CHECK: :7:50: error: address space must match datalayout
				define void @use_alloca() {
				%alloca_scalar_no_align = alloca i32, align 4, addrspace(2)
				ret void
				}

test/Assembler/datalayout-alloca-addrspace-mismatch-2.ll

This file was added.

				; RUN: not llvm-as < %s 2>&1 | FileCheck %s

				target datalayout = "A1"

				; CHECK: :7:50: error: address space must match datalayout
				define void @use_alloca() {
				%alloca_scalar_no_align = alloca i32, align 4, addrspace(2), !foo !0
				ret void
				}

				!0 = !{}

test/Assembler/datalayout-alloca-addrspace.ll

This file was added.

				; RUN: llvm-as < %s | llvm-dis | FileCheck %s

				target datalayout = "A1"
				; CHECK: target datalayout = "A1"

				; CHECK: %alloca_scalar_no_align = alloca i32, addrspace(1)
				; CHECK-NEXT: %alloca_scalar_align4 = alloca i32, align 4, addrspace(1)
				; CHECK-NEXT: %alloca_scalar_no_align_metadata = alloca i32, addrspace(1), !foo !0
				; CHECK-NEXT: %alloca_scalar_align4_metadata = alloca i32, align 4, addrspace(1), !foo !0
				; CHECK-NEXT: %alloca_inalloca_scalar_no_align = alloca inalloca i32, addrspace(1)
				; CHECK-NEXT: %alloca_inalloca_scalar_align4_metadata = alloca inalloca i32, align 4, addrspace(1), !foo !0
				define void @use_alloca() {
				%alloca_scalar_no_align = alloca i32, addrspace(1)
				%alloca_scalar_align4 = alloca i32, align 4, addrspace(1)
				%alloca_scalar_no_align_metadata = alloca i32, addrspace(1), !foo !0
				%alloca_scalar_align4_metadata = alloca i32, align 4, addrspace(1), !foo !0
				%alloca_inalloca_scalar_no_align = alloca inalloca i32, addrspace(1)
				%alloca_inalloca_scalar_align4_metadata = alloca inalloca i32, align 4, addrspace(1), !foo !0

				ret void
				}

				!0 = !{}
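
The two round-trip tests above (`alloca-addrspace0.ll` and `datalayout-alloca-addrspace.ll`) suggest a printing rule: `addrspace(N)` follows any `align`, and is left out when the address space is 0. The sketch below is a toy model of that rule only, not LLVM's AsmWriter. The function name `formatAlloca` is hypothetical, and `inalloca`, array sizes, and metadata are deliberately ignored.

```cpp
#include <cassert>
#include <string>

// Toy model of the textual alloca form seen in the CHECK lines above.
// Assumption: Align == 0 means "no explicit alignment was written".
std::string formatAlloca(const std::string &Ty, unsigned Align,
                         unsigned AddrSpace) {
  assert(!Ty.empty() && "expected a type name such as \"i32\"");
  std::string Out = "alloca " + Ty;
  if (Align != 0)
    Out += ", align " + std::to_string(Align);
  // Address space 0 stays implicit; any other value is printed last.
  if (AddrSpace != 0)
    Out += ", addrspace(" + std::to_string(AddrSpace) + ")";
  return Out;
}
```

Under these assumptions, `formatAlloca("i32", 4, 1)` yields the same text as the `%alloca_scalar_align4` CHECK line for the `A1` datalayout, and `formatAlloca("i32", 0, 0)` yields the bare `alloca i32` expected for `A0`.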

test/Assembler/invalid-datalayout-alloca-addrspace.ll

This file was added.

				; RUN: not llvm-as < %s 2>&1 | FileCheck %s

				target datalayout = "A16777216"
				; CHECK: Invalid address space, must be a 24bit integer
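
The value in this test, 16777216, is exactly 2^24, the first address space number that no longer fits in 24 bits. A minimal sketch of such a range check follows, assuming a single `A<n>` datalayout component is handed in on its own. The helper `parseAllocaAddrSpace` is hypothetical and is not the parser in LLVM's `DataLayout`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>

// Hypothetical helper: parses one "A<n>" component and stores <n> in
// AddrSpace. Returns false on malformed input or if <n> needs more than
// 24 bits.
bool parseAllocaAddrSpace(const std::string &Spec, unsigned &AddrSpace) {
  if (Spec.size() < 2 || Spec[0] != 'A')
    return false;
  std::uint64_t Value = 0;
  for (std::size_t I = 1; I < Spec.size(); ++I) {
    const char C = Spec[I];
    if (C < '0' || C > '9')
      return false;
    Value = Value * 10 + static_cast<std::uint64_t>(C - '0');
    // Rejecting early also keeps Value from overflowing on long inputs.
    if (Value >= (1u << 24))
      return false;
  }
  AddrSpace = static_cast<unsigned>(Value);
  return true;
}
```

With this sketch, `"A1"` and `"A16777215"` are accepted, while `"A16777216"` from the test above is rejected.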

test/Transforms/SROA/alloca-address-space.ll

This file was added.

				; RUN: opt < %s -sroa -S | FileCheck %s
				target datalayout = "e-p:64:64:64-p1:16:16:16-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-n8:16:32:64-A2"

				declare void @llvm.memcpy.p2i8.p2i8.i32(i8 addrspace(2)* nocapture, i8 addrspace(2)* nocapture readonly, i32, i32, i1)
				declare void @llvm.memcpy.p1i8.p2i8.i32(i8 addrspace(1)* nocapture, i8 addrspace(2)* nocapture readonly, i32, i32, i1)
				declare void @llvm.memcpy.p2i8.p1i8.i32(i8 addrspace(2)* nocapture, i8 addrspace(1)* nocapture readonly, i32, i32, i1)
				declare void @llvm.memcpy.p1i8.p1i8.i32(i8 addrspace(1)* nocapture, i8 addrspace(1)* nocapture readonly, i32, i32, i1)



				; CHECK-LABEL: @test_address_space_1_1(
				; CHECK: load <2 x i64>, <2 x i64> addrspace(1)* %a, align 2
				; CHECK: store <2 x i64> {{.*}}, <2 x i64> addrspace(1)* {{.*}}, align 2
				; CHECK: ret void
				define void @test_address_space_1_1(<2 x i64> addrspace(1)* %a, i16 addrspace(1)* %b) {
				%aa = alloca <2 x i64>, align 16, addrspace(2)
				%aptr = bitcast <2 x i64> addrspace(1)* %a to i8 addrspace(1)*
				%aaptr = bitcast <2 x i64> addrspace(2)* %aa to i8 addrspace(2)*
				call void @llvm.memcpy.p2i8.p1i8.i32(i8 addrspace(2)* %aaptr, i8 addrspace(1)* %aptr, i32 16, i32 2, i1 false)
				%bptr = bitcast i16 addrspace(1)* %b to i8 addrspace(1)*
				call void @llvm.memcpy.p1i8.p2i8.i32(i8 addrspace(1)* %bptr, i8 addrspace(2)* %aaptr, i32 16, i32 2, i1 false)
				ret void
				}

				; CHECK-LABEL: @test_address_space_1_0(
				; CHECK: load <2 x i64>, <2 x i64> addrspace(1)* %a, align 2
				; CHECK: store <2 x i64> {{.*}}, <2 x i64> addrspace(2)* {{.*}}, align 2
				; CHECK: ret void
				define void @test_address_space_1_0(<2 x i64> addrspace(1)* %a, i16 addrspace(2)* %b) {
				%aa = alloca <2 x i64>, align 16, addrspace(2)
				%aptr = bitcast <2 x i64> addrspace(1)* %a to i8 addrspace(1)*
				%aaptr = bitcast <2 x i64> addrspace(2)* %aa to i8 addrspace(2)*
				call void @llvm.memcpy.p2i8.p1i8.i32(i8 addrspace(2)* %aaptr, i8 addrspace(1)* %aptr, i32 16, i32 2, i1 false)
				%bptr = bitcast i16 addrspace(2)* %b to i8 addrspace(2)*
				call void @llvm.memcpy.p2i8.p2i8.i32(i8 addrspace(2)* %bptr, i8 addrspace(2)* %aaptr, i32 16, i32 2, i1 false)
				ret void
				}

				; CHECK-LABEL: @test_address_space_0_1(
				; CHECK: load <2 x i64>, <2 x i64> addrspace(2)* %a, align 2
				; CHECK: store <2 x i64> {{.*}}, <2 x i64> addrspace(1)* {{.*}}, align 2
				; CHECK: ret void
				define void @test_address_space_0_1(<2 x i64> addrspace(2)* %a, i16 addrspace(1)* %b) {
				%aa = alloca <2 x i64>, align 16, addrspace(2)
				%aptr = bitcast <2 x i64> addrspace(2)* %a to i8 addrspace(2)*
				%aaptr = bitcast <2 x i64> addrspace(2)* %aa to i8 addrspace(2)*
				call void @llvm.memcpy.p2i8.p2i8.i32(i8 addrspace(2)* %aaptr, i8 addrspace(2)* %aptr, i32 16, i32 2, i1 false)
				%bptr = bitcast i16 addrspace(1)* %b to i8 addrspace(1)*
				call void @llvm.memcpy.p1i8.p2i8.i32(i8 addrspace(1)* %bptr, i8 addrspace(2)* %aaptr, i32 16, i32 2, i1 false)
				ret void
				}

				%struct.struct_test_27.0.13 = type { i32, float, i64, i8, [4 x i32] }

				; CHECK-LABEL: @copy_struct(
				; CHECK-NOT: memcpy
				define void @copy_struct([5 x i64] %in.coerce) {
				for.end:
				%in = alloca %struct.struct_test_27.0.13, align 8, addrspace(2)
				%0 = bitcast %struct.struct_test_27.0.13 addrspace(2)* %in to [5 x i64] addrspace(2)*
				store [5 x i64] %in.coerce, [5 x i64] addrspace(2)* %0, align 8
				%scevgep9 = getelementptr %struct.struct_test_27.0.13, %struct.struct_test_27.0.13 addrspace(2)* %in, i32 0, i32 4, i32 0
				%scevgep910 = bitcast i32 addrspace(2)* %scevgep9 to i8 addrspace(2)*
				call void @llvm.memcpy.p1i8.p2i8.i32(i8 addrspace(1)* undef, i8 addrspace(2)* %scevgep910, i32 16, i32 4, i1 false)
				ret void
				}

				%union.anon = type { i32* }

				@g = common global i32 0, align 4
				@l = common addrspace(3) global i32 0, align 4

				; Make sure an illegal bitcast isn't introduced
				; CHECK-LABEL: @pr27557(
				; CHECK: %[[CAST:.*]] = bitcast i32* addrspace(2)* {{.*}} to i32 addrspace(3)* addrspace(2)*
				; CHECK: store i32 addrspace(3)* @l, i32 addrspace(3)* addrspace(2)* %[[CAST]]
				define void @pr27557() {
				%1 = alloca %union.anon, align 8, addrspace(2)
				%2 = bitcast %union.anon addrspace(2)* %1 to i32* addrspace(2)*
				store i32* @g, i32* addrspace(2)* %2, align 8
				%3 = bitcast %union.anon addrspace(2)* %1 to i32 addrspace(3)* addrspace(2)*
				store i32 addrspace(3)* @l, i32 addrspace(3)* addrspace(2)* %3, align 8
				ret void
				}

tools/llvm-stress/llvm-stress.cpp

Show First 20 Lines • Show All 418 Lines • ▼ Show 20 Lines	struct ConstModifier: public Modifier {
}		}
};		};

struct AllocaModifier: public Modifier {		struct AllocaModifier: public Modifier {
AllocaModifier(BasicBlock *BB, PieceTable *PT, Random *R):Modifier(BB, PT, R){}		AllocaModifier(BasicBlock *BB, PieceTable *PT, Random *R):Modifier(BB, PT, R){}

void Act() override {		void Act() override {
Type *Tp = pickType();		Type *Tp = pickType();
PT->push_back(new AllocaInst(Tp, "A", BB->getFirstNonPHI()));		const DataLayout &DL = BB->getModule()->getDataLayout();
		PT->push_back(new AllocaInst(Tp, DL.getStackAddrSpace(),
		"A", BB->getFirstNonPHI()));
}		}
};		};

struct ExtractElementModifier: public Modifier {		struct ExtractElementModifier: public Modifier {
ExtractElementModifier(BasicBlock *BB, PieceTable *PT, Random *R):		ExtractElementModifier(BasicBlock *BB, PieceTable *PT, Random *R):
Modifier(BB, PT, R) {}		Modifier(BB, PT, R) {}

void Act() override {		void Act() override {
▲ Show 20 Lines • Show All 291 Lines • Show Last 20 Lines

unittests/Analysis/ScalarEvolutionTest.cpp

Show First 20 Lines • Show All 300 Lines • ▼ Show 20 Lines	TEST_F(ScalarEvolutionsTest, ExpandPtrTypeSCEV) {
// %bitcast1 = bitcast i32* %gep0 to i8*		// %bitcast1 = bitcast i32* %gep0 to i8*
// %gep1 = getelementptr i8, i8* %bitcast1, i32 1		// %gep1 = getelementptr i8, i8* %bitcast1, i32 1
// %gep2 = getelementptr i8, i8* undef, i32 1		// %gep2 = getelementptr i8, i8* undef, i32 1
// %cmp = icmp ult i8* undef, %bitcast1		// %cmp = icmp ult i8* undef, %bitcast1
// %select = select i1 %cmp, i8* %gep1, i8* %gep2		// %select = select i1 %cmp, i8* %gep1, i8* %gep2
// %bitcast2 = bitcast i8* %select to i32*		// %bitcast2 = bitcast i8* %select to i32*
// br i1 undef, label %loop, label %exit		// br i1 undef, label %loop, label %exit

		const DataLayout &DL = F->getParent()->getDataLayout();
BranchInst *Br = BranchInst::Create(		BranchInst *Br = BranchInst::Create(
LoopBB, ExitBB, UndefValue::get(Type::getInt1Ty(Context)), LoopBB);		LoopBB, ExitBB, UndefValue::get(Type::getInt1Ty(Context)), LoopBB);
AllocaInst *Alloca = new AllocaInst(I32Ty, "alloca", Br);		AllocaInst *Alloca = new AllocaInst(I32Ty, DL.getStackAddrSpace(),
		"alloca", Br);
ConstantInt *Ci32 = ConstantInt::get(Context, APInt(32, 1));		ConstantInt *Ci32 = ConstantInt::get(Context, APInt(32, 1));
GetElementPtrInst *Gep0 =		GetElementPtrInst *Gep0 =
GetElementPtrInst::Create(I32Ty, Alloca, Ci32, "gep0", Br);		GetElementPtrInst::Create(I32Ty, Alloca, Ci32, "gep0", Br);
CastInst *CastA =		CastInst *CastA =
CastInst::CreateBitOrPointerCast(Gep0, I8PtrTy, "bitcast1", Br);		CastInst::CreateBitOrPointerCast(Gep0, I8PtrTy, "bitcast1", Br);
GetElementPtrInst *Gep1 =		GetElementPtrInst *Gep1 =
GetElementPtrInst::Create(I8Ty, CastA, Ci32, "gep1", Br);		GetElementPtrInst::Create(I8Ty, CastA, Ci32, "gep1", Br);
GetElementPtrInst *Gep2 = GetElementPtrInst::Create(		GetElementPtrInst *Gep2 = GetElementPtrInst::Create(
▲ Show 20 Lines • Show All 285 Lines • Show Last 20 Lines

unittests/IR/IRBuilderTest.cpp

Show All 28 Lines	protected:
void SetUp() override {		void SetUp() override {
M.reset(new Module("MyModule", Ctx));		M.reset(new Module("MyModule", Ctx));
FunctionType *FTy = FunctionType::get(Type::getVoidTy(Ctx),		FunctionType *FTy = FunctionType::get(Type::getVoidTy(Ctx),
/*isVarArg=*/false);		/*isVarArg=*/false);
F = Function::Create(FTy, Function::ExternalLinkage, "", M.get());		F = Function::Create(FTy, Function::ExternalLinkage, "", M.get());
BB = BasicBlock::Create(Ctx, "", F);		BB = BasicBlock::Create(Ctx, "", F);
GV = new GlobalVariable(*M, Type::getFloatTy(Ctx), true,		GV = new GlobalVariable(*M, Type::getFloatTy(Ctx), true,
GlobalValue::ExternalLinkage, nullptr);		GlobalValue::ExternalLinkage, nullptr);
		DL = &M->getDataLayout();
}		}

void TearDown() override {		void TearDown() override {
BB = nullptr;		BB = nullptr;
M.reset();		M.reset();
}		}

LLVMContext Ctx;		LLVMContext Ctx;
std::unique_ptr<Module> M;		std::unique_ptr<Module> M;
Function *F;		Function *F;
BasicBlock *BB;		BasicBlock *BB;
GlobalVariable *GV;		GlobalVariable *GV;
		const DataLayout *DL;
};		};

TEST_F(IRBuilderTest, Lifetime) {		TEST_F(IRBuilderTest, Lifetime) {
IRBuilder<> Builder(BB);		IRBuilder<> Builder(BB);
AllocaInst *Var1 = Builder.CreateAlloca(Builder.getInt8Ty());		AllocaInst *Var1 = Builder.CreateAlloca(*DL, Builder.getInt8Ty());
AllocaInst *Var2 = Builder.CreateAlloca(Builder.getInt32Ty());		AllocaInst *Var2 = Builder.CreateAlloca(*DL, Builder.getInt32Ty());
AllocaInst *Var3 = Builder.CreateAlloca(Builder.getInt8Ty(),		AllocaInst *Var3 = Builder.CreateAlloca(*DL, Builder.getInt8Ty(),
Builder.getInt32(123));		Builder.getInt32(123));

CallInst *Start1 = Builder.CreateLifetimeStart(Var1);		CallInst *Start1 = Builder.CreateLifetimeStart(Var1);
CallInst *Start2 = Builder.CreateLifetimeStart(Var2);		CallInst *Start2 = Builder.CreateLifetimeStart(Var2);
CallInst *Start3 = Builder.CreateLifetimeStart(Var3, Builder.getInt64(100));		CallInst *Start3 = Builder.CreateLifetimeStart(Var3, Builder.getInt64(100));

EXPECT_EQ(Start1->getArgOperand(0), Builder.getInt64(-1));		EXPECT_EQ(Start1->getArgOperand(0), Builder.getInt64(-1));
EXPECT_EQ(Start2->getArgOperand(0), Builder.getInt64(-1));		EXPECT_EQ(Start2->getArgOperand(0), Builder.getInt64(-1));
▲ Show 20 Lines • Show All 278 Lines • ▼ Show 20 Lines	TEST_F(IRBuilderTest, DIBuilder) {
auto File = DIB.createFile("F.CBL", "/");		auto File = DIB.createFile("F.CBL", "/");
auto CU = DIB.createCompileUnit(dwarf::DW_LANG_Cobol74,		auto CU = DIB.createCompileUnit(dwarf::DW_LANG_Cobol74,
DIB.createFile("F.CBL", "/"), "llvm-cobol74",		DIB.createFile("F.CBL", "/"), "llvm-cobol74",
true, "", 0);		true, "", 0);
auto Type = DIB.createSubroutineType(DIB.getOrCreateTypeArray(None));		auto Type = DIB.createSubroutineType(DIB.getOrCreateTypeArray(None));
auto SP = DIB.createFunction(CU, "foo", "", File, 1, Type, false, true, 1,		auto SP = DIB.createFunction(CU, "foo", "", File, 1, Type, false, true, 1,
DINode::FlagZero, true);		DINode::FlagZero, true);
F->setSubprogram(SP);		F->setSubprogram(SP);
AllocaInst *I = Builder.CreateAlloca(Builder.getInt8Ty());		AllocaInst *I = Builder.CreateAlloca(*DL, Builder.getInt8Ty());
auto BarSP = DIB.createFunction(CU, "bar", "", File, 1, Type, false, true, 1,		auto BarSP = DIB.createFunction(CU, "bar", "", File, 1, Type, false, true, 1,
DINode::FlagZero, true);		DINode::FlagZero, true);
auto BadScope = DIB.createLexicalBlockFile(BarSP, File, 0);		auto BadScope = DIB.createLexicalBlockFile(BarSP, File, 0);
I->setDebugLoc(DebugLoc::get(2, 0, BadScope));		I->setDebugLoc(DebugLoc::get(2, 0, BadScope));
DIB.finalize();		DIB.finalize();
EXPECT_TRUE(verifyModule(*M));		EXPECT_TRUE(verifyModule(*M));
}		}

▲ Show 20 Lines • Show All 149 Lines • Show Last 20 Lines

unittests/Transforms/Utils/Cloning.cpp

Show First 20 Lines • Show All 287 Lines • ▼ Show 20 Lines	auto *Subprogram =
DINode::FlagZero, false);		DINode::FlagZero, false);
OldFunc->setSubprogram(Subprogram);		OldFunc->setSubprogram(Subprogram);

// Function body		// Function body
BasicBlock* Entry = BasicBlock::Create(C, "", OldFunc);		BasicBlock* Entry = BasicBlock::Create(C, "", OldFunc);
IBuilder.SetInsertPoint(Entry);		IBuilder.SetInsertPoint(Entry);
DebugLoc Loc = DebugLoc::get(3, 2, Subprogram);		DebugLoc Loc = DebugLoc::get(3, 2, Subprogram);
IBuilder.SetCurrentDebugLocation(Loc);		IBuilder.SetCurrentDebugLocation(Loc);
AllocaInst* Alloca = IBuilder.CreateAlloca(IntegerType::getInt32Ty(C));		AllocaInst* Alloca = IBuilder.CreateAlloca(M->getDataLayout(),
		IntegerType::getInt32Ty(C));
IBuilder.SetCurrentDebugLocation(DebugLoc::get(4, 2, Subprogram));		IBuilder.SetCurrentDebugLocation(DebugLoc::get(4, 2, Subprogram));
Value* AllocaContent = IBuilder.getInt32(1);		Value* AllocaContent = IBuilder.getInt32(1);
Instruction* Store = IBuilder.CreateStore(AllocaContent, Alloca);		Instruction* Store = IBuilder.CreateStore(AllocaContent, Alloca);
IBuilder.SetCurrentDebugLocation(DebugLoc::get(5, 2, Subprogram));		IBuilder.SetCurrentDebugLocation(DebugLoc::get(5, 2, Subprogram));
Instruction* Terminator = IBuilder.CreateRetVoid();		Instruction* Terminator = IBuilder.CreateRetVoid();

// Create a local variable around the alloca		// Create a local variable around the alloca
auto *IntType = DBuilder.createBasicType("int", 32, dwarf::DW_ATE_signed);		auto *IntType = DBuilder.createBasicType("int", 32, dwarf::DW_ATE_signed);
▲ Show 20 Lines • Show All 234 Lines • Show Last 20 Lines

unittests/Transforms/Utils/MemorySSA.cpp

	Show First 20 Lines • Show All 547 Lines • ▼ Show 20 Lines
	// ; 3 = MemoryDef(2)			// ; 3 = MemoryDef(2)
	// store i8 2, i8* %A			// store i8 2, i8* %A
	// }			// }
	TEST_F(MemorySSATest, TestTripleStore) {			TEST_F(MemorySSATest, TestTripleStore) {
	F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),			F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),
	GlobalValue::ExternalLinkage, "F", &M);			GlobalValue::ExternalLinkage, "F", &M);
	B.SetInsertPoint(BasicBlock::Create(C, "", F));			B.SetInsertPoint(BasicBlock::Create(C, "", F));
	Type *Int8 = Type::getInt8Ty(C);			Type *Int8 = Type::getInt8Ty(C);
	Value *Alloca = B.CreateAlloca(Int8, ConstantInt::get(Int8, 1), "A");			Value *Alloca = B.CreateAlloca(DL, Int8, ConstantInt::get(Int8, 1), "A");
	StoreInst *S1 = B.CreateStore(ConstantInt::get(Int8, 0), Alloca);			StoreInst *S1 = B.CreateStore(ConstantInt::get(Int8, 0), Alloca);
	StoreInst *S2 = B.CreateStore(ConstantInt::get(Int8, 1), Alloca);			StoreInst *S2 = B.CreateStore(ConstantInt::get(Int8, 1), Alloca);
	StoreInst *S3 = B.CreateStore(ConstantInt::get(Int8, 2), Alloca);			StoreInst *S3 = B.CreateStore(ConstantInt::get(Int8, 2), Alloca);

	setupAnalyses();			setupAnalyses();
	MemorySSA &MSSA = *Analyses->MSSA;			MemorySSA &MSSA = *Analyses->MSSA;
	MemorySSAWalker *Walker = Analyses->Walker;			MemorySSAWalker *Walker = Analyses->Walker;

	Show All 14 Lines
	// walker was caching the initial node it walked. This was fine (albeit			// walker was caching the initial node it walked. This was fine (albeit
	// mostly redundant) unless the initial node being walked is a clobber for the			// mostly redundant) unless the initial node being walked is a clobber for the
	// query. In that case, we'd cache that the node clobbered itself.			// query. In that case, we'd cache that the node clobbered itself.
	TEST_F(MemorySSATest, TestStoreAndLoad) {			TEST_F(MemorySSATest, TestStoreAndLoad) {
	F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),			F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),
	GlobalValue::ExternalLinkage, "F", &M);			GlobalValue::ExternalLinkage, "F", &M);
	B.SetInsertPoint(BasicBlock::Create(C, "", F));			B.SetInsertPoint(BasicBlock::Create(C, "", F));
	Type *Int8 = Type::getInt8Ty(C);			Type *Int8 = Type::getInt8Ty(C);
	Value *Alloca = B.CreateAlloca(Int8, ConstantInt::get(Int8, 1), "A");			Value *Alloca = B.CreateAlloca(DL, Int8, ConstantInt::get(Int8, 1), "A");
	Instruction *SI = B.CreateStore(ConstantInt::get(Int8, 0), Alloca);			Instruction *SI = B.CreateStore(ConstantInt::get(Int8, 0), Alloca);
	Instruction *LI = B.CreateLoad(Alloca);			Instruction *LI = B.CreateLoad(Alloca);

	setupAnalyses();			setupAnalyses();
	MemorySSA &MSSA = *Analyses->MSSA;			MemorySSA &MSSA = *Analyses->MSSA;
	MemorySSAWalker *Walker = Analyses->Walker;			MemorySSAWalker *Walker = Analyses->Walker;

	MemoryAccess *LoadClobber = Walker->getClobberingMemoryAccess(LI);			MemoryAccess *LoadClobber = Walker->getClobberingMemoryAccess(LI);
	Show All 13 Lines
	//			//
	// This test checks that repeated calls to either function returns what they're			// This test checks that repeated calls to either function returns what they're
	// meant to.			// meant to.
	TEST_F(MemorySSATest, TestStoreDoubleQuery) {			TEST_F(MemorySSATest, TestStoreDoubleQuery) {
	F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),			F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),
	GlobalValue::ExternalLinkage, "F", &M);			GlobalValue::ExternalLinkage, "F", &M);
	B.SetInsertPoint(BasicBlock::Create(C, "", F));			B.SetInsertPoint(BasicBlock::Create(C, "", F));
	Type *Int8 = Type::getInt8Ty(C);			Type *Int8 = Type::getInt8Ty(C);
	Value *Alloca = B.CreateAlloca(Int8, ConstantInt::get(Int8, 1), "A");			Value *Alloca = B.CreateAlloca(DL, Int8, ConstantInt::get(Int8, 1), "A");
	StoreInst *SI = B.CreateStore(ConstantInt::get(Int8, 0), Alloca);			StoreInst *SI = B.CreateStore(ConstantInt::get(Int8, 0), Alloca);

	setupAnalyses();			setupAnalyses();
	MemorySSA &MSSA = *Analyses->MSSA;			MemorySSA &MSSA = *Analyses->MSSA;
	MemorySSAWalker *Walker = Analyses->Walker;			MemorySSAWalker *Walker = Analyses->Walker;

	MemoryAccess *StoreAccess = MSSA.getMemoryAccess(SI);			MemoryAccess *StoreAccess = MSSA.getMemoryAccess(SI);
	MemoryLocation StoreLoc = MemoryLocation::get(SI);			MemoryLocation StoreLoc = MemoryLocation::get(SI);
	Show All 29 Lines
	// walking* 'B'.			// walking* 'B'.
	TEST_F(MemorySSATest, PartialWalkerCacheWithPhis) {			TEST_F(MemorySSATest, PartialWalkerCacheWithPhis) {
	F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),			F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),
	GlobalValue::ExternalLinkage, "F", &M);			GlobalValue::ExternalLinkage, "F", &M);
	B.SetInsertPoint(BasicBlock::Create(C, "A", F));			B.SetInsertPoint(BasicBlock::Create(C, "A", F));
	Type *Int8 = Type::getInt8Ty(C);			Type *Int8 = Type::getInt8Ty(C);
	Constant *One = ConstantInt::get(Int8, 1);			Constant *One = ConstantInt::get(Int8, 1);
	Constant *Zero = ConstantInt::get(Int8, 0);			Constant *Zero = ConstantInt::get(Int8, 0);
	Value *AllocA = B.CreateAlloca(Int8, One, "a");			Value *AllocA = B.CreateAlloca(DL, Int8, One, "a");
	Value *AllocB = B.CreateAlloca(Int8, One, "b");			Value *AllocB = B.CreateAlloca(DL, Int8, One, "b");
	BasicBlock *IfThen = BasicBlock::Create(C, "B", F);			BasicBlock *IfThen = BasicBlock::Create(C, "B", F);
	BasicBlock *IfEnd = BasicBlock::Create(C, "C", F);			BasicBlock *IfEnd = BasicBlock::Create(C, "C", F);

	B.CreateCondBr(UndefValue::get(Type::getInt1Ty(C)), IfThen, IfEnd);			B.CreateCondBr(UndefValue::get(Type::getInt1Ty(C)), IfThen, IfEnd);

	B.SetInsertPoint(IfThen);			B.SetInsertPoint(IfThen);
	Instruction *FirstStore = B.CreateStore(Zero, AllocA);			Instruction *FirstStore = B.CreateStore(Zero, AllocA);
	B.CreateStore(Zero, AllocB);			B.CreateStore(Zero, AllocB);
	▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines
	// isn't what we want.			// isn't what we want.
	// FIXME: It may be easier/cleaner to just add an 'optimize uses?' flag to MSSA.			// FIXME: It may be easier/cleaner to just add an 'optimize uses?' flag to MSSA.
	TEST_F(MemorySSATest, WalkerInvariantLoadOpt) {			TEST_F(MemorySSATest, WalkerInvariantLoadOpt) {
	F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),			F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),
	GlobalValue::ExternalLinkage, "F", &M);			GlobalValue::ExternalLinkage, "F", &M);
	B.SetInsertPoint(BasicBlock::Create(C, "", F));			B.SetInsertPoint(BasicBlock::Create(C, "", F));
	Type *Int8 = Type::getInt8Ty(C);			Type *Int8 = Type::getInt8Ty(C);
	Constant *One = ConstantInt::get(Int8, 1);			Constant *One = ConstantInt::get(Int8, 1);
	Value *AllocA = B.CreateAlloca(Int8, One, "");			Value *AllocA = B.CreateAlloca(DL, Int8, One, "");

	Instruction *Store = B.CreateStore(One, AllocA);			Instruction *Store = B.CreateStore(One, AllocA);
	Instruction *Load = B.CreateLoad(AllocA);			Instruction *Load = B.CreateLoad(AllocA);

	setupAnalyses();			setupAnalyses();
	MemorySSA &MSSA = *Analyses->MSSA;			MemorySSA &MSSA = *Analyses->MSSA;
	MemorySSAWalker *Walker = Analyses->Walker;			MemorySSAWalker *Walker = Analyses->Walker;

	Show All 11 Lines
 }

 // Test loads get reoptimized properly by the walker.
 TEST_F(MemorySSATest, WalkerReopt) {
   F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),
                        GlobalValue::ExternalLinkage, "F", &M);
   B.SetInsertPoint(BasicBlock::Create(C, "", F));
   Type *Int8 = Type::getInt8Ty(C);
-  Value *AllocaA = B.CreateAlloca(Int8, ConstantInt::get(Int8, 1), "A");
+  Value *AllocaA = B.CreateAlloca(DL, Int8, ConstantInt::get(Int8, 1), "A");
   Instruction *SIA = B.CreateStore(ConstantInt::get(Int8, 0), AllocaA);
-  Value *AllocaB = B.CreateAlloca(Int8, ConstantInt::get(Int8, 1), "B");
+  Value *AllocaB = B.CreateAlloca(DL, Int8, ConstantInt::get(Int8, 1), "B");
   Instruction *SIB = B.CreateStore(ConstantInt::get(Int8, 0), AllocaB);
   Instruction *LIA = B.CreateLoad(AllocaA);

   setupAnalyses();
   MemorySSA &MSSA = *Analyses->MSSA;
   MemorySSAWalker *Walker = Analyses->Walker;
   MemorySSAUpdater Updater(&MSSA);

 [13 lines collapsed]

 // Test out MemorySSAUpdater::moveBefore
 TEST_F(MemorySSATest, MoveAboveMemoryDef) {
   F = Function::Create(FunctionType::get(B.getVoidTy(), {}, false),
                        GlobalValue::ExternalLinkage, "F", &M);
   B.SetInsertPoint(BasicBlock::Create(C, "", F));

   Type *Int8 = Type::getInt8Ty(C);
-  Value *A = B.CreateAlloca(Int8, ConstantInt::get(Int8, 1), "A");
-  Value *B_ = B.CreateAlloca(Int8, ConstantInt::get(Int8, 1), "B");
-  Value *C = B.CreateAlloca(Int8, ConstantInt::get(Int8, 1), "C");
+  Value *A = B.CreateAlloca(DL, Int8, ConstantInt::get(Int8, 1), "A");
+  Value *B_ = B.CreateAlloca(DL, Int8, ConstantInt::get(Int8, 1), "B");
+  Value *C = B.CreateAlloca(DL, Int8, ConstantInt::get(Int8, 1), "C");

   StoreInst *StoreA0 = B.CreateStore(ConstantInt::get(Int8, 0), A);
   StoreInst *StoreB = B.CreateStore(ConstantInt::get(Int8, 0), B_);
   LoadInst *LoadB = B.CreateLoad(B_);
   StoreInst *StoreA1 = B.CreateStore(ConstantInt::get(Int8, 4), A);
   StoreInst *StoreC = B.CreateStore(ConstantInt::get(Int8, 4), C);
   StoreInst *StoreA2 = B.CreateStore(ConstantInt::get(Int8, 4), A);
   LoadInst *LoadC = B.CreateLoad(C);
 [69 lines collapsed]

Revision Contents

Diff 92080

docs/LangRef.rst

include/llvm/IR/DataLayout.h

include/llvm/IR/IRBuilder.h

include/llvm/IR/Instructions.h

lib/AsmParser/LLParser.h

lib/AsmParser/LLParser.cpp

lib/Bitcode/Reader/BitcodeReader.cpp

lib/CodeGen/AtomicExpandPass.cpp

lib/CodeGen/SafeStack.cpp

lib/CodeGen/ShadowStackGCLowering.cpp

lib/CodeGen/SjLjEHPrepare.cpp

lib/CodeGen/StackProtector.cpp

lib/CodeGen/WinEHPrepare.cpp

lib/IR/AsmWriter.cpp

lib/IR/Core.cpp

lib/IR/DataLayout.cpp

lib/IR/Instructions.cpp

lib/IR/Verifier.cpp

lib/Target/NVPTX/NVPTXLowerArgs.cpp

lib/Target/X86/X86WinEHState.cpp

lib/Transforms/Coroutines/CoroElide.cpp

lib/Transforms/Coroutines/CoroSplit.cpp

lib/Transforms/IPO/ArgumentPromotion.cpp

lib/Transforms/IPO/GlobalOpt.cpp

lib/Transforms/InstCombine/InstCombineCasts.cpp

lib/Transforms/InstCombine/InstCombineLoadStoreAlloca.cpp

lib/Transforms/Instrumentation/AddressSanitizer.cpp

lib/Transforms/Instrumentation/DataFlowSanitizer.cpp

lib/Transforms/Instrumentation/MemorySanitizer.cpp

lib/Transforms/Scalar/RewriteStatepointsForGC.cpp

lib/Transforms/Scalar/SROA.cpp

lib/Transforms/Utils/CodeExtractor.cpp

lib/Transforms/Utils/DemoteRegToStack.cpp

lib/Transforms/Utils/InlineFunction.cpp

test/Assembler/alloca-addrspace-parse-error-0.ll

test/Assembler/alloca-addrspace-parse-error-1.ll

test/Assembler/alloca-addrspace0.ll

test/Assembler/datalayout-alloca-addrspace-mismatch-0.ll

test/Assembler/datalayout-alloca-addrspace-mismatch-1.ll

test/Assembler/datalayout-alloca-addrspace-mismatch-2.ll

test/Assembler/datalayout-alloca-addrspace.ll

test/Assembler/invalid-datalayout-alloca-addrspace.ll

test/Transforms/SROA/alloca-address-space.ll

tools/llvm-stress/llvm-stress.cpp

unittests/Analysis/ScalarEvolutionTest.cpp

unittests/IR/IRBuilderTest.cpp

unittests/Transforms/Utils/Cloning.cpp

unittests/Transforms/Utils/MemorySSA.cpp
