This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/Coroutines/
-
Transforms/
-
Coroutines/
1/5
CoroFrame.cpp
1
CoroInternal.h
-
test/Transforms/Coroutines/
-
Transforms/
-
Coroutines/
-
coro-debug-frame-variable.ll
-
coro-debug.ll
-
coro-frame-arrayalloca.ll
-
coro-frame-reuse-alloca-01.ll
-
coro-frame-reuse-alloca-02.ll
-
coro-retcon-frame.ll
-
coro-retcon-once-value2.ll
-
coro-split-sink-lifetime-01.ll
-
coro-split-sink-lifetime-03.ll
-
coro-split-sink-lifetime-04.ll

Differential D88872

[Coroutines] Refactor/Rewrite Spill and Alloca processing
ClosedPublic

Authored by lxfind on Oct 5 2020, 11:11 PM.

Download Raw Diff

Details

Reviewers

junparser
ChuanqiXu
rjmccall
wenlei
lewissbaker
bruno

Commits

rG667dfe39caa0: [Coroutines] Refactor/Rewrite Spill and Alloca processing

Summary

This patch is a refactoring of how we process spills and allocas during CoroSplit.
In the previous implementation, everything that needs to go to the heap is put into Spills, including all the values defined by allocas.
And the way to identify a Spill, is to check whether there exists a use-def relationship that crosses suspension points.

This approach is fundamentally confusing, and unfortunately, incorrect.
First of all, allocas are always process differently than spills, hence it's quite confusing to put them together. It's a much cleaner to separate them and process them separately.
Doing so simplify lots of code and makes the logic more clear and easier to reason about.

Secondly, use-def relationship is insufficient to decide whether a value defined by AllocaInst needs to go to the heap.
There are many cases where a value defined by AllocaInst can implicitly be used across suspension points without a direct use-def relationship.
For example, you can store the address of an alloca into the heap, and load that address after suspension. Or you can escape the address into an object through a function call.
Or you can have a PHINode that takes two allocas, and this PHINode is used across suspension point (when this happens, the existing implementation will spill the PHINode, a.k.a a stack adddress to the heap!).
All these issues suggest that we need to separate spill and alloca in order to properly implement this.
This patch does not yet fix these bugs, however it sets up the code in a better shape so that we can start fixing them in the next patch.

The core idea of this patch is to add a new struct called FrameDataInfo, which contains all Spills, all Allocas, and a map from each definition to its layout index in the frame (FieldIndexMap).
Spills and Allocas are identified, stored and processed independently. When they are initially added to the frame, we record their field index through FieldIndexMap. When the frame layout is finalized, we update each index into their final layout index.

In doing so, I also cleaned up a few things and also discovered a few other bugs.

Cleanups:

Found out that PromiseFieldId is not used, delete it.
Previously, SpillInfo is a vector, which is strange because every def can have multiple users. This patch cleans it up by turning it into a map from def to users.
Previously, a frame Field struct contains a list of Spills that field corresponds to. This isn't necessary since we only need the layout index for each given definition. This patch removes that list. Instead, we connect each field and definition using the FieldIndexMap.
All the loops that process Spills are simplified now because we use a map instead of a vector.

Bugs:
It seems that we are only keeping llvm.dbg.declare intrinsics in the .resume part of the function. The ramp function will no longer has it. This means we are dropping some debug information in the ramp function.

The next step is to start fixing the bugs where the implementation fails to identify some allocas that should live on the frame.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

lxfind created this revision.Oct 5 2020, 11:11 PM

Herald added a project: Restricted Project. · View Herald TranscriptOct 5 2020, 11:11 PM

Herald added subscribers: llvm-commits, modimo, dexonsmith and 2 others. · View Herald Transcript

lxfind requested review of this revision.Oct 5 2020, 11:11 PM

Harbormaster completed remote builds in B74093: Diff 296363.Oct 5 2020, 11:24 PM

Thanks for your patch! It mentions some bugs about allocas we need to handle in the future.

For this patch, I'm little confusing about why we need to separate alloca from spills. In my mind, a spill means something we need to put in the frame. And an alloca which would be in the frame is naturally a spill.
I think the patch benefits from replacing

using SpillInfo = SmallVector<Spill, 8>;

using SpillInfo = SmallMapVector<Value *, SmallVector<Instruction *, 2>, 8>;

llvm/lib/Transforms/Coroutines/CoroFrame.cpp
1104	I'm confusing about the comment. It says it would migrate dbg.declare from alloca. But how could it know the `CurrentValue` must be alloca from the context ?

In D88872#2320823, @ChuanqiXu wrote:
Thanks for your patch! It mentions some bugs about allocas we need to handle in the future.

For this patch, I'm little confusing about why we need to separate alloca from spills. In my mind, a spill means something we need to put in the frame. And an alloca which would be in the frame is naturally a spill.
I think the patch benefits from replacing
using SpillInfo = SmallVector<Spill, 8>;
to
using SpillInfo = SmallMapVector<Value *, SmallVector<Instruction *, 2>, 8>;

Thanks for taking a look at the patch. If you look at the implementation, the handling of "Spills" and always different from the handling of allocas. They do share the same concept that they need to go to the frame (which is why they both belong to FrameDataInfo).
The primary reason to separate them (and hence set up the code for future fixes), is this one primary difference between them: A Spill is a direct def-use relationship that crosses suspension points; while an alloca may not be exposed to a direct def-use relationship that crosses suspension points but still need to go to the frame. The condition for them to go to the frame is fundamentally different.

llvm/lib/Transforms/Coroutines/CoroFrame.cpp
1104	I think dbg.declare instructions were all generated for alloca instructions.

In D88872#2321902, @lxfind wrote:
In D88872#2320823, @ChuanqiXu wrote:
Thanks for your patch! It mentions some bugs about allocas we need to handle in the future.

For this patch, I'm little confusing about why we need to separate alloca from spills. In my mind, a spill means something we need to put in the frame. And an alloca which would be in the frame is naturally a spill.
I think the patch benefits from replacing
using SpillInfo = SmallVector<Spill, 8>;
to
using SpillInfo = SmallMapVector<Value *, SmallVector<Instruction *, 2>, 8>;
Thanks for taking a look at the patch. If you look at the implementation, the handling of "Spills" and always different from the handling of allocas. They do share the same concept that they need to go to the frame (which is why they both belong to FrameDataInfo).
The primary reason to separate them (and hence set up the code for future fixes), is this one primary difference between them: A Spill is a direct def-use relationship that crosses suspension points; while an alloca may not be exposed to a direct def-use relationship that crosses suspension points but still need to go to the frame. The condition for them to go to the frame is fundamentally different.

I agree with we can benefit from separating alloca from spills. At least we don't need extract allocas from spills by a redundant loop any more. After we separate allocas from spills, the name spills seems a little strange. But I think it doesn't really matter.

Here is a question about the bug mentioned in summary. I write a simple code like this:

void consuming(int* pa);

task escape_alloca() {
  int a;
  consuming(&a);
  co_await something();
}

But clang would still put a in the frame in O0 or O2. I guess it is because the def of a and the use of a is cross initial_suspend (in O0 mode) or the lifetime markers is cross the co_await something point. Is there anything wrong? Or what is the example about the bug?

In D88872#2323145, @ChuanqiXu wrote:
In D88872#2321902, @lxfind wrote:
In D88872#2320823, @ChuanqiXu wrote:
Thanks for your patch! It mentions some bugs about allocas we need to handle in the future.

For this patch, I'm little confusing about why we need to separate alloca from spills. In my mind, a spill means something we need to put in the frame. And an alloca which would be in the frame is naturally a spill.
I think the patch benefits from replacing
using SpillInfo = SmallVector<Spill, 8>;
to
using SpillInfo = SmallMapVector<Value *, SmallVector<Instruction *, 2>, 8>;
Thanks for taking a look at the patch. If you look at the implementation, the handling of "Spills" and always different from the handling of allocas. They do share the same concept that they need to go to the frame (which is why they both belong to FrameDataInfo).
The primary reason to separate them (and hence set up the code for future fixes), is this one primary difference between them: A Spill is a direct def-use relationship that crosses suspension points; while an alloca may not be exposed to a direct def-use relationship that crosses suspension points but still need to go to the frame. The condition for them to go to the frame is fundamentally different.
I agree with we can benefit from separating alloca from spills. At least we don't need extract allocas from spills by a redundant loop any more. After we separate allocas from spills, the name spills seems a little strange. But I think it doesn't really matter.

Here is a question about the bug mentioned in summary. I write a simple code like this:
void consuming(int* pa);

task escape_alloca() {
  int a;
  consuming(&a);
  co_await something();
}
But clang would still put a in the frame in O0 or O2. I guess it is because the def of a and the use of a is cross initial_suspend (in O0 mode) or the lifetime markers is cross the co_await something point. Is there anything wrong? Or what is the example about the bug?

It's harder to generate an example from source code (it has to be quite complicated, I have some production code, but not small enough to share). But it's easy to see from IR examples.
Consider the following IR (you can run it through opt --coro-split:

define i8* @f(i1 %n) "coroutine.presplit"="1" {
entry:
  %a = alloca i32, align 8
  %id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
  %size = call i32 @llvm.coro.size.i32()
  %alloc = call i8* @malloc(i32 %size)
  %hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)

  %flag = call i1 @check()
  br i1 %flag, label %flag_true, label %flag_false

flag_true:
  %b = bitcast i32* %a to i8*
  br label %merge

flag_false:
  %c = bitcast i32* %a to i8*
  br label %merge

merge:
  %d = phi i8* [ %b, %flag_true ], [ %c, %flag_false ]
  %sp1 = call i8 @llvm.coro.suspend(token none, i1 false)
  switch i8 %sp1, label %suspend [i8 0, label %resume
                                  i8 1, label %cleanup]
resume:
  call void @print(i8* %d)
  br label %cleanup

cleanup:
  %mem = call i8* @llvm.coro.free(token %id, i8* %hdl)
  call void @free(i8* %mem)
  br label %suspend
suspend:
  call i1 @llvm.coro.end(i8* %hdl, i1 0)
  ret i8* %hdl
}

declare i8* @llvm.coro.free(token, i8*)
declare i32 @llvm.coro.size.i32()
declare i8  @llvm.coro.suspend(token, i1)
declare void @llvm.coro.resume(i8*)
declare void @llvm.coro.destroy(i8*)

declare token @llvm.coro.id(i32, i8*, i8*, i8*)
declare i1 @llvm.coro.alloc(token)
declare i8* @llvm.coro.begin(token, i8*)
declare i1 @llvm.coro.end(i8*, i1)

declare noalias i8* @malloc(i32)
declare i1 @check()
declare void @print(i8*)
declare void @free(i8*)

Notice that %a's alias is stored in a PHI, which is used after suspend. When you run coro-split on it, the current implementation will think that only the PHI needs to be spilled, while %a can stay on stack.
So in the generated IR, %a will still be an alloca, but a pointer to it will be store to the frame! The generated IR looks like this:

define i8* @f(i1 %n) {
entry:
  %a = alloca i32, align 8
  ...
  %b = bitcast i32* %a to i8*
  %d.spill.addr = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 2
  store i8* %b, i8** %d.spill.addr, align 8
  ...
}

This is just one example, but you can also escape it by storing the address, or call a function.

This is a nice cleanup, thank you. Just minor comments.

llvm/lib/Transforms/Coroutines/CoroFrame.cpp
721	Well, it needs to be explicitly added here because it's a header field which has to have a fixed offset based on its alignment, which is why it's not in `FrameData.Allocas`.
753	It's just "index" now.
1958	I see that this seems to be the existing algorithm, and I know this is intended to be a refactoring patch rather than a functional one, so I'll let this go without comment beyond expressing my eagerness to see a follow-up. :)
llvm/lib/Transforms/Coroutines/CoroInternal.h
125	Yeah, I think this is never used directly — clients expect to find it by offset, and within the function it's of course just referenced as a value.

This revision is now accepted and ready to land.Oct 9 2020, 10:48 PM

In D88872#2323195, @lxfind wrote:
In D88872#2323145, @ChuanqiXu wrote:
In D88872#2321902, @lxfind wrote:
In D88872#2320823, @ChuanqiXu wrote:
Thanks for your patch! It mentions some bugs about allocas we need to handle in the future.

For this patch, I'm little confusing about why we need to separate alloca from spills. In my mind, a spill means something we need to put in the frame. And an alloca which would be in the frame is naturally a spill.
I think the patch benefits from replacing
using SpillInfo = SmallVector<Spill, 8>;
to
using SpillInfo = SmallMapVector<Value *, SmallVector<Instruction *, 2>, 8>;
Thanks for taking a look at the patch. If you look at the implementation, the handling of "Spills" and always different from the handling of allocas. They do share the same concept that they need to go to the frame (which is why they both belong to FrameDataInfo).
The primary reason to separate them (and hence set up the code for future fixes), is this one primary difference between them: A Spill is a direct def-use relationship that crosses suspension points; while an alloca may not be exposed to a direct def-use relationship that crosses suspension points but still need to go to the frame. The condition for them to go to the frame is fundamentally different.
I agree with we can benefit from separating alloca from spills. At least we don't need extract allocas from spills by a redundant loop any more. After we separate allocas from spills, the name spills seems a little strange. But I think it doesn't really matter.

Here is a question about the bug mentioned in summary. I write a simple code like this:
void consuming(int* pa);

task escape_alloca() {
  int a;
  consuming(&a);
  co_await something();
}
But clang would still put a in the frame in O0 or O2. I guess it is because the def of a and the use of a is cross initial_suspend (in O0 mode) or the lifetime markers is cross the co_await something point. Is there anything wrong? Or what is the example about the bug?
It's harder to generate an example from source code (it has to be quite complicated, I have some production code, but not small enough to share). But it's easy to see from IR examples.
Consider the following IR (you can run it through opt --coro-split:
define i8* @f(i1 %n) "coroutine.presplit"="1" {
entry:
  %a = alloca i32, align 8
  %id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
  %size = call i32 @llvm.coro.size.i32()
  %alloc = call i8* @malloc(i32 %size)
  %hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)

  %flag = call i1 @check()
  br i1 %flag, label %flag_true, label %flag_false

flag_true:
  %b = bitcast i32* %a to i8*
  br label %merge

flag_false:
  %c = bitcast i32* %a to i8*
  br label %merge

merge:
  %d = phi i8* [ %b, %flag_true ], [ %c, %flag_false ]
  %sp1 = call i8 @llvm.coro.suspend(token none, i1 false)
  switch i8 %sp1, label %suspend [i8 0, label %resume
                                  i8 1, label %cleanup]
resume:
  call void @print(i8* %d)
  br label %cleanup

cleanup:
  %mem = call i8* @llvm.coro.free(token %id, i8* %hdl)
  call void @free(i8* %mem)
  br label %suspend
suspend:
  call i1 @llvm.coro.end(i8* %hdl, i1 0)
  ret i8* %hdl
}

declare i8* @llvm.coro.free(token, i8*)
declare i32 @llvm.coro.size.i32()
declare i8  @llvm.coro.suspend(token, i1)
declare void @llvm.coro.resume(i8*)
declare void @llvm.coro.destroy(i8*)

declare token @llvm.coro.id(i32, i8*, i8*, i8*)
declare i1 @llvm.coro.alloc(token)
declare i8* @llvm.coro.begin(token, i8*)
declare i1 @llvm.coro.end(i8*, i1)

declare noalias i8* @malloc(i32)
declare i1 @check()
declare void @print(i8*)
declare void @free(i8*)
Notice that %a's alias is stored in a PHI, which is used after suspend. When you run coro-split on it, the current implementation will think that only the PHI needs to be spilled, while %a can stay on stack.
So in the generated IR, %a will still be an alloca, but a pointer to it will be store to the frame! The generated IR looks like this:
define i8* @f(i1 %n) {
entry:
  %a = alloca i32, align 8
  ...
  %b = bitcast i32* %a to i8*
  %d.spill.addr = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 2
  store i8* %b, i8** %d.spill.addr, align 8
  ...
}
This is just one example, but you can also escape it by storing the address, or call a function.

Good example! I got it. This patch looks good to me.

address comments; rebase

Harbormaster completed remote builds in B74716: Diff 297439.Oct 10 2020, 10:00 PM

This revision was landed with ongoing or failed builds.Oct 10 2020, 10:23 PM

Closed by commit rG667dfe39caa0: [Coroutines] Refactor/Rewrite Spill and Alloca processing (authored by lxfind). · Explain Why

This revision was automatically updated to reflect the committed changes.

lxfind added a commit: rG667dfe39caa0: [Coroutines] Refactor/Rewrite Spill and Alloca processing.

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Coroutines/

CoroFrame.cpp

692 lines

CoroInternal.h

7 lines

test/

Transforms/

Coroutines/

coro-debug-frame-variable.ll

10 lines

coro-debug.ll

4 lines

coro-frame-arrayalloca.ll

16 lines

coro-frame-reuse-alloca-01.ll

8 lines

coro-frame-reuse-alloca-02.ll

6 lines

coro-retcon-frame.ll

2 lines

coro-retcon-once-value2.ll

3 lines

coro-split-sink-lifetime-01.ll

2 lines

coro-split-sink-lifetime-03.ll

2 lines

coro-split-sink-lifetime-04.ll

2 lines

Diff 297440

llvm/lib/Transforms/Coroutines/CoroFrame.cpp

Show First 20 Lines • Show All 286 Lines • ▼ Show 20 Lines	do {
}		}
} while (Changed);		} while (Changed);
LLVM_DEBUG(dump());		LLVM_DEBUG(dump());
}		}

#undef DEBUG_TYPE // "coro-suspend-crossing"		#undef DEBUG_TYPE // "coro-suspend-crossing"
#define DEBUG_TYPE "coro-frame"		#define DEBUG_TYPE "coro-frame"

// We build up the list of spills for every case where a use is separated
// from the definition by a suspend point.

static const unsigned InvalidFieldIndex = ~0U;

namespace {		namespace {
class Spill {		class FrameTypeBuilder;
Value *Def = nullptr;		// Mapping from the to-be-spilled value to all the users that need reload.
Instruction *User = nullptr;		using SpillInfo = SmallMapVector<Value , SmallVector<Instruction , 2>, 8>;
unsigned FieldNo = InvalidFieldIndex;		struct FrameDataInfo {
		// All the values (that are not allocas) that needs to be spilled to the
public:		// frame.
Spill(Value Def, llvm::User U) : Def(Def), User(cast<Instruction>(U)) {}		SpillInfo Spills;
		// Allocas contains all values defined as allocas that need to live in the
		// frame.
		SmallVector<AllocaInst *, 8> Allocas;

Value *def() const { return Def; }		SmallVector<Value *, 8> getAllDefs() const {
Instruction *user() const { return User; }		SmallVector<Value *, 8> Defs;
BasicBlock *userBlock() const { return User->getParent(); }		for (const auto &P : Spills)
		Defs.push_back(P.first);
// Note that field index is stored in the first SpillEntry for a particular		for (auto *A : Allocas)
// definition. Subsequent mentions of a defintion do not have fieldNo		Defs.push_back(A);
// assigned. This works out fine as the users of Spills capture the info about		return Defs;
// the definition the first time they encounter it. Consider refactoring		}
// SpillInfo into two arrays to normalize the spill representation.
unsigned fieldIndex() const {		uint32_t getFieldIndex(Value *V) const {
assert(FieldNo != InvalidFieldIndex && "Accessing unassigned field");		auto Itr = FieldIndexMap.find(V);
return FieldNo;		assert(Itr != FieldIndexMap.end() &&
}		"Value does not have a frame field index");
void setFieldIndex(unsigned FieldNumber) {		return Itr->second;
assert(FieldNo == InvalidFieldIndex && "Reassigning field number");		}
FieldNo = FieldNumber;
		void setFieldIndex(Value *V, uint32_t Index) {
		assert((LayoutIndexUpdateStarted \|\| FieldIndexMap.count(V) == 0) &&
		"Cannot set the index for the same field twice.");
		FieldIndexMap[V] = Index;
}		}

		// Remap the index of every field in the frame, using the final layout index.
		void updateLayoutIndex(FrameTypeBuilder &B);

		private:
		// LayoutIndexUpdateStarted is used to avoid updating the index of any field
		// twice by mistake.
		bool LayoutIndexUpdateStarted = false;
		// Map from values to their slot indexes on the frame. They will be first set
		// with their original insertion field index. After the frame is built, their
		// indexes will be updated into the final layout index.
		DenseMap<Value *, uint32_t> FieldIndexMap;
};		};
} // namespace		} // namespace

// Note that there may be more than one record with the same value of Def in
// the SpillInfo vector.
using SpillInfo = SmallVector<Spill, 8>;

#ifndef NDEBUG		#ifndef NDEBUG
static void dump(StringRef Title, SpillInfo const &Spills) {		static void dumpSpills(StringRef Title, const SpillInfo &Spills) {
dbgs() << "------------- " << Title << "--------------\n";		dbgs() << "------------- " << Title << "--------------\n";
Value *CurrentValue = nullptr;		for (const auto &E : Spills) {
for (auto const &E : Spills) {		E.first->dump();
if (CurrentValue != E.def()) {
CurrentValue = E.def();
CurrentValue->dump();
}
dbgs() << " user: ";		dbgs() << " user: ";
E.user()->dump();		for (auto *I : E.second)
		I->dump();
}		}
}		}

		static void dumpAllocas(const SmallVectorImpl<AllocaInst *> &Allocas) {
		dbgs() << "------------- Allocas --------------\n";
		for (auto *A : Allocas)
		A->dump();
		}
#endif		#endif

namespace {		namespace {
		using FieldIDType = size_t;
// We cannot rely solely on natural alignment of a type when building a		// We cannot rely solely on natural alignment of a type when building a
// coroutine frame and if the alignment specified on the Alloca instruction		// coroutine frame and if the alignment specified on the Alloca instruction
// differs from the natural alignment of the alloca type we will need to insert		// differs from the natural alignment of the alloca type we will need to insert
// padding.		// padding.
class FrameTypeBuilder {		class FrameTypeBuilder {
public:
using ForSpillType = SmallVector<Spill *, 8>;

private:		private:
struct Field {		struct Field {
uint64_t Size;		uint64_t Size;
uint64_t Offset;		uint64_t Offset;
ForSpillType ForSpill;
Type *Ty;		Type *Ty;
unsigned FieldIndex;		FieldIDType LayoutFieldIndex;
Align Alignment;		Align Alignment;
Align TyAlignment;		Align TyAlignment;
};		};

const DataLayout &DL;		const DataLayout &DL;
LLVMContext &Context;		LLVMContext &Context;
uint64_t StructSize = 0;		uint64_t StructSize = 0;
Align StructAlign;		Align StructAlign;
bool IsFinished = false;		bool IsFinished = false;

SmallVector<Field, 8> Fields;		SmallVector<Field, 8> Fields;
DenseMap<Value*, unsigned> FieldIndexByKey;		DenseMap<Value*, unsigned> FieldIndexByKey;

public:		public:
FrameTypeBuilder(LLVMContext &Context, DataLayout const &DL)		FrameTypeBuilder(LLVMContext &Context, DataLayout const &DL)
: DL(DL), Context(Context) {}		: DL(DL), Context(Context) {}

class FieldId {
size_t Value;
explicit FieldId(size_t Value) : Value(Value) {}

friend class FrameTypeBuilder;
};

/// Add a field to this structure for the storage of an `alloca`		/// Add a field to this structure for the storage of an `alloca`
/// instruction.		/// instruction.
FieldId addFieldForAlloca(AllocaInst *AI, ForSpillType ForSpill = {},		LLVM_NODISCARD FieldIDType addFieldForAlloca(AllocaInst *AI,
bool IsHeader = false) {		bool IsHeader = false) {
Type *Ty = AI->getAllocatedType();		Type *Ty = AI->getAllocatedType();

// Make an array type if this is a static array allocation.		// Make an array type if this is a static array allocation.
if (AI->isArrayAllocation()) {		if (AI->isArrayAllocation()) {
if (auto *CI = dyn_cast<ConstantInt>(AI->getArraySize()))		if (auto *CI = dyn_cast<ConstantInt>(AI->getArraySize()))
Ty = ArrayType::get(Ty, CI->getValue().getZExtValue());		Ty = ArrayType::get(Ty, CI->getValue().getZExtValue());
else		else
report_fatal_error("Coroutines cannot handle non static allocas yet");		report_fatal_error("Coroutines cannot handle non static allocas yet");
}		}

return addField(Ty, AI->getAlign(), ForSpill, IsHeader);		return addField(Ty, AI->getAlign(), IsHeader);
}		}

/// We want to put the allocas whose lifetime-ranges are not overlapped		/// We want to put the allocas whose lifetime-ranges are not overlapped
/// into one slot of coroutine frame.		/// into one slot of coroutine frame.
/// Consider the example at:https://bugs.llvm.org/show_bug.cgi?id=45566		/// Consider the example at:https://bugs.llvm.org/show_bug.cgi?id=45566
///		///
/// cppcoro::task<void> alternative_paths(bool cond) {		/// cppcoro::task<void> alternative_paths(bool cond) {
/// if (cond) {		/// if (cond) {
Show All 13 Lines	public:
/// This function use StackLifetime algorithm to partition the AllocaInsts in		/// This function use StackLifetime algorithm to partition the AllocaInsts in
/// Spills to non-overlapped sets in order to put Alloca in the same		/// Spills to non-overlapped sets in order to put Alloca in the same
/// non-overlapped set into the same slot in the Coroutine Frame. Then add		/// non-overlapped set into the same slot in the Coroutine Frame. Then add
/// field for the allocas in the same non-overlapped set by using the largest		/// field for the allocas in the same non-overlapped set by using the largest
/// type as the field type.		/// type as the field type.
///		///
/// Side Effects: Because We sort the allocas, the order of allocas in the		/// Side Effects: Because We sort the allocas, the order of allocas in the
/// frame may be different with the order in the source code.		/// frame may be different with the order in the source code.
void addFieldForAllocas(const Function &F, SpillInfo &Spills,		void addFieldForAllocas(const Function &F, FrameDataInfo &FrameData,
coro::Shape &Shape);		coro::Shape &Shape);

/// Add a field to this structure.		/// Add a field to this structure.
FieldId addField(Type *Ty, MaybeAlign FieldAlignment,		LLVM_NODISCARD FieldIDType addField(Type *Ty, MaybeAlign FieldAlignment,
ForSpillType ForSpill = {}, bool IsHeader = false) {		bool IsHeader = false) {
assert(!IsFinished && "adding fields to a finished builder");		assert(!IsFinished && "adding fields to a finished builder");
assert(Ty && "must provide a type for a field");		assert(Ty && "must provide a type for a field");

// The field size is always the alloc size of the type.		// The field size is always the alloc size of the type.
uint64_t FieldSize = DL.getTypeAllocSize(Ty);		uint64_t FieldSize = DL.getTypeAllocSize(Ty);

// The field alignment might not be the type alignment, but we need		// The field alignment might not be the type alignment, but we need
// to remember the type alignment anyway to build the type.		// to remember the type alignment anyway to build the type.
Align TyAlignment = DL.getABITypeAlign(Ty);		Align TyAlignment = DL.getABITypeAlign(Ty);
if (!FieldAlignment) FieldAlignment = TyAlignment;		if (!FieldAlignment) FieldAlignment = TyAlignment;

// Lay out header fields immediately.		// Lay out header fields immediately.
uint64_t Offset;		uint64_t Offset;
if (IsHeader) {		if (IsHeader) {
Offset = alignTo(StructSize, FieldAlignment);		Offset = alignTo(StructSize, FieldAlignment);
StructSize = Offset + FieldSize;		StructSize = Offset + FieldSize;

// Everything else has a flexible offset.		// Everything else has a flexible offset.
} else {		} else {
Offset = OptimizedStructLayoutField::FlexibleOffset;		Offset = OptimizedStructLayoutField::FlexibleOffset;
}		}

Fields.push_back({FieldSize, Offset, ForSpill, Ty, 0,		Fields.push_back({FieldSize, Offset, Ty, 0, *FieldAlignment, TyAlignment});
*FieldAlignment, TyAlignment});		return Fields.size() - 1;
return FieldId(Fields.size() - 1);
}		}

/// Finish the layout and set the body on the given type.		/// Finish the layout and set the body on the given type.
void finish(StructType *Ty);		void finish(StructType *Ty);

uint64_t getStructSize() const {		uint64_t getStructSize() const {
assert(IsFinished && "not yet finished!");		assert(IsFinished && "not yet finished!");
return StructSize;		return StructSize;
}		}

Align getStructAlign() const {		Align getStructAlign() const {
assert(IsFinished && "not yet finished!");		assert(IsFinished && "not yet finished!");
return StructAlign;		return StructAlign;
}		}

unsigned getFieldIndex(FieldId Id) const {		FieldIDType getLayoutFieldIndex(FieldIDType Id) const {
assert(IsFinished && "not yet finished!");		assert(IsFinished && "not yet finished!");
return Fields[Id.Value].FieldIndex;		return Fields[Id].LayoutFieldIndex;
}		}
};		};
} // namespace		} // namespace

void FrameTypeBuilder::addFieldForAllocas(const Function &F, SpillInfo &Spills,		void FrameDataInfo::updateLayoutIndex(FrameTypeBuilder &B) {
		auto Updater = [&](Value *I) {
		setFieldIndex(I, B.getLayoutFieldIndex(getFieldIndex(I)));
		};
		LayoutIndexUpdateStarted = true;
		for (auto &S : Spills)
		Updater(S.first);
		for (auto *A : Allocas)
		Updater(A);
		LayoutIndexUpdateStarted = false;
		}

		void FrameTypeBuilder::addFieldForAllocas(const Function &F,
		FrameDataInfo &FrameData,
coro::Shape &Shape) {		coro::Shape &Shape) {
DenseMap<AllocaInst *, unsigned int> AllocaIndex;		DenseMap<AllocaInst *, unsigned int> AllocaIndex;
SmallVector<AllocaInst *, 8> Allocas;
DenseMap<AllocaInst , Spill > SpillOfAllocas;
using AllocaSetType = SmallVector<AllocaInst *, 4>;		using AllocaSetType = SmallVector<AllocaInst *, 4>;
SmallVector<AllocaSetType, 4> NonOverlapedAllocas;		SmallVector<AllocaSetType, 4> NonOverlapedAllocas;

// We need to add field for allocas at the end of this function. However, this		// We need to add field for allocas at the end of this function. However, this
// function has multiple exits, so we use this helper to avoid redundant code.		// function has multiple exits, so we use this helper to avoid redundant code.
struct RTTIHelper {		struct RTTIHelper {
std::function<void()> func;		std::function<void()> func;
RTTIHelper(std::function<void()> &&func) : func(func) {}		RTTIHelper(std::function<void()> &&func) : func(func) {}
~RTTIHelper() { func(); }		~RTTIHelper() { func(); }
} Helper([&]() {		} Helper([&]() {
for (auto AllocaSet : NonOverlapedAllocas) {		for (auto AllocaList : NonOverlapedAllocas) {
ForSpillType ForSpills;		auto LargestAI = AllocaList.begin();
for (auto Alloca : AllocaSet)		FieldIDType Id = addFieldForAlloca(LargestAI);
ForSpills.push_back(SpillOfAllocas[Alloca]);		for (auto *Alloca : AllocaList)
auto LargestAI = AllocaSet.begin();		FrameData.setFieldIndex(Alloca, Id);
addFieldForAlloca(LargestAI, ForSpills);
}		}
});		});

for (auto &Spill : Spills)
if (AllocaInst *AI = dyn_cast<AllocaInst>(Spill.def()))
if (find(Allocas, AI) == Allocas.end()) {
SpillOfAllocas[AI] = &Spill;
Allocas.emplace_back(AI);
}

if (!Shape.ReuseFrameSlot && !EnableReuseStorageInFrame) {		if (!Shape.ReuseFrameSlot && !EnableReuseStorageInFrame) {
for (auto Alloca : Allocas) {		for (auto *Alloca : FrameData.Allocas) {
AllocaIndex[Alloca] = NonOverlapedAllocas.size();		AllocaIndex[Alloca] = NonOverlapedAllocas.size();
NonOverlapedAllocas.emplace_back(AllocaSetType(1, Alloca));		NonOverlapedAllocas.emplace_back(AllocaSetType(1, Alloca));
}		}
return;		return;
}		}

// Because there are pathes from the lifetime.start to coro.end		// Because there are pathes from the lifetime.start to coro.end
// for each alloca, the liferanges for every alloca is overlaped		// for each alloca, the liferanges for every alloca is overlaped
Show All 13 Lines	for (auto U : CoroSuspendInst->users()) {
if (auto *ConstSWI = dyn_cast<SwitchInst>(U)) {		if (auto *ConstSWI = dyn_cast<SwitchInst>(U)) {
auto SWI = const_cast<SwitchInst >(ConstSWI);		auto SWI = const_cast<SwitchInst >(ConstSWI);
DefaultSuspendDest[SWI] = SWI->getDefaultDest();		DefaultSuspendDest[SWI] = SWI->getDefaultDest();
SWI->setDefaultDest(SWI->getSuccessor(1));		SWI->setDefaultDest(SWI->getSuccessor(1));
}		}
}		}
}		}

StackLifetime StackLifetimeAnalyzer(F, Allocas,		StackLifetime StackLifetimeAnalyzer(F, FrameData.Allocas,
StackLifetime::LivenessType::May);		StackLifetime::LivenessType::May);
StackLifetimeAnalyzer.run();		StackLifetimeAnalyzer.run();
auto IsAllocaInferenre = [&](const AllocaInst AI1, const AllocaInst AI2) {		auto IsAllocaInferenre = [&](const AllocaInst AI1, const AllocaInst AI2) {
return StackLifetimeAnalyzer.getLiveRange(AI1).overlaps(		return StackLifetimeAnalyzer.getLiveRange(AI1).overlaps(
StackLifetimeAnalyzer.getLiveRange(AI2));		StackLifetimeAnalyzer.getLiveRange(AI2));
};		};
auto GetAllocaSize = [&](const AllocaInst *AI) {		auto GetAllocaSize = [&](const AllocaInst *AI) {
Optional<uint64_t> RetSize = AI->getAllocationSizeInBits(DL);		Optional<uint64_t> RetSize = AI->getAllocationSizeInBits(DL);
assert(RetSize && "We can't handle scalable type now.\n");		assert(RetSize && "We can't handle scalable type now.\n");
return RetSize.getValue();		return RetSize.getValue();
};		};
// Put larger allocas in the front. So the larger allocas have higher		// Put larger allocas in the front. So the larger allocas have higher
// priority to merge, which can save more space potentially. Also each		// priority to merge, which can save more space potentially. Also each
// AllocaSet would be ordered. So we can get the largest Alloca in one		// AllocaSet would be ordered. So we can get the largest Alloca in one
// AllocaSet easily.		// AllocaSet easily.
sort(Allocas, [&](auto Iter1, auto Iter2) {		sort(FrameData.Allocas, [&](auto Iter1, auto Iter2) {
return GetAllocaSize(Iter1) > GetAllocaSize(Iter2);		return GetAllocaSize(Iter1) > GetAllocaSize(Iter2);
});		});
for (auto Alloca : Allocas) {		for (auto *Alloca : FrameData.Allocas) {
bool Merged = false;		bool Merged = false;
// Try to find if the Alloca is not inferenced with any existing		// Try to find if the Alloca is not inferenced with any existing
// NonOverlappedAllocaSet. If it is true, insert the alloca to that		// NonOverlappedAllocaSet. If it is true, insert the alloca to that
// NonOverlappedAllocaSet.		// NonOverlappedAllocaSet.
for (auto &AllocaSet : NonOverlapedAllocas) {		for (auto &AllocaSet : NonOverlapedAllocas) {
assert(!AllocaSet.empty() && "Processing Alloca Set is not empty.\n");		assert(!AllocaSet.empty() && "Processing Alloca Set is not empty.\n");
bool CouldMerge = none_of(AllocaSet, [&](auto Iter) {		bool CouldMerge = none_of(AllocaSet, [&](auto Iter) {
return IsAllocaInferenre(Alloca, Iter);		return IsAllocaInferenre(Alloca, Iter);
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines	for (auto &LayoutField : LayoutFields) {
// get from aligning to the field type's natural alignment.		// get from aligning to the field type's natural alignment.
assert(Offset >= LastOffset);		assert(Offset >= LastOffset);
if (Offset != LastOffset) {		if (Offset != LastOffset) {
if (Packed \|\| alignTo(LastOffset, F.TyAlignment) != Offset)		if (Packed \|\| alignTo(LastOffset, F.TyAlignment) != Offset)
FieldTypes.push_back(ArrayType::get(Type::getInt8Ty(Context),		FieldTypes.push_back(ArrayType::get(Type::getInt8Ty(Context),
Offset - LastOffset));		Offset - LastOffset));
}		}

// Record the layout information into both the Field and the
// original Spill, if there is one.
F.Offset = Offset;		F.Offset = Offset;
F.FieldIndex = FieldTypes.size();		F.LayoutFieldIndex = FieldTypes.size();
for (auto Spill : F.ForSpill) {
Spill->setFieldIndex(F.FieldIndex);
}

FieldTypes.push_back(F.Ty);		FieldTypes.push_back(F.Ty);
LastOffset = Offset + F.Size;		LastOffset = Offset + F.Size;
}		}

Ty->setBody(FieldTypes, Packed);		Ty->setBody(FieldTypes, Packed);

#ifndef NDEBUG		#ifndef NDEBUG
// Check that the IR layout matches the offsets we expect.		// Check that the IR layout matches the offsets we expect.
auto Layout = DL.getStructLayout(Ty);		auto Layout = DL.getStructLayout(Ty);
for (auto &F : Fields) {		for (auto &F : Fields) {
assert(Ty->getElementType(F.FieldIndex) == F.Ty);		assert(Ty->getElementType(F.LayoutFieldIndex) == F.Ty);
assert(Layout->getElementOffset(F.FieldIndex) == F.Offset);		assert(Layout->getElementOffset(F.LayoutFieldIndex) == F.Offset);
}		}
#endif		#endif

IsFinished = true;		IsFinished = true;
}		}

// Build a struct that will keep state for an active coroutine.		// Build a struct that will keep state for an active coroutine.
// struct f.frame {		// struct f.frame {
// ResumeFnTy ResumeFnAddr;		// ResumeFnTy ResumeFnAddr;
// ResumeFnTy DestroyFnAddr;		// ResumeFnTy DestroyFnAddr;
// int ResumeIndex;		// int ResumeIndex;
// ... promise (if present) ...		// ... promise (if present) ...
// ... spills ...		// ... spills ...
// };		// };
static StructType *buildFrameType(Function &F, coro::Shape &Shape,		static StructType *buildFrameType(Function &F, coro::Shape &Shape,
SpillInfo &Spills) {		FrameDataInfo &FrameData) {
LLVMContext &C = F.getContext();		LLVMContext &C = F.getContext();
const DataLayout &DL = F.getParent()->getDataLayout();		const DataLayout &DL = F.getParent()->getDataLayout();
StructType *FrameTy = [&] {		StructType *FrameTy = [&] {
SmallString<32> Name(F.getName());		SmallString<32> Name(F.getName());
Name.append(".Frame");		Name.append(".Frame");
return StructType::create(C, Name);		return StructType::create(C, Name);
}();		}();

FrameTypeBuilder B(C, DL);		FrameTypeBuilder B(C, DL);

AllocaInst *PromiseAlloca = Shape.getPromiseAlloca();		AllocaInst *PromiseAlloca = Shape.getPromiseAlloca();
Optional<FrameTypeBuilder::FieldId> PromiseFieldId;		Optional<FieldIDType> SwitchIndexFieldId;
Optional<FrameTypeBuilder::FieldId> SwitchIndexFieldId;

if (Shape.ABI == coro::ABI::Switch) {		if (Shape.ABI == coro::ABI::Switch) {
auto *FramePtrTy = FrameTy->getPointerTo();		auto *FramePtrTy = FrameTy->getPointerTo();
auto *FnTy = FunctionType::get(Type::getVoidTy(C), FramePtrTy,		auto *FnTy = FunctionType::get(Type::getVoidTy(C), FramePtrTy,
/IsVarArg=/false);		/IsVarArg=/false);
auto *FnPtrTy = FnTy->getPointerTo();		auto *FnPtrTy = FnTy->getPointerTo();

// Add header fields for the resume and destroy functions.		// Add header fields for the resume and destroy functions.
// We can rely on these being perfectly packed.		// We can rely on these being perfectly packed.
B.addField(FnPtrTy, None, {}, /header/ true);		(void)B.addField(FnPtrTy, None, /header/ true);
B.addField(FnPtrTy, None, {}, /header/ true);		(void)B.addField(FnPtrTy, None, /header/ true);

// Add a header field for the promise if there is one.		// PromiseAlloca field needs to be explicitly added here because it's
if (PromiseAlloca) {		// a header field with a fixed offset based on its alignment. Hence it
		rjmccallUnsubmitted Not Done Reply Inline Actions Well, it needs to be explicitly added here because it's a header field which has to have a fixed offset based on its alignment, which is why it's not in `FrameData.Allocas`. rjmccall: Well, it needs to be explicitly added here because it's a header field which has to have a…
PromiseFieldId = B.addFieldForAlloca(PromiseAlloca, {}, /header/ true);		// needs special handling and cannot be added to FrameData.Allocas.
}		if (PromiseAlloca)
		FrameData.setFieldIndex(
		PromiseAlloca, B.addFieldForAlloca(PromiseAlloca, /header/ true));

// Add a field to store the suspend index. This doesn't need to		// Add a field to store the suspend index. This doesn't need to
// be in the header.		// be in the header.
unsigned IndexBits = std::max(1U, Log2_64_Ceil(Shape.CoroSuspends.size()));		unsigned IndexBits = std::max(1U, Log2_64_Ceil(Shape.CoroSuspends.size()));
Type *IndexType = Type::getIntNTy(C, IndexBits);		Type *IndexType = Type::getIntNTy(C, IndexBits);

SwitchIndexFieldId = B.addField(IndexType, None);		SwitchIndexFieldId = B.addField(IndexType, None);
} else {		} else {
assert(PromiseAlloca == nullptr && "lowering doesn't support promises");		assert(PromiseAlloca == nullptr && "lowering doesn't support promises");
}		}

// Because multiple allocas may own the same field slot,		// Because multiple allocas may own the same field slot,
// we add allocas to field here.		// we add allocas to field here.
B.addFieldForAllocas(F, Spills, Shape);		B.addFieldForAllocas(F, FrameData, Shape);
Value *CurrentDef = nullptr;		// Create an entry for every spilled value.
// Create an entry for every spilled value which is not an AllocaInst.		for (auto &S : FrameData.Spills) {
for (auto &S : Spills) {		FieldIDType Id = B.addField(S.first->getType(), None);
// We can have multiple entries in Spills for a single value, but		FrameData.setFieldIndex(S.first, Id);
// they should form a contiguous run. Ignore all but the first.
if (CurrentDef == S.def())
continue;

CurrentDef = S.def();

assert(CurrentDef != PromiseAlloca &&
"recorded spill use of promise alloca?");

if (!isa<AllocaInst>(CurrentDef)) {
Type *Ty = CurrentDef->getType();
B.addField(Ty, None, {&S});
}
}		}

B.finish(FrameTy);		B.finish(FrameTy);
		FrameData.updateLayoutIndex(B);
Shape.FrameAlign = B.getStructAlign();		Shape.FrameAlign = B.getStructAlign();
Shape.FrameSize = B.getStructSize();		Shape.FrameSize = B.getStructSize();

switch (Shape.ABI) {		switch (Shape.ABI) {
// In the switch ABI, remember the field indices for the promise and
// switch-index fields.
case coro::ABI::Switch:		case coro::ABI::Switch:
		// In the switch ABI, remember the switch-index field.
		rjmccallUnsubmitted Not Done Reply Inline Actions It's just "index" now. rjmccall: It's just "index" now.
Shape.SwitchLowering.IndexField =		Shape.SwitchLowering.IndexField =
B.getFieldIndex(*SwitchIndexFieldId);		B.getLayoutFieldIndex(*SwitchIndexFieldId);
Shape.SwitchLowering.PromiseField =
(PromiseAlloca ? B.getFieldIndex(*PromiseFieldId) : 0);

// Also round the frame size up to a multiple of its alignment, as is		// Also round the frame size up to a multiple of its alignment, as is
// generally expected in C/C++.		// generally expected in C/C++.
Shape.FrameSize = alignTo(Shape.FrameSize, Shape.FrameAlign);		Shape.FrameSize = alignTo(Shape.FrameSize, Shape.FrameAlign);
break;		break;

// In the retcon ABI, remember whether the frame is inline in the storage.		// In the retcon ABI, remember whether the frame is inline in the storage.
case coro::ABI::Retcon:		case coro::ABI::Retcon:
▲ Show 20 Lines • Show All 151 Lines • ▼ Show 20 Lines
// AllocaSpillBB:		// AllocaSpillBB:
// ; geps corresponding to allocas that were moved to coroutine frame		// ; geps corresponding to allocas that were moved to coroutine frame
// br label PostSpill		// br label PostSpill
//		//
// PostSpill:		// PostSpill:
// whatever		// whatever
//		//
//		//
static Instruction *insertSpills(const SpillInfo &Spills, coro::Shape &Shape) {		static Instruction *insertSpills(const FrameDataInfo &FrameData,
		coro::Shape &Shape) {
auto *CB = Shape.CoroBegin;		auto *CB = Shape.CoroBegin;
LLVMContext &C = CB->getContext();		LLVMContext &C = CB->getContext();
IRBuilder<> Builder(CB->getNextNode());		IRBuilder<> Builder(CB->getNextNode());
StructType *FrameTy = Shape.FrameTy;		StructType *FrameTy = Shape.FrameTy;
PointerType *FramePtrTy = FrameTy->getPointerTo();		PointerType *FramePtrTy = FrameTy->getPointerTo();
auto *FramePtr =		auto *FramePtr =
cast<Instruction>(Builder.CreateBitCast(CB, FramePtrTy, "FramePtr"));		cast<Instruction>(Builder.CreateBitCast(CB, FramePtrTy, "FramePtr"));
DominatorTree DT(*CB->getFunction());		DominatorTree DT(*CB->getFunction());

Value *CurrentValue = nullptr;
BasicBlock *CurrentBlock = nullptr;
Value *CurrentReload = nullptr;

// Proper field number will be read from field definition.
unsigned Index = InvalidFieldIndex;

// We need to keep track of any allocas that need "spilling"
// since they will live in the coroutine frame now, all access to them
// need to be changed, not just the access across suspend points
// we remember allocas and their indices to be handled once we processed
// all the spills.
SmallVector<std::pair<AllocaInst *, unsigned>, 4> Allocas;

// Promise alloca (if present) doesn't show in the spills and has a
// special field number.
if (auto *PromiseAlloca = Shape.getPromiseAlloca()) {
assert(Shape.ABI == coro::ABI::Switch);
Allocas.emplace_back(PromiseAlloca, Shape.getPromiseField());
}

// Create a GEP with the given index into the coroutine frame for the original		// Create a GEP with the given index into the coroutine frame for the original
// value Orig. Appends an extra 0 index for array-allocas, preserving the		// value Orig. Appends an extra 0 index for array-allocas, preserving the
// original type.		// original type.
auto GetFramePointer = [&](uint32_t Index, Value Orig) -> Value {		auto GetFramePointer = [&](Value Orig) -> Value {
		FieldIDType Index = FrameData.getFieldIndex(Orig);
SmallVector<Value *, 3> Indices = {		SmallVector<Value *, 3> Indices = {
ConstantInt::get(Type::getInt32Ty(C), 0),		ConstantInt::get(Type::getInt32Ty(C), 0),
ConstantInt::get(Type::getInt32Ty(C), Index),		ConstantInt::get(Type::getInt32Ty(C), Index),
};		};

if (auto *AI = dyn_cast<AllocaInst>(Orig)) {		if (auto *AI = dyn_cast<AllocaInst>(Orig)) {
if (auto *CI = dyn_cast<ConstantInt>(AI->getArraySize())) {		if (auto *CI = dyn_cast<ConstantInt>(AI->getArraySize())) {
auto Count = CI->getValue().getZExtValue();		auto Count = CI->getValue().getZExtValue();
Show All 13 Lines	if (isa<AllocaInst>(Orig)) {
// AllocaInst. So we cast the GEP to the type of AllocaInst.		// AllocaInst. So we cast the GEP to the type of AllocaInst.
if (GEP->getResultElementType() != Orig->getType())		if (GEP->getResultElementType() != Orig->getType())
return Builder.CreateBitCast(GEP, Orig->getType(),		return Builder.CreateBitCast(GEP, Orig->getType(),
Orig->getName() + Twine(".cast"));		Orig->getName() + Twine(".cast"));
}		}
return GEP;		return GEP;
};		};

// Create a load instruction to reload the spilled value from the coroutine		for (auto const &E : FrameData.Spills) {
// frame. Populates the Value pointer reference provided with the frame GEP.		Value *Def = E.first;
auto CreateReload = [&](Instruction InsertBefore, Value &G) {		// Create a store instruction storing the value into the
assert(Index != InvalidFieldIndex && "accessing unassigned field number");
Builder.SetInsertPoint(InsertBefore);

G = GetFramePointer(Index, CurrentValue);
G->setName(CurrentValue->getName() + Twine(".reload.addr"));

return isa<AllocaInst>(CurrentValue)
? G
: Builder.CreateLoad(FrameTy->getElementType(Index), G,
CurrentValue->getName() + Twine(".reload"));
};

Value GEP = nullptr, CurrentGEP = nullptr;
for (auto const &E : Spills) {
// If we have not seen the value, generate a spill.
if (CurrentValue != E.def()) {
CurrentValue = E.def();
CurrentBlock = nullptr;
CurrentReload = nullptr;

Index = E.fieldIndex();

if (auto *AI = dyn_cast<AllocaInst>(CurrentValue)) {
// Spilled AllocaInst will be replaced with GEP from the coroutine frame
// there is no spill required.
Allocas.emplace_back(AI, Index);
if (!AI->isStaticAlloca())
report_fatal_error("Coroutines cannot handle non static allocas yet");
} else {
// Otherwise, create a store instruction storing the value into the
// coroutine frame.		// coroutine frame.

Instruction *InsertPt = nullptr;		Instruction *InsertPt = nullptr;
if (auto Arg = dyn_cast<Argument>(CurrentValue)) {		if (auto *Arg = dyn_cast<Argument>(Def)) {
// For arguments, we will place the store instruction right after		// For arguments, we will place the store instruction right after
// the coroutine frame pointer instruction, i.e. bitcast of		// the coroutine frame pointer instruction, i.e. bitcast of
// coro.begin from i8* to %f.frame*.		// coro.begin from i8* to %f.frame*.
InsertPt = FramePtr->getNextNode();		InsertPt = FramePtr->getNextNode();

// If we're spilling an Argument, make sure we clear 'nocapture'		// If we're spilling an Argument, make sure we clear 'nocapture'
// from the coroutine function.		// from the coroutine function.
Arg->getParent()->removeParamAttr(Arg->getArgNo(),		Arg->getParent()->removeParamAttr(Arg->getArgNo(), Attribute::NoCapture);
Attribute::NoCapture);

} else if (auto CSI = dyn_cast<AnyCoroSuspendInst>(CurrentValue)) {		} else if (auto *CSI = dyn_cast<AnyCoroSuspendInst>(Def)) {
// Don't spill immediately after a suspend; splitting assumes		// Don't spill immediately after a suspend; splitting assumes
// that the suspend will be followed by a branch.		// that the suspend will be followed by a branch.
InsertPt = CSI->getParent()->getSingleSuccessor()->getFirstNonPHI();		InsertPt = CSI->getParent()->getSingleSuccessor()->getFirstNonPHI();
} else {		} else {
auto *I = cast<Instruction>(CurrentValue);		auto *I = cast<Instruction>(Def);
if (!DT.dominates(CB, I)) {		if (!DT.dominates(CB, I)) {
// If it is not dominated by CoroBegin, then spill should be		// If it is not dominated by CoroBegin, then spill should be
// inserted immediately after CoroFrame is computed.		// inserted immediately after CoroFrame is computed.
InsertPt = FramePtr->getNextNode();		InsertPt = FramePtr->getNextNode();
} else if (auto *II = dyn_cast<InvokeInst>(I)) {		} else if (auto *II = dyn_cast<InvokeInst>(I)) {
// If we are spilling the result of the invoke instruction, split		// If we are spilling the result of the invoke instruction, split
// the normal edge and insert the spill in the new block.		// the normal edge and insert the spill in the new block.
auto *NewBB = SplitEdge(II->getParent(), II->getNormalDest());		auto *NewBB = SplitEdge(II->getParent(), II->getNormalDest());
InsertPt = NewBB->getTerminator();		InsertPt = NewBB->getTerminator();
} else if (isa<PHINode>(I)) {		} else if (isa<PHINode>(I)) {
// Skip the PHINodes and EH pads instructions.		// Skip the PHINodes and EH pads instructions.
BasicBlock *DefBlock = I->getParent();		BasicBlock *DefBlock = I->getParent();
if (auto *CSI =		if (auto *CSI = dyn_cast<CatchSwitchInst>(DefBlock->getTerminator()))
dyn_cast<CatchSwitchInst>(DefBlock->getTerminator()))
InsertPt = splitBeforeCatchSwitch(CSI);		InsertPt = splitBeforeCatchSwitch(CSI);
else		else
InsertPt = &*DefBlock->getFirstInsertionPt();		InsertPt = &*DefBlock->getFirstInsertionPt();
} else {		} else {
assert(!I->isTerminator() && "unexpected terminator");		assert(!I->isTerminator() && "unexpected terminator");
// For all other values, the spill is placed immediately after		// For all other values, the spill is placed immediately after
// the definition.		// the definition.
InsertPt = I->getNextNode();		InsertPt = I->getNextNode();
}		}
}		}

		auto Index = FrameData.getFieldIndex(Def);
Builder.SetInsertPoint(InsertPt);		Builder.SetInsertPoint(InsertPt);
auto *G = Builder.CreateConstInBoundsGEP2_32(		auto *G = Builder.CreateConstInBoundsGEP2_32(
FrameTy, FramePtr, 0, Index,		FrameTy, FramePtr, 0, Index, Def->getName() + Twine(".spill.addr"));
CurrentValue->getName() + Twine(".spill.addr"));		Builder.CreateStore(Def, G);
Builder.CreateStore(CurrentValue, G);
}
}

// If we have not seen the use block, generate a reload in it.		BasicBlock *CurrentBlock = nullptr;
if (CurrentBlock != E.userBlock()) {		Value *CurrentReload = nullptr;
CurrentBlock = E.userBlock();		for (auto *U : E.second) {
CurrentReload = CreateReload(&*CurrentBlock->getFirstInsertionPt(), GEP);		// If we have not seen the use block, create a load instruction to reload
}		// the spilled value from the coroutine frame. Populates the Value pointer
		// reference provided with the frame GEP.
// If we have a single edge PHINode, remove it and replace it with a reload		if (CurrentBlock != U->getParent()) {
// from the coroutine frame. (We already took care of multi edge PHINodes		CurrentBlock = U->getParent();
// by rewriting them in the rewritePHIs function).		Builder.SetInsertPoint(&*CurrentBlock->getFirstInsertionPt());
if (auto *PN = dyn_cast<PHINode>(E.user())) {
assert(PN->getNumIncomingValues() == 1 && "unexpected number of incoming "		auto *GEP = GetFramePointer(E.first);
		GEP->setName(E.first->getName() + Twine(".reload.addr"));
		CurrentReload = Builder.CreateLoad(
		FrameTy->getElementType(FrameData.getFieldIndex(E.first)), GEP,
		E.first->getName() + Twine(".reload"));
		}

		// If we have a single edge PHINode, remove it and replace it with a
		// reload from the coroutine frame. (We already took care of multi edge
		// PHINodes by rewriting them in the rewritePHIs function).
		if (auto *PN = dyn_cast<PHINode>(U)) {
		assert(PN->getNumIncomingValues() == 1 &&
		"unexpected number of incoming "
"values in the PHINode");		"values in the PHINode");
PN->replaceAllUsesWith(CurrentReload);		PN->replaceAllUsesWith(CurrentReload);
PN->eraseFromParent();		PN->eraseFromParent();
continue;		continue;
}		}

// If we have not seen this GEP instruction, migrate any dbg.declare from		// Replace all uses of CurrentValue in the current instruction with
ChuanqiXuUnsubmitted Not Done Reply Inline Actions I'm confusing about the comment. It says it would migrate dbg.declare from alloca. But how could it know the `CurrentValue` must be alloca from the context ? ChuanqiXu: I'm confusing about the comment. It says it would migrate dbg.declare from alloca. But how…
lxfindAuthorUnsubmitted Done Reply Inline Actions I think dbg.declare instructions were all generated for alloca instructions. lxfind: I think dbg.declare instructions were all generated for alloca instructions.
// the alloca to it.		// reload.
if (CurrentGEP != GEP) {		U->replaceUsesOfWith(Def, CurrentReload);
CurrentGEP = GEP;
TinyPtrVector<DbgDeclareInst *> DIs = FindDbgDeclareUses(CurrentValue);
if (!DIs.empty())
DIBuilder(*CurrentBlock->getParent()->getParent(),
/AllowUnresolved/ false)
.insertDeclare(CurrentGEP, DIs.front()->getVariable(),
DIs.front()->getExpression(),
DIs.front()->getDebugLoc(), DIs.front());
}		}

// Replace all uses of CurrentValue in the current instruction with reload.
E.user()->replaceUsesOfWith(CurrentValue, CurrentReload);
}		}

BasicBlock *FramePtrBB = FramePtr->getParent();		BasicBlock *FramePtrBB = FramePtr->getParent();

auto SpillBlock =		auto SpillBlock =
FramePtrBB->splitBasicBlock(FramePtr->getNextNode(), "AllocaSpillBB");		FramePtrBB->splitBasicBlock(FramePtr->getNextNode(), "AllocaSpillBB");
SpillBlock->splitBasicBlock(&SpillBlock->front(), "PostSpill");		SpillBlock->splitBasicBlock(&SpillBlock->front(), "PostSpill");
Shape.AllocaSpillBlock = SpillBlock;		Shape.AllocaSpillBlock = SpillBlock;

// retcon and retcon.once lowering assumes all uses have been sunk.		// retcon and retcon.once lowering assumes all uses have been sunk.
if (Shape.ABI == coro::ABI::Retcon \|\| Shape.ABI == coro::ABI::RetconOnce) {		if (Shape.ABI == coro::ABI::Retcon \|\| Shape.ABI == coro::ABI::RetconOnce) {
// If we found any allocas, replace all of their remaining uses with Geps.		// If we found any allocas, replace all of their remaining uses with Geps.
Builder.SetInsertPoint(&SpillBlock->front());		Builder.SetInsertPoint(&SpillBlock->front());
for (auto &P : Allocas) {		for (const auto &P : FrameData.Allocas) {
auto *G = GetFramePointer(P.second, P.first);		auto *G = GetFramePointer(P);

// We are not using ReplaceInstWithInst(P.first, cast<Instruction>(G))		// We are not using ReplaceInstWithInst(P.first, cast<Instruction>(G))
// here, as we are changing location of the instruction.		// here, as we are changing location of the instruction.
G->takeName(P.first);		G->takeName(P);
P.first->replaceAllUsesWith(G);		P->replaceAllUsesWith(G);
P.first->eraseFromParent();		P->eraseFromParent();
}		}
return FramePtr;		return FramePtr;
}		}

// If we found any alloca, replace all of their remaining uses with GEP		// If we found any alloca, replace all of their remaining uses with GEP
// instructions. Because new dbg.declare have been created for these alloca,		// instructions. Because new dbg.declare have been created for these alloca,
// we also delete the original dbg.declare and replace other uses with undef.		// we also delete the original dbg.declare and replace other uses with undef.
// Note: We cannot replace the alloca with GEP instructions indiscriminately,		// Note: We cannot replace the alloca with GEP instructions indiscriminately,
// as some of the uses may not be dominated by CoroBegin.		// as some of the uses may not be dominated by CoroBegin.
bool MightNeedToCopy = false;		bool MightNeedToCopy = false;
Builder.SetInsertPoint(&Shape.AllocaSpillBlock->front());		Builder.SetInsertPoint(&Shape.AllocaSpillBlock->front());
SmallVector<Instruction *, 4> UsersToUpdate;		SmallVector<Instruction *, 4> UsersToUpdate;
for (auto &P : Allocas) {		for (AllocaInst *A : FrameData.Allocas) {
AllocaInst *const A = P.first;

for (auto *DI : FindDbgDeclareUses(A))
DI->eraseFromParent();
replaceDbgUsesWithUndef(A);

UsersToUpdate.clear();		UsersToUpdate.clear();
for (User *U : A->users()) {		for (User *U : A->users()) {
auto *I = cast<Instruction>(U);		auto *I = cast<Instruction>(U);
if (DT.dominates(CB, I))		if (DT.dominates(CB, I))
UsersToUpdate.push_back(I);		UsersToUpdate.push_back(I);
else		else
MightNeedToCopy = true;		MightNeedToCopy = true;
}		}
if (!UsersToUpdate.empty()) {		if (!UsersToUpdate.empty()) {
auto *G = GetFramePointer(P.second, A);		auto *G = GetFramePointer(A);
G->takeName(A);		G->setName(A->getName() + Twine(".reload.addr"));
		TinyPtrVector<DbgDeclareInst *> DIs = FindDbgDeclareUses(A);
		if (!DIs.empty())
		DIBuilder(*A->getModule(),
		/AllowUnresolved/ false)
		.insertDeclare(G, DIs.front()->getVariable(),
		DIs.front()->getExpression(),
		DIs.front()->getDebugLoc(), DIs.front());
		for (auto *DI : FindDbgDeclareUses(A))
		DI->eraseFromParent();
		replaceDbgUsesWithUndef(A);

for (Instruction *I : UsersToUpdate)		for (Instruction *I : UsersToUpdate)
I->replaceUsesOfWith(A, G);		I->replaceUsesOfWith(A, G);
}		}
}		}
// If we discovered such uses not dominated by CoroBegin, see if any of them		// If we discovered such uses not dominated by CoroBegin, see if any of them
// preceed coro begin and have instructions that can modify the		// preceed coro begin and have instructions that can modify the
// value of the alloca and therefore would require a copying the value into		// value of the alloca and therefore would require a copying the value into
// the spill slot in the coroutine frame.		// the spill slot in the coroutine frame.
if (MightNeedToCopy) {		if (MightNeedToCopy) {
Builder.SetInsertPoint(FramePtr->getNextNode());		Builder.SetInsertPoint(FramePtr->getNextNode());

for (auto &P : Allocas) {		for (AllocaInst *A : FrameData.Allocas) {
AllocaInst *const A = P.first;
AllocaUseVisitor Visitor(A->getModule()->getDataLayout(), DT, *CB);		AllocaUseVisitor Visitor(A->getModule()->getDataLayout(), DT, *CB);
auto PtrI = Visitor.visitPtr(*A);		auto PtrI = Visitor.visitPtr(*A);
assert(!PtrI.isAborted());		assert(!PtrI.isAborted());
if (PtrI.isEscaped()) {		if (PtrI.isEscaped()) {
// isEscaped really means potentially modified before CoroBegin.		// isEscaped really means potentially modified before CoroBegin.
if (A->isArrayAllocation())		if (A->isArrayAllocation())
report_fatal_error(		report_fatal_error(
"Coroutines cannot handle copying of array allocas yet");		"Coroutines cannot handle copying of array allocas yet");

auto *G = GetFramePointer(P.second, A);		auto *G = GetFramePointer(A);
auto *Value = Builder.CreateLoad(A->getAllocatedType(), A);		auto *Value = Builder.CreateLoad(A->getAllocatedType(), A);
Builder.CreateStore(Value, G);		Builder.CreateStore(Value, G);
}		}
// For each alias to Alloca created before CoroBegin but used after		// For each alias to Alloca created before CoroBegin but used after
// CoroBegin, we recreate them after CoroBegin by appplying the offset		// CoroBegin, we recreate them after CoroBegin by appplying the offset
// to the pointer in the frame.		// to the pointer in the frame.
for (const auto &Alias : Visitor.getAliases()) {		for (const auto &Alias : Visitor.getAliases()) {
auto *FramePtr = GetFramePointer(P.second, A);		auto *FramePtr = GetFramePointer(A);
auto *FramePtrRaw =		auto *FramePtrRaw =
Builder.CreateBitCast(FramePtr, Type::getInt8PtrTy(C));		Builder.CreateBitCast(FramePtr, Type::getInt8PtrTy(C));
auto *AliasPtr = Builder.CreateGEP(		auto *AliasPtr = Builder.CreateGEP(
FramePtrRaw, ConstantInt::get(Type::getInt64Ty(C), Alias.second));		FramePtrRaw, ConstantInt::get(Type::getInt64Ty(C), Alias.second));
auto *AliasPtrTyped =		auto *AliasPtrTyped =
Builder.CreateBitCast(AliasPtr, Alias.first->getType());		Builder.CreateBitCast(AliasPtr, Alias.first->getType());
Alias.first->replaceUsesWithIf(		Alias.first->replaceUsesWithIf(
AliasPtrTyped, [&](Use &U) { return DT.dominates(CB, U); });		AliasPtrTyped, [&](Use &U) { return DT.dominates(CB, U); });
▲ Show 20 Lines • Show All 263 Lines • ▼ Show 20 Lines
static bool isCoroutineStructureIntrinsic(Instruction &I) {		static bool isCoroutineStructureIntrinsic(Instruction &I) {
return isa<CoroIdInst>(&I) \|\| isa<CoroSaveInst>(&I) \|\|		return isa<CoroIdInst>(&I) \|\| isa<CoroSaveInst>(&I) \|\|
isa<CoroSuspendInst>(&I);		isa<CoroSuspendInst>(&I);
}		}

// For every use of the value that is across suspend point, recreate that value		// For every use of the value that is across suspend point, recreate that value
// after a suspend point.		// after a suspend point.
static void rewriteMaterializableInstructions(IRBuilder<> &IRB,		static void rewriteMaterializableInstructions(IRBuilder<> &IRB,
SpillInfo const &Spills) {		const SpillInfo &Spills) {
		for (const auto &E : Spills) {
		Value *Def = E.first;
BasicBlock *CurrentBlock = nullptr;		BasicBlock *CurrentBlock = nullptr;
Instruction *CurrentMaterialization = nullptr;		Instruction *CurrentMaterialization = nullptr;
Instruction *CurrentDef = nullptr;		for (Instruction *U : E.second) {

for (auto const &E : Spills) {
// If it is a new definition, update CurrentXXX variables.
if (CurrentDef != E.def()) {
CurrentDef = cast<Instruction>(E.def());
CurrentBlock = nullptr;
CurrentMaterialization = nullptr;
}

// If we have not seen this block, materialize the value.		// If we have not seen this block, materialize the value.
if (CurrentBlock != E.userBlock()) {		if (CurrentBlock != U->getParent()) {
CurrentBlock = E.userBlock();		CurrentBlock = U->getParent();
CurrentMaterialization = cast<Instruction>(CurrentDef)->clone();		CurrentMaterialization = cast<Instruction>(Def)->clone();
CurrentMaterialization->setName(CurrentDef->getName());		CurrentMaterialization->setName(Def->getName());
CurrentMaterialization->insertBefore(		CurrentMaterialization->insertBefore(
&*CurrentBlock->getFirstInsertionPt());		&*CurrentBlock->getFirstInsertionPt());
}		}
		if (auto *PN = dyn_cast<PHINode>(U)) {
if (auto *PN = dyn_cast<PHINode>(E.user())) {		assert(PN->getNumIncomingValues() == 1 &&
assert(PN->getNumIncomingValues() == 1 && "unexpected number of incoming "		"unexpected number of incoming "
"values in the PHINode");		"values in the PHINode");
PN->replaceAllUsesWith(CurrentMaterialization);		PN->replaceAllUsesWith(CurrentMaterialization);
PN->eraseFromParent();		PN->eraseFromParent();
continue;		continue;
}		}
		// Replace all uses of Def in the current instruction with the
// Replace all uses of CurrentDef in the current instruction with the
// CurrentMaterialization for the block.		// CurrentMaterialization for the block.
E.user()->replaceUsesOfWith(CurrentDef, CurrentMaterialization);		U->replaceUsesOfWith(Def, CurrentMaterialization);
		}
}		}
}		}

// Splits the block at a particular instruction unless it is the first		// Splits the block at a particular instruction unless it is the first
// instruction in the block with a single predecessor.		// instruction in the block with a single predecessor.
static BasicBlock splitBlockIfNotFirst(Instruction I, const Twine &Name) {		static BasicBlock splitBlockIfNotFirst(Instruction I, const Twine &Name) {
auto *BB = I->getParent();		auto *BB = I->getParent();
if (&BB->front() == I) {		if (&BB->front() == I) {
▲ Show 20 Lines • Show All 322 Lines • ▼ Show 20 Lines	static void eliminateSwiftError(Function &F, coro::Shape &Shape) {
if (!AllocasToPromote.empty()) {		if (!AllocasToPromote.empty()) {
DominatorTree DT(F);		DominatorTree DT(F);
PromoteMemToReg(AllocasToPromote, DT);		PromoteMemToReg(AllocasToPromote, DT);
}		}
}		}

/// retcon and retcon.once conventions assume that all spill uses can be sunk		/// retcon and retcon.once conventions assume that all spill uses can be sunk
/// after the coro.begin intrinsic.		/// after the coro.begin intrinsic.
static void sinkSpillUsesAfterCoroBegin(Function &F, const SpillInfo &Spills,		static void sinkSpillUsesAfterCoroBegin(Function &F,
		const FrameDataInfo &FrameData,
CoroBeginInst *CoroBegin) {		CoroBeginInst *CoroBegin) {
DominatorTree Dom(F);		DominatorTree Dom(F);

SmallSetVector<Instruction *, 32> ToMove;		SmallSetVector<Instruction *, 32> ToMove;
SmallVector<Instruction *, 32> Worklist;		SmallVector<Instruction *, 32> Worklist;

// Collect all users that precede coro.begin.		// Collect all users that precede coro.begin.
for (auto const &Entry : Spills) {		for (auto *Def : FrameData.getAllDefs()) {
auto *SpillDef = Entry.def();		for (User *U : Def->users()) {
for (User *U : SpillDef->users()) {
auto Inst = cast<Instruction>(U);		auto Inst = cast<Instruction>(U);
if (Inst->getParent() != CoroBegin->getParent() \|\|		if (Inst->getParent() != CoroBegin->getParent() \|\|
Dom.dominates(CoroBegin, Inst))		Dom.dominates(CoroBegin, Inst))
continue;		continue;
if (ToMove.insert(Inst))		if (ToMove.insert(Inst))
Worklist.push_back(Inst);		Worklist.push_back(Inst);
}		}
}		}
▲ Show 20 Lines • Show All 109 Lines • ▼ Show 20 Lines	for (BasicBlock *DomBB : DomSet) {
S->eraseFromParent();		S->eraseFromParent();

break;		break;
}		}
}		}
}		}
}		}

		static void collectFrameAllocas(Function &F, coro::Shape &Shape,
		SuspendCrossingInfo &Checker,
		SmallVectorImpl<AllocaInst *> &Allocas) {
		// Collect lifetime.start info for each alloca.
		using LifetimeStart = SmallPtrSet<Instruction *, 2>;
		llvm::DenseMap<AllocaInst *, std::unique_ptr<LifetimeStart>> LifetimeMap;
		for (Instruction &I : instructions(F)) {
		auto *II = dyn_cast<IntrinsicInst>(&I);
		if (!II \|\| II->getIntrinsicID() != Intrinsic::lifetime_start)
		continue;

		if (auto *OpInst = dyn_cast<Instruction>(II->getOperand(1))) {
		if (auto *AI = dyn_cast<AllocaInst>(OpInst->stripPointerCasts())) {

		if (LifetimeMap.find(AI) == LifetimeMap.end())
		LifetimeMap[AI] = std::make_unique<LifetimeStart>();
		LifetimeMap[AI]->insert(isa<AllocaInst>(OpInst) ? II : OpInst);
		}
		}
		}

		for (Instruction &I : instructions(F)) {
		auto *AI = dyn_cast<AllocaInst>(&I);
		if (!AI)
		continue;
		// The PromiseAlloca will be specially handled since it needs to be in a
		// fixed position in the frame.
		if (AI == Shape.SwitchLowering.PromiseAlloca) {
		continue;
		}
		auto Iter = LifetimeMap.find(AI);
		for (User *U : I.users()) {
		bool ShouldLiveOnFrame = false;

		// Check against lifetime.start if the instruction has the info.
		if (Iter != LifetimeMap.end())
		for (auto S : Iter->second) {
		if ((ShouldLiveOnFrame = Checker.isDefinitionAcrossSuspend(*S, U)))
		break;
		}
		else
		ShouldLiveOnFrame = Checker.isDefinitionAcrossSuspend(I, U);
		rjmccallUnsubmitted Not Done Reply Inline Actions I see that this seems to be the existing algorithm, and I know this is intended to be a refactoring patch rather than a functional one, so I'll let this go without comment beyond expressing my eagerness to see a follow-up. :) rjmccall: I see that this seems to be the existing algorithm, and I know this is intended to be a…

		if (ShouldLiveOnFrame) {
		Allocas.push_back(AI);
		break;
		}
		}
		}
		}

void coro::buildCoroutineFrame(Function &F, Shape &Shape) {		void coro::buildCoroutineFrame(Function &F, Shape &Shape) {
eliminateSwiftError(F, Shape);		eliminateSwiftError(F, Shape);

if (Shape.ABI == coro::ABI::Switch &&		if (Shape.ABI == coro::ABI::Switch &&
Shape.SwitchLowering.PromiseAlloca) {		Shape.SwitchLowering.PromiseAlloca) {
Shape.getSwitchCoroId()->clearPromise();		Shape.getSwitchCoroId()->clearPromise();
}		}

Show All 13 Lines	void coro::buildCoroutineFrame(Function &F, Shape &Shape) {
// Transforms multi-edge PHI Nodes, so that any value feeding into a PHI will		// Transforms multi-edge PHI Nodes, so that any value feeding into a PHI will
// never has its definition separated from the PHI by the suspend point.		// never has its definition separated from the PHI by the suspend point.
rewritePHIs(F);		rewritePHIs(F);

// Build suspend crossing info.		// Build suspend crossing info.
SuspendCrossingInfo Checker(F, Shape);		SuspendCrossingInfo Checker(F, Shape);

IRBuilder<> Builder(F.getContext());		IRBuilder<> Builder(F.getContext());
SpillInfo Spills;		FrameDataInfo FrameData;
SmallVector<CoroAllocaAllocInst*, 4> LocalAllocas;		SmallVector<CoroAllocaAllocInst*, 4> LocalAllocas;
SmallVector<Instruction*, 4> DeadInstructions;		SmallVector<Instruction*, 4> DeadInstructions;

		{
		SpillInfo Spills;
for (int Repeat = 0; Repeat < 4; ++Repeat) {		for (int Repeat = 0; Repeat < 4; ++Repeat) {
// See if there are materializable instructions across suspend points.		// See if there are materializable instructions across suspend points.
for (Instruction &I : instructions(F))		for (Instruction &I : instructions(F))
if (materializable(I))		if (materializable(I))
for (User *U : I.users())		for (User *U : I.users())
if (Checker.isDefinitionAcrossSuspend(I, U))		if (Checker.isDefinitionAcrossSuspend(I, U))
Spills.emplace_back(&I, U);		Spills[&I].push_back(cast<Instruction>(U));

if (Spills.empty())		if (Spills.empty())
break;		break;

// Rewrite materializable instructions to be materialized at the use point.		// Rewrite materializable instructions to be materialized at the use
LLVM_DEBUG(dump("Materializations", Spills));		// point.
		LLVM_DEBUG(dumpSpills("Materializations", Spills));
rewriteMaterializableInstructions(Builder, Spills);		rewriteMaterializableInstructions(Builder, Spills);
Spills.clear();		Spills.clear();
}		}
		}

sinkLifetimeStartMarkers(F, Shape, Checker);		sinkLifetimeStartMarkers(F, Shape, Checker);
// Collect lifetime.start info for each alloca.		collectFrameAllocas(F, Shape, Checker, FrameData.Allocas);
using LifetimeStart = SmallPtrSet<Instruction *, 2>;		LLVM_DEBUG(dumpAllocas(FrameData.Allocas));
llvm::DenseMap<Instruction *, std::unique_ptr<LifetimeStart>> LifetimeMap;
for (Instruction &I : instructions(F)) {
auto *II = dyn_cast<IntrinsicInst>(&I);
if (!II \|\| II->getIntrinsicID() != Intrinsic::lifetime_start)
continue;

if (auto *OpInst = dyn_cast<Instruction>(II->getOperand(1))) {
if (auto *AI = dyn_cast<AllocaInst>(OpInst->stripPointerCasts())) {

if (LifetimeMap.find(AI) == LifetimeMap.end())
LifetimeMap[AI] = std::make_unique<LifetimeStart>();
LifetimeMap[AI]->insert(isa<AllocaInst>(OpInst) ? II : OpInst);
}
}
}

// Collect the spills for arguments and other not-materializable values.		// Collect the spills for arguments and other not-materializable values.
for (Argument &A : F.args())		for (Argument &A : F.args())
for (User *U : A.users())		for (User *U : A.users())
if (Checker.isDefinitionAcrossSuspend(A, U))		if (Checker.isDefinitionAcrossSuspend(A, U))
Spills.emplace_back(&A, U);		FrameData.Spills[&A].push_back(cast<Instruction>(U));

for (Instruction &I : instructions(F)) {		for (Instruction &I : instructions(F)) {
// Values returned from coroutine structure intrinsics should not be part		// Values returned from coroutine structure intrinsics should not be part
// of the Coroutine Frame.		// of the Coroutine Frame.
if (isCoroutineStructureIntrinsic(I) \|\| &I == Shape.CoroBegin)		if (isCoroutineStructureIntrinsic(I) \|\| &I == Shape.CoroBegin)
continue;		continue;

// The Coroutine Promise always included into coroutine frame, no need to		// The Coroutine Promise always included into coroutine frame, no need to
Show All 14 Lines	if (auto AI = dyn_cast<CoroAllocaAllocInst>(&I)) {
// the rewritten value. The rewrite doesn't invalidate anything in		// the rewritten value. The rewrite doesn't invalidate anything in
// Spills because the other alloca intrinsics have no other operands		// Spills because the other alloca intrinsics have no other operands
// besides AI, and it doesn't invalidate the iteration because we delay		// besides AI, and it doesn't invalidate the iteration because we delay
// erasing AI.		// erasing AI.
auto Alloc = lowerNonLocalAlloca(AI, Shape, DeadInstructions);		auto Alloc = lowerNonLocalAlloca(AI, Shape, DeadInstructions);

for (User *U : Alloc->users()) {		for (User *U : Alloc->users()) {
if (Checker.isDefinitionAcrossSuspend(*Alloc, U))		if (Checker.isDefinitionAcrossSuspend(*Alloc, U))
Spills.emplace_back(Alloc, U);		FrameData.Spills[Alloc].push_back(cast<Instruction>(U));
}		}
continue;		continue;
}		}

// Ignore alloca.get; we process this as part of coro.alloca.alloc.		// Ignore alloca.get; we process this as part of coro.alloca.alloc.
if (isa<CoroAllocaGetInst>(I)) {		if (isa<CoroAllocaGetInst>(I))
continue;		continue;
}

auto Iter = LifetimeMap.find(&I);
for (User *U : I.users()) {
bool NeedSpill = false;

// Check against lifetime.start if the instruction has the info.		if (isa<AllocaInst>(I))
if (Iter != LifetimeMap.end())		continue;
for (auto S : Iter->second) {
if ((NeedSpill = Checker.isDefinitionAcrossSuspend(*S, U)))
break;
}
else
NeedSpill = Checker.isDefinitionAcrossSuspend(I, U);

if (NeedSpill) {		for (User *U : I.users())
		if (Checker.isDefinitionAcrossSuspend(I, U)) {
// We cannot spill a token.		// We cannot spill a token.
if (I.getType()->isTokenTy())		if (I.getType()->isTokenTy())
report_fatal_error(		report_fatal_error(
"token definition is separated from the use by a suspend point");		"token definition is separated from the use by a suspend point");
Spills.emplace_back(&I, U);		FrameData.Spills[&I].push_back(cast<Instruction>(U));
}
}		}
}		}
LLVM_DEBUG(dump("Spills", Spills));		LLVM_DEBUG(dumpSpills("Spills", FrameData.Spills));
if (Shape.ABI == coro::ABI::Retcon \|\| Shape.ABI == coro::ABI::RetconOnce)		if (Shape.ABI == coro::ABI::Retcon \|\| Shape.ABI == coro::ABI::RetconOnce)
sinkSpillUsesAfterCoroBegin(F, Spills, Shape.CoroBegin);		sinkSpillUsesAfterCoroBegin(F, FrameData, Shape.CoroBegin);
Shape.FrameTy = buildFrameType(F, Shape, Spills);		Shape.FrameTy = buildFrameType(F, Shape, FrameData);
Shape.FramePtr = insertSpills(Spills, Shape);		// Add PromiseAlloca to Allocas list so that it is processed in insertSpills.
		if (Shape.ABI == coro::ABI::Switch && Shape.SwitchLowering.PromiseAlloca)
		FrameData.Allocas.push_back(Shape.SwitchLowering.PromiseAlloca);
		Shape.FramePtr = insertSpills(FrameData, Shape);
lowerLocalAllocas(LocalAllocas, DeadInstructions);		lowerLocalAllocas(LocalAllocas, DeadInstructions);

for (auto I : DeadInstructions)		for (auto I : DeadInstructions)
I->eraseFromParent();		I->eraseFromParent();
}		}

llvm/lib/Transforms/Coroutines/CoroInternal.h

Show First 20 Lines • Show All 116 Lines • ▼ Show 20 Lines	struct LLVM_LIBRARY_VISIBILITY Shape {

bool ReuseFrameSlot;		bool ReuseFrameSlot;

struct SwitchLoweringStorage {		struct SwitchLoweringStorage {
SwitchInst *ResumeSwitch;		SwitchInst *ResumeSwitch;
AllocaInst *PromiseAlloca;		AllocaInst *PromiseAlloca;
BasicBlock *ResumeEntryBlock;		BasicBlock *ResumeEntryBlock;
unsigned IndexField;		unsigned IndexField;
unsigned PromiseField;
rjmccallUnsubmitted Not Done Reply Inline Actions Yeah, I think this is never used directly — clients expect to find it by offset, and within the function it's of course just referenced as a value. rjmccall: Yeah, I think this is never used directly — clients expect to find it by offset, and within the…
bool HasFinalSuspend;		bool HasFinalSuspend;
};		};

struct RetconLoweringStorage {		struct RetconLoweringStorage {
Function *ResumePrototype;		Function *ResumePrototype;
Function *Alloc;		Function *Alloc;
Function *Dealloc;		Function *Dealloc;
BasicBlock *ReturnBlock;		BasicBlock *ReturnBlock;
▲ Show 20 Lines • Show All 83 Lines • ▼ Show 20 Lines	CallingConv::ID getResumeFunctionCC() const {
llvm_unreachable("Unknown coro::ABI enum");		llvm_unreachable("Unknown coro::ABI enum");
}		}

AllocaInst *getPromiseAlloca() const {		AllocaInst *getPromiseAlloca() const {
if (ABI == coro::ABI::Switch)		if (ABI == coro::ABI::Switch)
return SwitchLowering.PromiseAlloca;		return SwitchLowering.PromiseAlloca;
return nullptr;		return nullptr;
}		}
unsigned getPromiseField() const {
assert(ABI == coro::ABI::Switch);
assert(FrameTy && "frame type not assigned");
assert(SwitchLowering.PromiseAlloca && "no promise alloca");
return SwitchLowering.PromiseField;
}

/// Allocate memory according to the rules of the active lowering.		/// Allocate memory according to the rules of the active lowering.
///		///
/// \param CG - if non-null, will be updated for the new call		/// \param CG - if non-null, will be updated for the new call
Value emitAlloc(IRBuilder<> &Builder, Value Size, CallGraph *CG) const;		Value emitAlloc(IRBuilder<> &Builder, Value Size, CallGraph *CG) const;

/// Deallocate memory according to the rules of the active lowering.		/// Deallocate memory according to the rules of the active lowering.
///		///
Show All 16 Lines

llvm/test/Transforms/Coroutines/coro-debug-frame-variable.ll

	Show All 19 Lines
	;			;
	; The CHECKs verify that dbg.declare intrinsics are created for the coroutine			; The CHECKs verify that dbg.declare intrinsics are created for the coroutine
	; funclet 'f.resume', and that they reference the address of the variables on			; funclet 'f.resume', and that they reference the address of the variables on
	; the coroutine frame. The debug locations for the original function 'f' are			; the coroutine frame. The debug locations for the original function 'f' are
	; static (!11 and !13), whereas the coroutine funclet will have its own new			; static (!11 and !13), whereas the coroutine funclet will have its own new
	; ones with identical line and column numbers.			; ones with identical line and column numbers.
	;			;
	; CHECK-LABEL: define void @f() {			; CHECK-LABEL: define void @f() {
	; CHECK: init.ready:			; CHECK: entry:
	; CHECK: [[IGEP:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 4			; CHECK: [[IGEP:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 4
				; CHECK: [[JGEP:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 5
				; CHECK: init.ready:
	; CHECK: call void @llvm.dbg.declare(metadata i32* [[IGEP]], metadata ![[IVAR:[0-9]+]], metadata !DIExpression()), !dbg ![[IDBGLOC:[0-9]+]]			; CHECK: call void @llvm.dbg.declare(metadata i32* [[IGEP]], metadata ![[IVAR:[0-9]+]], metadata !DIExpression()), !dbg ![[IDBGLOC:[0-9]+]]
	; CHECK: await.ready:			; CHECK: await.ready:
	; CHECK: [[JGEP:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 5
	; CHECK: call void @llvm.dbg.declare(metadata i32* [[JGEP]], metadata ![[JVAR:[0-9]+]], metadata !DIExpression()), !dbg ![[JDBGLOC:[0-9]+]]			; CHECK: call void @llvm.dbg.declare(metadata i32* [[JGEP]], metadata ![[JVAR:[0-9]+]], metadata !DIExpression()), !dbg ![[JDBGLOC:[0-9]+]]
	;			;
	; CHECK-LABEL: define internal fastcc void @f.resume({{.*}}) {			; CHECK-LABEL: define internal fastcc void @f.resume({{.*}}) {
	; CHECK: init.ready:			; CHECK: entry.resume:
	; CHECK: [[IGEP_RESUME:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 4			; CHECK: [[IGEP_RESUME:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 4
				; CHECK: [[JGEP_RESUME:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 5
				; CHECK: init.ready:
	; CHECK: call void @llvm.dbg.declare(metadata i32* [[IGEP_RESUME]], metadata ![[IVAR_RESUME:[0-9]+]], metadata !DIExpression()), !dbg ![[IDBGLOC_RESUME:[0-9]+]]			; CHECK: call void @llvm.dbg.declare(metadata i32* [[IGEP_RESUME]], metadata ![[IVAR_RESUME:[0-9]+]], metadata !DIExpression()), !dbg ![[IDBGLOC_RESUME:[0-9]+]]
	; CHECK: await.ready:			; CHECK: await.ready:
	; CHECK: [[JGEP_RESUME:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 5
	; CHECK: call void @llvm.dbg.declare(metadata i32* [[JGEP_RESUME]], metadata ![[JVAR_RESUME:[0-9]+]], metadata !DIExpression()), !dbg ![[JDBGLOC_RESUME:[0-9]+]]			; CHECK: call void @llvm.dbg.declare(metadata i32* [[JGEP_RESUME]], metadata ![[JVAR_RESUME:[0-9]+]], metadata !DIExpression()), !dbg ![[JDBGLOC_RESUME:[0-9]+]]
	;			;
	; CHECK: ![[IVAR]] = !DILocalVariable(name: "i"			; CHECK: ![[IVAR]] = !DILocalVariable(name: "i"
	; CHECK: ![[SCOPE:[0-9]+]] = distinct !DILexicalBlock(scope: !8, file: !1, line: 23, column: 12)			; CHECK: ![[SCOPE:[0-9]+]] = distinct !DILexicalBlock(scope: !8, file: !1, line: 23, column: 12)
	; CHECK: ![[IDBGLOC]] = !DILocation(line: 24, column: 7, scope: ![[SCOPE]])			; CHECK: ![[IDBGLOC]] = !DILocation(line: 24, column: 7, scope: ![[SCOPE]])
	; CHECK: ![[JVAR]] = !DILocalVariable(name: "j"			; CHECK: ![[JVAR]] = !DILocalVariable(name: "j"
	; CHECK: ![[JDBGLOC]] = !DILocation(line: 32, column: 7, scope: ![[SCOPE]])			; CHECK: ![[JDBGLOC]] = !DILocation(line: 32, column: 7, scope: ![[SCOPE]])
	; CHECK: ![[IVAR_RESUME]] = !DILocalVariable(name: "i"			; CHECK: ![[IVAR_RESUME]] = !DILocalVariable(name: "i"
	▲ Show 20 Lines • Show All 166 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-debug.ll

	Show First 20 Lines • Show All 124 Lines • ▼ Show 20 Lines
	!21 = !DILocation(line: 59, column: 10, scope: !6)			!21 = !DILocation(line: 59, column: 10, scope: !6)
	!22 = !DILocation(line: 59, column: 7, scope: !6)			!22 = !DILocation(line: 59, column: 7, scope: !6)
	!23 = !DILocation(line: 59, column: 5, scope: !6)			!23 = !DILocation(line: 59, column: 5, scope: !6)
	!24 = !DILocation(line: 62, column: 3, scope: !6)			!24 = !DILocation(line: 62, column: 3, scope: !6)

	; CHECK: define i8* @f(i32 %x) #0 !dbg ![[ORIG:[0-9]+]]			; CHECK: define i8* @f(i32 %x) #0 !dbg ![[ORIG:[0-9]+]]
	; CHECK: define internal fastcc void @f.resume(%f.Frame* noalias nonnull align 8 dereferenceable(32) %FramePtr) #0 !dbg ![[RESUME:[0-9]+]]			; CHECK: define internal fastcc void @f.resume(%f.Frame* noalias nonnull align 8 dereferenceable(32) %FramePtr) #0 !dbg ![[RESUME:[0-9]+]]
	; CHECK: entry.resume:			; CHECK: entry.resume:
	; CHECK-NEXT: call void @coro.devirt.trigger(i8* null)			; CHECK: call void @coro.devirt.trigger(i8* null)
	; CHECK-NEXT: call void @llvm.dbg.declare(metadata i32* %x.addr.reload.addr, metadata ![[RESUME_VAR:[0-9]+]]			; CHECK: call void @llvm.dbg.declare(metadata i32* %x.addr.reload.addr, metadata ![[RESUME_VAR:[0-9]+]]
	; CHECK: define internal fastcc void @f.destroy(%f.Frame* noalias nonnull align 8 dereferenceable(32) %FramePtr) #0 !dbg ![[DESTROY:[0-9]+]]			; CHECK: define internal fastcc void @f.destroy(%f.Frame* noalias nonnull align 8 dereferenceable(32) %FramePtr) #0 !dbg ![[DESTROY:[0-9]+]]
	; CHECK: define internal fastcc void @f.cleanup(%f.Frame* noalias nonnull align 8 dereferenceable(32) %FramePtr) #0 !dbg ![[CLEANUP:[0-9]+]]			; CHECK: define internal fastcc void @f.cleanup(%f.Frame* noalias nonnull align 8 dereferenceable(32) %FramePtr) #0 !dbg ![[CLEANUP:[0-9]+]]

	; CHECK: ![[ORIG]] = distinct !DISubprogram(name: "f", linkageName: "flink"			; CHECK: ![[ORIG]] = distinct !DISubprogram(name: "f", linkageName: "flink"

	; CHECK: ![[RESUME]] = distinct !DISubprogram(name: "f", linkageName: "flink"			; CHECK: ![[RESUME]] = distinct !DISubprogram(name: "f", linkageName: "flink"
	; CHECK: ![[RESUME_VAR]] = !DILocalVariable(name: "x", arg: 1, scope: ![[RESUME]]			; CHECK: ![[RESUME_VAR]] = !DILocalVariable(name: "x", arg: 1, scope: ![[RESUME]]

	; CHECK: ![[DESTROY]] = distinct !DISubprogram(name: "f", linkageName: "flink"			; CHECK: ![[DESTROY]] = distinct !DISubprogram(name: "f", linkageName: "flink"

	; CHECK: ![[CLEANUP]] = distinct !DISubprogram(name: "f", linkageName: "flink"			; CHECK: ![[CLEANUP]] = distinct !DISubprogram(name: "f", linkageName: "flink"

llvm/test/Transforms/Coroutines/coro-frame-arrayalloca.ll

Show All 34 Lines	suspend:
ret i8* %hdl		ret i8* %hdl
}		}

; See if the array alloca was stored as an array field.		; See if the array alloca was stored as an array field.
; CHECK-LABEL: %f.Frame = type { void (%f.Frame), void (%f.Frame), double, double, [4 x i32], i1 }		; CHECK-LABEL: %f.Frame = type { void (%f.Frame), void (%f.Frame), double, double, [4 x i32], i1 }

; See if we used correct index to access prefix, data, suffix (@f)		; See if we used correct index to access prefix, data, suffix (@f)
; CHECK-LABEL: @f(		; CHECK-LABEL: @f(
; CHECK: %prefix = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 2		; CHECK: %[[PREFIX:.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 2
; CHECK-NEXT: %data = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 4		; CHECK-NEXT: %[[DATA:.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 4
; CHECK-NEXT: %suffix = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 3		; CHECK-NEXT: %[[SUFFIX:.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 3
; CHECK-NEXT: call void @consume.double.ptr(double* %prefix)		; CHECK-NEXT: call void @consume.double.ptr(double* %[[PREFIX:.+]])
; CHECK-NEXT: call void @consume.i32.ptr(i32* %data)		; CHECK-NEXT: call void @consume.i32.ptr(i32* %[[DATA:.+]])
; CHECK-NEXT: call void @consume.double.ptr(double* %suffix)		; CHECK-NEXT: call void @consume.double.ptr(double* %[[SUFFIX:.+]])
; CHECK: ret i8*		; CHECK: ret i8*

; See if we used correct index to access prefix, data, suffix (@f.resume)		; See if we used correct index to access prefix, data, suffix (@f.resume)
; CHECK-LABEL: @f.resume(		; CHECK-LABEL: @f.resume(
; CHECK: %[[SUFFIX:.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 3
; CHECK: %[[DATA:.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 4
; CHECK: %[[PREFIX:.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 2		; CHECK: %[[PREFIX:.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 2
		; CHECK: %[[DATA:.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 4
		; CHECK: %[[SUFFIX:.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 3
; CHECK: call void @consume.double.ptr(double* %[[PREFIX]])		; CHECK: call void @consume.double.ptr(double* %[[PREFIX]])
; CHECK-NEXT: call void @consume.i32.ptr(i32* %[[DATA]])		; CHECK-NEXT: call void @consume.i32.ptr(i32* %[[DATA]])
; CHECK-NEXT: call void @consume.double.ptr(double* %[[SUFFIX]])		; CHECK-NEXT: call void @consume.double.ptr(double* %[[SUFFIX]])

declare i8* @llvm.coro.free(token, i8*)		declare i8* @llvm.coro.free(token, i8*)
declare i32 @llvm.coro.size.i32()		declare i32 @llvm.coro.size.i32()
declare i8 @llvm.coro.suspend(token, i1)		declare i8 @llvm.coro.suspend(token, i1)
declare void @llvm.coro.resume(i8*)		declare void @llvm.coro.resume(i8*)
Show All 10 Lines

llvm/test/Transforms/Coroutines/coro-frame-reuse-alloca-01.ll

Show First 20 Lines • Show All 54 Lines • ▼ Show 20 Lines	cleanup2:
br label %cleanup		br label %cleanup
cleanup:		cleanup:
call i8* @llvm.coro.free(token %1, i8* %2)		call i8* @llvm.coro.free(token %1, i8* %2)
br label %coro.ret		br label %coro.ret
coro.ret:		coro.ret:
call i1 @llvm.coro.end(i8* null, i1 false)		call i1 @llvm.coro.end(i8* null, i1 false)
ret void		ret void
}		}
; CHECK-LABEL: @a.resume(
; CHECK: %a.reload.addr{{[0-9]+}} = getelementptr inbounds %a.Frame, %a.Frame* %FramePtr[[APositon:.*]]		; check that there is only one %struct.big_structure in the frame.
; CHECK: %b.reload.addr{{[0-9]+}} = getelementptr inbounds %a.Frame, %a.Frame* %FramePtr[[APositon]]		; CHECK: %a.Frame = type { void (%a.Frame), void (%a.Frame), %"struct.task::promise_type", %struct.big_structure, i1 }

declare token @llvm.coro.id(i32, i8* readnone, i8* nocapture readonly, i8*)		declare token @llvm.coro.id(i32, i8* readnone, i8* nocapture readonly, i8*)
declare i1 @llvm.coro.alloc(token) #3		declare i1 @llvm.coro.alloc(token) #3
declare i64 @llvm.coro.size.i64() #5		declare i64 @llvm.coro.size.i64() #5
declare i8* @llvm.coro.begin(token, i8* writeonly) #3		declare i8* @llvm.coro.begin(token, i8* writeonly) #3
declare token @llvm.coro.save(i8*) #3		declare token @llvm.coro.save(i8*) #3
declare i8* @llvm.coro.frame() #5		declare i8* @llvm.coro.frame() #5
declare i8 @llvm.coro.suspend(token, i1) #3		declare i8 @llvm.coro.suspend(token, i1) #3
declare i8* @llvm.coro.free(token, i8* nocapture readonly) #2		declare i8* @llvm.coro.free(token, i8* nocapture readonly) #2
declare i1 @llvm.coro.end(i8*, i1) #3		declare i1 @llvm.coro.end(i8*, i1) #3
declare void @llvm.lifetime.start.p0i8(i64, i8* nocapture) #4		declare void @llvm.lifetime.start.p0i8(i64, i8* nocapture) #4
declare void @llvm.lifetime.end.p0i8(i64, i8* nocapture) #4		declare void @llvm.lifetime.end.p0i8(i64, i8* nocapture) #4
No newline at end of file

llvm/test/Transforms/Coroutines/coro-frame-reuse-alloca-02.ll

Show First 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	cleanup2:
br label %cleanup		br label %cleanup
cleanup:		cleanup:
call i8* @llvm.coro.free(token %1, i8* %2)		call i8* @llvm.coro.free(token %1, i8* %2)
br label %coro.ret		br label %coro.ret
coro.ret:		coro.ret:
call i1 @llvm.coro.end(i8* null, i1 false)		call i1 @llvm.coro.end(i8* null, i1 false)
ret void		ret void
}		}
		; CHECK: %a.Frame = type { void (%a.Frame), void (%a.Frame), %"struct.task::promise_type", %struct.big_structure, i1 }
; CHECK-LABEL: @a.resume(		; CHECK-LABEL: @a.resume(
; CHECK: %b.reload.addr = bitcast %struct.big_structure* %0 to %struct.big_structure.2*		; CHECK: %[[A:.]] = getelementptr inbounds %a.Frame, %a.Frame %FramePtr, i32 0, i32 3
		; CHECK: %{{.}} = bitcast %struct.big_structure %[[A]] to %struct.big_structure.2*

declare token @llvm.coro.id(i32, i8* readnone, i8* nocapture readonly, i8*)		declare token @llvm.coro.id(i32, i8* readnone, i8* nocapture readonly, i8*)
declare i1 @llvm.coro.alloc(token) #3		declare i1 @llvm.coro.alloc(token) #3
declare i64 @llvm.coro.size.i64() #5		declare i64 @llvm.coro.size.i64() #5
declare i8* @llvm.coro.begin(token, i8* writeonly) #3		declare i8* @llvm.coro.begin(token, i8* writeonly) #3
declare token @llvm.coro.save(i8*) #3		declare token @llvm.coro.save(i8*) #3
declare i8* @llvm.coro.frame() #5		declare i8* @llvm.coro.frame() #5
declare i8 @llvm.coro.suspend(token, i1) #3		declare i8 @llvm.coro.suspend(token, i1) #3
declare i8* @llvm.coro.free(token, i8* nocapture readonly) #2		declare i8* @llvm.coro.free(token, i8* nocapture readonly) #2
declare i1 @llvm.coro.end(i8*, i1) #3		declare i1 @llvm.coro.end(i8*, i1) #3
declare void @llvm.lifetime.start.p0i8(i64, i8* nocapture) #4		declare void @llvm.lifetime.start.p0i8(i64, i8* nocapture) #4
declare void @llvm.lifetime.end.p0i8(i64, i8* nocapture) #4		declare void @llvm.lifetime.end.p0i8(i64, i8* nocapture) #4
No newline at end of file

llvm/test/Transforms/Coroutines/coro-retcon-frame.ll

	Show First 20 Lines • Show All 46 Lines • ▼ Show 20 Lines
	; CHECK: store i64 0, i64* [[PROJ2]]			; CHECK: store i64 0, i64* [[PROJ2]]
	; CHECK: [[ESCAPED_ADDR:%.]] = ptrtoint { i64, i64 } [[TMP]] to i64			; CHECK: [[ESCAPED_ADDR:%.]] = ptrtoint { i64, i64 } [[TMP]] to i64
	; CHECK: call void @init(i64* [[PROJ1]])			; CHECK: call void @init(i64* [[PROJ1]])
	; CHECK: call void @init(i64* [[PROJ2]])			; CHECK: call void @init(i64* [[PROJ2]])
	; CHECK: call void @use_addr_val(i64 [[ESCAPED_ADDR]], { i64, i64 }* [[TMP]])			; CHECK: call void @use_addr_val(i64 [[ESCAPED_ADDR]], { i64, i64 }* [[TMP]])

	; CHECK-LABEL: define internal void @f.resume.0(i8* {{.*}} %0, i1 %1) {			; CHECK-LABEL: define internal void @f.resume.0(i8* {{.*}} %0, i1 %1) {
	; CHECK: [[FRAMEPTR:%.]] = bitcast i8 %0 to %f.Frame*			; CHECK: [[FRAMEPTR:%.]] = bitcast i8 %0 to %f.Frame*
	; CHECK: resume:
	; CHECK: [[TMP:%.]] = getelementptr inbounds %f.Frame, %f.Frame [[FRAMEPTR]], i32 0, i32 0			; CHECK: [[TMP:%.]] = getelementptr inbounds %f.Frame, %f.Frame [[FRAMEPTR]], i32 0, i32 0
				; CHECK: resume:
	; CHECK: [[CAST:%.]] = bitcast { i64, i64 } [[TMP]] to i8*			; CHECK: [[CAST:%.]] = bitcast { i64, i64 } [[TMP]] to i8*
	; CHECK: call void @use(i8* [[CAST]])			; CHECK: call void @use(i8* [[CAST]])

	declare token @llvm.coro.id.retcon.once(i32, i32, i8, i8, i8, i8)			declare token @llvm.coro.id.retcon.once(i32, i32, i8, i8, i8, i8)
	declare i8* @llvm.coro.begin(token, i8*)			declare i8* @llvm.coro.begin(token, i8*)
	declare i1 @llvm.coro.suspend.retcon.i1(...)			declare i1 @llvm.coro.suspend.retcon.i1(...)
	declare i1 @llvm.coro.end(i8*, i1)			declare i1 @llvm.coro.end(i8*, i1)

llvm/test/Transforms/Coroutines/coro-retcon-once-value2.ll

	Show All 36 Lines
	; CHECK-NEXT: ret { i8, i32 } [[T0]]			; CHECK-NEXT: ret { i8, i32 } [[T0]]
	; CHECK-NEXT: }			; CHECK-NEXT: }

	; CHECK-LABEL: define internal void @f.resume.0(i8* noalias nonnull align 8 dereferenceable(8) %0, i1 zeroext %1)			; CHECK-LABEL: define internal void @f.resume.0(i8* noalias nonnull align 8 dereferenceable(8) %0, i1 zeroext %1)
	; CHECK-NEXT: :			; CHECK-NEXT: :
	; CHECK-NEXT: [[T0:%.]] = bitcast i8 %0 to [[FRAME_T:%.]]*			; CHECK-NEXT: [[T0:%.]] = bitcast i8 %0 to [[FRAME_T:%.]]*
	; CHECK-NEXT: [[FRAME:%.]] = load [[FRAME_T]], [[FRAME_T]]** [[T0]]			; CHECK-NEXT: [[FRAME:%.]] = load [[FRAME_T]], [[FRAME_T]]** [[T0]]
	; CHECK-NEXT: bitcast [[FRAME_T]]* [[FRAME]] to i8*			; CHECK-NEXT: bitcast [[FRAME_T]]* [[FRAME]] to i8*
	; CHECK-NEXT: %temp = getelementptr inbounds [[FRAME_T]], [[FRAME_T]]* [[FRAME]], i32 0, i32 1			; CHECK-NEXT: [[TEMP_SLOT:%.]] = getelementptr inbounds [[FRAME_T]], [[FRAME_T]] [[FRAME]], i32 0, i32 1
	; CHECK-NEXT: br i1 %1,			; CHECK-NEXT: br i1 %1,
	; CHECK: :			; CHECK: :
	; CHECK-NEXT: [[TEMP_SLOT:%.]] = getelementptr inbounds [[FRAME_T]], [[FRAME_T]] [[FRAME]], i32 0, i32 1
	; CHECK-NEXT: [[PTR_SLOT:%.]] = getelementptr inbounds [[FRAME_T]], [[FRAME_T]] [[FRAME]], i32 0, i32 0			; CHECK-NEXT: [[PTR_SLOT:%.]] = getelementptr inbounds [[FRAME_T]], [[FRAME_T]] [[FRAME]], i32 0, i32 0
	; CHECK-NEXT: [[PTR_RELOAD:%.]] = load i32, i32** [[PTR_SLOT]]			; CHECK-NEXT: [[PTR_RELOAD:%.]] = load i32, i32** [[PTR_SLOT]]
	; CHECK-NEXT: %newvalue = load i32, i32* [[TEMP_SLOT]]			; CHECK-NEXT: %newvalue = load i32, i32* [[TEMP_SLOT]]
	; CHECK-NEXT: store i32 %newvalue, i32* [[PTR_RELOAD]]			; CHECK-NEXT: store i32 %newvalue, i32* [[PTR_RELOAD]]
	; CHECK-NEXT: br label			; CHECK-NEXT: br label
	; CHECK: :			; CHECK: :
	; CHECK-NEXT: [[T0:%.]] = bitcast [[FRAME_T]] [[FRAME]] to i8*			; CHECK-NEXT: [[T0:%.]] = bitcast [[FRAME_T]] [[FRAME]] to i8*
	; CHECK-NEXT: call fastcc void @deallocate(i8* [[T0]])			; CHECK-NEXT: call fastcc void @deallocate(i8* [[T0]])
	Show All 15 Lines

llvm/test/Transforms/Coroutines/coro-split-sink-lifetime-01.ll

Show All 37 Lines	await.ready:
br label %exit		br label %exit
exit:		exit:
call i1 @llvm.coro.end(i8* null, i1 false)		call i1 @llvm.coro.end(i8* null, i1 false)
ret void		ret void
}		}

; CHECK-LABEL: @a.resume(		; CHECK-LABEL: @a.resume(
; CHECK: %testval = alloca i32, align 4		; CHECK: %testval = alloca i32, align 4
		; CHECK-NEXT: getelementptr inbounds %a.Frame
; CHECK-NEXT: %0 = bitcast i32* %testval to i8*		; CHECK-NEXT: %0 = bitcast i32* %testval to i8*
; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 4, i8* %0)		; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 4, i8* %0)
; CHECK-NEXT: getelementptr inbounds %a.Frame
; CHECK-NEXT: getelementptr inbounds %"struct.lean_future<int>::Awaiter"		; CHECK-NEXT: getelementptr inbounds %"struct.lean_future<int>::Awaiter"
; CHECK-NEXT: %val = load i32, i32* %Result		; CHECK-NEXT: %val = load i32, i32* %Result
; CHECK-NEXT: %test = load i32, i32* %testval		; CHECK-NEXT: %test = load i32, i32* %testval
; CHECK-NEXT: call void @print(i32 %test)		; CHECK-NEXT: call void @print(i32 %test)
; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 4, i8* %0)		; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 4, i8* %0)
; CHECK-NEXT: call void @print(i32 %val)		; CHECK-NEXT: call void @print(i32 %val)
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void

Show All 15 Lines

llvm/test/Transforms/Coroutines/coro-split-sink-lifetime-03.ll

Show All 38 Lines	await.ready:
call void @print(i32 %val)		call void @print(i32 %val)
br label %exit		br label %exit
exit:		exit:
call i1 @llvm.coro.end(i8* null, i1 false)		call i1 @llvm.coro.end(i8* null, i1 false)
ret void		ret void
}		}
; CHECK-LABEL: @a.gep.resume(		; CHECK-LABEL: @a.gep.resume(
; CHECK: %testval = alloca %i8.array		; CHECK: %testval = alloca %i8.array
		; CHECK-NEXT: getelementptr inbounds %a.gep.Frame
; CHECK-NEXT: %0 = bitcast %i8.array* %testval to i8*		; CHECK-NEXT: %0 = bitcast %i8.array* %testval to i8*
; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 100, i8* %0)		; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 100, i8* %0)
; CHECK-NEXT: getelementptr inbounds %a.gep.Frame
; CHECK-NEXT: getelementptr inbounds %"struct.lean_future<int>::Awaiter"		; CHECK-NEXT: getelementptr inbounds %"struct.lean_future<int>::Awaiter"
; CHECK-NEXT: getelementptr inbounds %i8.array, %i8.array* %testval		; CHECK-NEXT: getelementptr inbounds %i8.array, %i8.array* %testval
; CHECK-NEXT: %val = load i32, i32* %Result		; CHECK-NEXT: %val = load i32, i32* %Result
; CHECK-NEXT: call void @consume.i8.array(%i8.array* %testval)		; CHECK-NEXT: call void @consume.i8.array(%i8.array* %testval)
; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 100, i8* %cast1)		; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 100, i8* %cast1)
; CHECK-NEXT: call void @print(i32 %val)		; CHECK-NEXT: call void @print(i32 %val)
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void

Show All 14 Lines

llvm/test/Transforms/Coroutines/coro-split-sink-lifetime-04.ll

Show All 37 Lines	await.ready:
br label %exit		br label %exit
exit:		exit:
call i1 @llvm.coro.end(i8* null, i1 false)		call i1 @llvm.coro.end(i8* null, i1 false)
ret void		ret void
}		}

; CHECK-LABEL: @a.resume(		; CHECK-LABEL: @a.resume(
; CHECK: %testval = alloca i8, align 1		; CHECK: %testval = alloca i8, align 1
; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 1, i8* %testval)
; CHECK-NEXT: getelementptr inbounds %a.Frame		; CHECK-NEXT: getelementptr inbounds %a.Frame
		; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 1, i8* %testval)
; CHECK-NEXT: getelementptr inbounds %"struct.lean_future<int>::Awaiter"		; CHECK-NEXT: getelementptr inbounds %"struct.lean_future<int>::Awaiter"
; CHECK-NEXT: %val = load i32, i32* %Result		; CHECK-NEXT: %val = load i32, i32* %Result
; CHECK-NEXT: %test = load i8, i8* %testval		; CHECK-NEXT: %test = load i8, i8* %testval
; CHECK-NEXT: call void @consume.i8(i8 %test)		; CHECK-NEXT: call void @consume.i8(i8 %test)
; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 1, i8* %testval)		; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 1, i8* %testval)
; CHECK-NEXT: call void @print(i32 %val)		; CHECK-NEXT: call void @print(i32 %val)
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void

Show All 15 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[Coroutines] Refactor/Rewrite Spill and Alloca processingClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 297440

llvm/lib/Transforms/Coroutines/CoroFrame.cpp

llvm/lib/Transforms/Coroutines/CoroInternal.h

llvm/test/Transforms/Coroutines/coro-debug-frame-variable.ll

llvm/test/Transforms/Coroutines/coro-debug.ll

llvm/test/Transforms/Coroutines/coro-frame-arrayalloca.ll

llvm/test/Transforms/Coroutines/coro-frame-reuse-alloca-01.ll

llvm/test/Transforms/Coroutines/coro-frame-reuse-alloca-02.ll

llvm/test/Transforms/Coroutines/coro-retcon-frame.ll

llvm/test/Transforms/Coroutines/coro-retcon-once-value2.ll

llvm/test/Transforms/Coroutines/coro-split-sink-lifetime-01.ll

llvm/test/Transforms/Coroutines/coro-split-sink-lifetime-03.ll

llvm/test/Transforms/Coroutines/coro-split-sink-lifetime-04.ll

[Coroutines] Refactor/Rewrite Spill and Alloca processing
ClosedPublic