This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Transforms/Coroutines/
-
llvm/
-
Transforms/
-
Coroutines/
-
CoroSplit.h
-
lib/
-
Passes/
-
PassBuilder.cpp
-
PassRegistry.def
-
Transforms/Coroutines/
-
Coroutines/
7/14
CoroSplit.cpp
-
test/Transforms/Coroutines/
-
Transforms/
-
Coroutines/
-
coro-alloc-with-param-O0.ll
-
coro-alloc-with-param-O2.ll
-
coro-alloc-with-param.ll
-
coro-catchswitch.ll
-
coro-debug.ll
-
coro-eh-aware-edge-split-00.ll
-
coro-eh-aware-edge-split-01.ll
-
coro-eh-aware-edge-split-02.ll
-
coro-eh-aware-edge-split.ll
-
coro-frame-arrayalloca.ll
-
coro-frame-unreachable.ll
-
coro-frame.ll
-
coro-materialize.ll
-
coro-padding.ll
-
coro-param-copy.ll
-
coro-spill-after-phi.ll
-
coro-spill-corobegin.ll
-
coro-split-00.ll
-
coro-split-02.ll
-
coro-split-alloc.ll
-
coro-split-dbg.ll
-
coro-split-eh-00.ll
-
coro-split-eh-01.ll
-
coro-split-eh.ll
-
coro-split-hidden.ll
-
coro-split-musttail.ll
-
coro-split-musttail1.ll
-
no-suspend.ll
-
restart-trigger.ll

Differential D71899

[Coroutines][2/6] New pass manager: coro-split
ClosedPublic

Authored by modocache on Dec 26 2019, 6:24 AM.

Download Raw Diff

Details

Reviewers

GorNishanov
lewissbaker
chandlerc
jdoerfert
junparser
deadalnix
wenlei

Commits

rG7125d66f9969: [Coroutines][2/6] New pass manager: coro-split

Summary

This patch has four dependencies:

The first in this series of patches that implement coroutine passes in the new pass manager: https://reviews.llvm.org/D71898.
A patch that introduces an API for CGSCC passes to add new reference edges to a LazyCallGraph, updateCGAndAnalysisManagerForCGSCCPass: https://reviews.llvm.org/D72025.
A patch that introduces a CallGraphUpdater helper class that is capable of mutating internal LazyCallGraph state in order to insert new function nodes into a specific SCC: https://reviews.llvm.org/D70927.
And finally, a small edge case fix for updating LazyCallGraph that patch 3 above happens to run into: https://reviews.llvm.org/D72226.

This is the second in a series of patches that ports the LLVM coroutines
passes to the new pass manager infrastructure. This patch implements
'coro-split'.

Some notes:

Using the new CGSCC pass manager resulted in IR being printed in the reverse order in some tests. To prevent FileCheck checks from failing due to these reversed orders, this patch splits up test files that test multiple different coroutine functions: specifically coro-alloc-with-param.ll, coro-split-eh.ll, and coro-eh-aware-edge-split.ll.
CoroSplit.cpp contained 2 overloads of splitCoroutine, one of which dispatched to the other based on the coroutine ABI being used (C++20 switch-based versus Swift returned-continuation-based). I found this confusing, especially with the additional branching based on CallGraph vs. LazyCallGraph, so I removed the ABI-checking overload of splitCoroutine.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

modocache created this revision.Dec 26 2019, 6:24 AM

Herald added a reviewer: deadalnix. · View Herald TranscriptDec 26 2019, 6:24 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: hiraditya, EricWF, qcolombet. · View Herald Transcript

Harbormaster completed remote builds in B42954: Diff 235339.Dec 26 2019, 6:26 AM

wenlei added a subscriber: wenlei.Dec 26 2019, 8:00 AM

As a generic comment: Changes like the addition of const or renaming of the pass to be Legacy could be done in NFC commits prior to this one without pre-commit review (IMHO). That makes this patch smaller. The same holds for splitting tests and functions, if there is no functional change.

Edit: Also consider using the child/parent revision feature to ease the navigation between dependent patches.

modocache added a parent revision: D71898: [Coroutines][1/6] New pass manager: coro-early.Dec 26 2019, 9:27 AM

modocache added a child revision: D71900: [Coroutines][3/6] New pass manager: coro-elide.

Also consider using the child/parent revision feature to ease the navigation between dependent patches.

Oh, thanks! I hadn't realized LLVM's Phabricator instance had this feature. I just edited the parent/child revisions as per your suggestion, they're now displayed as a stack.

Changes like the addition of const or renaming of the pass to be Legacy could be done in NFC commits prior to this one without pre-commit review (IMHO)

Good point! I considered doing so with const, but renaming the passes to Legacy sounds good, too. I'll split out these changes and commit them without review.

My understanding is devirt trigger is only a trick used with Legacy PM to run optimizations on coroutine funclets after coro-split. With NewPM's CGSCCUpdateResult infra, can we communicate that change and request passes directly with ModuleToPostOrderCGSCCPassAdaptor, without artificially introducing the indirect call and devirt?

I would imaging any form of outlining would require this communication if optimization is needed for the outlined region. How is that handled for outlining in general with New PM and can we follow that without the devirt trick?

llvm/lib/Transforms/Coroutines/CoroSplit.cpp
1418	Seems `LazyCallGraph` wanted every possible (direct or indirect) references to be modeled upfront, and there's check/asserts for that. This change effectively bypasses that. Is slightly less optimal CGSCC order the only consequence of not following that rule? With coro-split, we do alter the underlying CG, is it safe to populate CG Node without other bookkeeping like informing `CGSCCUpdateResult`? I'm not familiar with the assumptions and constraints about how CG should be updated, but asking because this `populate()` is never called outside of call graph construction, and I'm guessing it's a public function only because `LazyCallGraphTest` needed it..
1499	This doesn't seem necessary as DevirtFn doesn't contain any calls.

In D71899#1799233, @wenlei wrote:

My understanding is devirt trigger is only a trick used with Legacy PM to run optimizations on coroutine funclets after coro-split. With NewPM's CGSCCUpdateResult infra, can we communicate that change and request passes directly with ModuleToPostOrderCGSCCPassAdaptor, without artificially introducing the indirect call and devirt?

The devirt trigger here is to restart CGSCC pipeline to run coro-split again to split the coroutine function. There is no need to introduce devirt trigger in NewPM since we call coro-split pass manually.

I would imaging any form of outlining would require this communication if optimization is needed for the outlined region. How is that handled for outlining in general with New PM and can we follow that without the devirt trick?

However, I do think the SCC should be updated using CGSCCUpdateResult in second run of coro-split for outlined functions

The devirt trigger here is to restart CGSCC pipeline to run coro-split again to split the coroutine function. There is no need to introduce devirt trigger in NewPM since we call coro-split pass manually.

Manually schedule the 2nd coro-split pass is only a workaround before we can trigger CGSCC passes on the split funclet like we did for legacy PM. It does the split without restarting CGSCC passes so it works, but it also leaves performance on the table because the split funclets won't go through many opt passes of CGSCC pipeline. Yes, I agree don't need to introduce devirt trigger with new PM, but that's because I think we can request CGSCC passes on split funclet via other mechanism like CGSCCUpdateResult, not just because 2nd coro-split pass is manually scheduled.

That said, for new PM, this patch implemented devirt trigger insertion only, but not the devirt detection part. I would suggest we have all or nothing for a working mechanism of rerunning CGSCC passes for split funclet. Funclets like resume contains actual code, not just stubs, so IMHO fully optimizing these funclets is an essential part of coroutine support.

In D71899#1799622, @wenlei wrote:

The devirt trigger here is to restart CGSCC pipeline to run coro-split again to split the coroutine function. There is no need to introduce devirt trigger in NewPM since we call coro-split pass manually.

Manually schedule the 2nd coro-split pass is only a workaround before we can trigger CGSCC passes on the split funclet like we did for legacy PM. It does the split without restarting CGSCC passes so it works, but it also leaves performance on the table because the split funclets won't go through many opt passes of CGSCC pipeline. Yes, I agree don't need to introduce devirt trigger with new PM, but that's because I think we can request CGSCC passes on split funclet via other mechanism like CGSCCUpdateResult, not just because 2nd coro-split pass is manually scheduled.

There are two issues here: 1) coro-split needs run at least twice, we do not need CGSCC pipeline at pre-split stage which coro-split pass just works as a function pass 2) request CGSCC passes on split funclet after 2nd running of coro-split, and coroutine optimization such as coro-elide pass also depends on these optimization.
Manually schedule coro-split twice is a workaround for 1), as for 2) the current pipeline of coroutine in this patch set need be changed.

That said, for new PM, this patch implemented devirt trigger insertion only, but not the devirt detection part. I would suggest we have all or nothing for a working mechanism of rerunning CGSCC passes for split funclet. Funclets like resume contains actual code, not just stubs, so IMHO fully optimizing these funclets is an essential part of coroutine support.

I agree.

modocache mentioned this in D71898: [Coroutines][1/6] New pass manager: coro-early.Jan 1 2020, 7:04 PM

As per one of the first review comments, I split out the trivial parts of this patch, such as the 'legacy' pass renaming, and committed them separately. Here's an updated version of this patch without those 'NFC' changes. Next I'll work on the other comments, both the ones left here and on D71903. I'll set the status of this diff to 'Changes Planned' to reflect that. Thanks for the reviews so far!

modocache planned changes to this revision.Jan 1 2020, 7:07 PM

modocache mentioned this in D71900: [Coroutines][3/6] New pass manager: coro-elide.Jan 1 2020, 7:08 PM

modocache mentioned this in D71901: [Coroutines][4/6] New pass manager: coro-cleanup.

Harbormaster completed remote builds in B43131: Diff 235811.Jan 1 2020, 7:13 PM

modocache mentioned this in D71903: [Coroutines][6/6] Clang schedules new passes.Jan 1 2020, 7:25 PM

modocache mentioned this in D72025: [PM][CGSCC] Add a helper to update the call graph from SCC passes.Jan 5 2020, 5:35 AM

modocache mentioned this in D70927: Introduce a CallGraph updater helper class.

Thanks for the reviews! This latest revision makes use of the private LazyCallGraph API exposed via the CallGraphUpdater class from D70927, and the helper function updateCGAndAnalysisManagerForCGSCCPass added in D72025, in order to properly update the call graph. It also uses CGSCCUpdateResult in order to enqueue the second phase of coro-split, instead of relying on function devirtualization. This patch exposed what I think is a bug, so it now also relies on a fix for that behavior, D72226.

modocache added parent revisions: D72226: Add LazyCallGraph API to add function to RefSCC, D70927: Introduce a CallGraph updater helper class.Jan 5 2020, 5:42 AM

Harbormaster completed remote builds in B43308: Diff 236241.Jan 5 2020, 5:43 AM

modocache added a subscriber: wenlei.Jan 5 2020, 6:02 AM

Thanks for making the changes to schedule 2nd coro-split so funclets get fully optimized. A few comments inline, looks good otherwise.

llvm/lib/Transforms/Coroutines/CoroSplit.cpp
1435	Now that we have CallGraphUpdater, can we use `CGUpdater.reanalyzeFunction(N.getFunction())` instead?
1598	nit: I would say "new pass manager" instead of "experimental pass manager".. it's not that experimental at this moment.
1620	Is it necessary to request RefSCC to be reprocessed? I thought `UR.CWorklist.insert(&C)` should be enough..

Thanks for the quick review! I'll send an update in a sec.

llvm/lib/Transforms/Coroutines/CoroSplit.cpp
1435	Yeah, I considered that as well. I opted not to because it's just extra steps: We grab node's function via `LazyCallGraph::Node::getFunction`, just to have `CallGraphUpdater::reanalyzeFunction` call `LazyCallGraph::get` to lookup the node in the graph, or create one if it's not in the graph. But we know this node in the graph, and we already have a reference to it, so why look it up? `CallGraphUpdater::reanalyzeFunction` then looks up the SCC, which we also already have a reference to. As far as I can tell there's no way `CallGraphUpdater::registerOutlinedFunction` above would have split the SCC, so there's no reason to re-lookup an SCC we already have a reference to. And finally, the call to `updateCGAndAnalysisManagerForCGSCCPass`. I figured, why not just do (3)? That being said, I do think in future patches there's room to consolidate the legacy pass and the new pass's interactions with the call graph. My personal preference would be to wait until D70927 is merged to do that, though.
1598	Oh, good catch! I agree, will do.
1620	Will do! I was trying to do whatever seemed most similar to the legacy pass manager's repeater, I wasn't sure whether that was the outer repeating loop over RefSCCs, or the inner SCC loop.

wenlei added inline comments.Jan 5 2020, 4:06 PM

llvm/lib/Transforms/Coroutines/CoroSplit.cpp
1435	yeah, there's a bit of abstraction overhead for the CallGraphUpdater layer over calling `updateCGAndAnalysisManagerForCGSCCPass`. Technically `CallGraphUpdater::reanalyzeFunction` is not really needed anywhere as we can almost always call `updateCGAndAnalysisManagerForCGSCCPass` directly, but I thought communicating through `CallGraphUpdater` is cleaner. I don't have a strong opinion on this though - @jdoerfert may want to weigh in on this.

Addressed review comments.

modocache marked 4 inline comments as done and an inline comment as not done.Jan 5 2020, 4:19 PM

Harbormaster completed remote builds in B43322: Diff 236272.Jan 5 2020, 4:25 PM

modocache mentioned this in D72226: Add LazyCallGraph API to add function to RefSCC.Jan 6 2020, 11:15 AM

To avoid an assert when compiling recursive coroutine functions, use the new API I added in D72226: CallGraphUpdater::registerReferredToOutlinedFunction. This is also a more valid method of updating the call graph. Previously, we were inserting coroutine funclets into the same SCC, despite the fact that they did not form a strongly-connected cycle with the original coroutine function. Now, we insert them as referred-to-by the original coroutine.

Harbormaster completed remote builds in B43403: Diff 236511.Jan 6 2020, 9:21 PM

Rebase onto my update of D72226.

Harbormaster completed remote builds in B43457: Diff 236670.Jan 7 2020, 12:49 PM

Apply clang-format.

Harbormaster completed remote builds in B43461: Diff 236676.Jan 7 2020, 1:27 PM

modocache marked an inline comment as done.Jan 7 2020, 2:19 PM

modocache added inline comments.

llvm/lib/Transforms/Coroutines/CoroSplit.cpp
1620	@wenlei Now that I've updated D72226 to outline the funclets into the same RefSCC (so, not the same SCC) as the coroutine, I think we now may actually want to re-enqueue the entire RefSCC. Thoughts?

wenlei added inline comments.Jan 7 2020, 4:48 PM

llvm/lib/Transforms/Coroutines/CoroSplit.cpp
1620	Re-enqueue is for rerun the 2nd coro-split pass, which I though is orthogonal to the actual outlining where funclet is now created as new SCC in the same RefSCC. So as long as we get to rerun 2nd coro-split on those functions I think we should be fine, and re-enqueue SCC does that. But if we re-enqueue the RefSCC, it's functionally correct as well, just we might rerun pipeline for some extra SCCs in the same RefSCC while it's not really needed..

Use the latest outlining interface from D72226, which was just updated based on the latest version of D70927.

Harbormaster completed remote builds in B45651: Diff 242246.Feb 3 2020, 8:21 PM

modocache added a reviewer: wenlei.Feb 9 2020, 6:39 PM

Rebase past D69930 so the patch applies cleanly to trunk.

Harbormaster completed remote builds in B46469: Diff 244565.Feb 13 2020, 8:16 PM

I found a test that tested the legacy PM opt -coro-split, but not the new PM opt -passes=coro-split. Now it tests both.

Harbormaster completed remote builds in B46520: Diff 244697.Feb 14 2020, 10:03 AM

I'm not sure I'm the right person to review this so do not wait for my OK. I left some drive-by comments below though.

llvm/lib/Transforms/Coroutines/CoroSplit.cpp
19	Drive by: Isn't this accurate anymore? Should we replace it instead of removing it? I like file comments that explain what is going on.
1345	Drive-by: Changes to these functions `splitCoroutine` seem NFC to me. If that is the case they can/should be committed beforehand as such (w/o review).
1435	If you are in a new PM only code region, and the LazyCall graph helpers are sufficient, I don't see any reason not to call them directly. The `CallGraphUpdater` comes in handy if you are in generic code that needs to work in both PMs, or if it has abstractions that make your live easier.

Sorry for the delay, LGTM except the file comment as Johannes pointed out. We've tested this stack of coroutine patches with multiple large services internally, all working as expected, so I'm going to accept this stack as review feedbacks are addressed too.

This revision is now accepted and ready to land.Feb 17 2020, 8:15 PM

Thanks for the reviews! You're right, @jdoerfert, the file header comment is still totally applicable. I put it back in.

Harbormaster completed remote builds in B46681: Diff 245076.Feb 17 2020, 8:29 PM

Closed by commit rG7125d66f9969: [Coroutines][2/6] New pass manager: coro-split (authored by modocache). · Explain WhyFeb 17 2020, 8:38 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

include/

llvm/

Transforms/

Coroutines/

CoroSplit.h

30 lines

lib/

Passes/

PassBuilder.cpp

1 line

PassRegistry.def

1 line

Transforms/

Coroutines/

CoroSplit.cpp

167 lines

test/

Transforms/

Coroutines/

	coro-alloc-with-param-O0.ll
	coro-alloc-with-param.ll

37 lines

	coro-alloc-with-param-O2.ll
	coro-alloc-with-param.ll

40 lines

coro-alloc-with-param.ll

coro-catchswitch.ll

1 line

coro-debug.ll

1 line

coro-eh-aware-edge-split-00.ll

98 lines

coro-eh-aware-edge-split-01.ll

92 lines

coro-eh-aware-edge-split-02.ll

89 lines

coro-eh-aware-edge-split.ll

coro-frame-arrayalloca.ll

1 line

coro-frame-unreachable.ll

1 line

1 line

1 line

1 line

1 line

coro-spill-after-phi.ll

1 line

coro-spill-corobegin.ll

1 line

1 line

1 line

1 line

1 line

	coro-split-eh-00.ll
	coro-split-eh.ll

60 lines

	coro-split-eh-01.ll
	coro-split-eh.ll

76 lines

coro-split-eh.ll

coro-split-hidden.ll

1 line

coro-split-musttail.ll

1 line

coro-split-musttail1.ll

1 line

no-suspend.ll

1 line

restart-trigger.ll

5 lines

Diff 245077

llvm/include/llvm/Transforms/Coroutines/CoroSplit.h

This file was added.

				//===- CoroSplit.h - Converts a coroutine into a state machine -- C++ ---===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// \file
				// This file declares the pass that builds the coroutine frame and outlines
				// the resume and destroy parts of the coroutine into separate functions.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_TRANSFORMS_COROUTINES_COROSPLIT_H
				#define LLVM_TRANSFORMS_COROUTINES_COROSPLIT_H

				#include "llvm/Analysis/CGSCCPassManager.h"
				#include "llvm/Analysis/LazyCallGraph.h"
				#include "llvm/IR/PassManager.h"

				namespace llvm {

				struct CoroSplitPass : PassInfoMixin<CoroSplitPass> {
				PreservedAnalyses run(LazyCallGraph::SCC &C, CGSCCAnalysisManager &AM,
				LazyCallGraph &CG, CGSCCUpdateResult &UR);
				};
				} // end namespace llvm

				#endif // LLVM_TRANSFORMS_COROUTINES_COROSPLIT_H

llvm/lib/Passes/PassBuilder.cpp

	Show First 20 Lines • Show All 62 Lines • ▼ Show 20 Lines
	#include "llvm/IR/Verifier.h"			#include "llvm/IR/Verifier.h"
	#include "llvm/Support/CommandLine.h"			#include "llvm/Support/CommandLine.h"
	#include "llvm/Support/Debug.h"			#include "llvm/Support/Debug.h"
	#include "llvm/Support/FormatVariadic.h"			#include "llvm/Support/FormatVariadic.h"
	#include "llvm/Support/Regex.h"			#include "llvm/Support/Regex.h"
	#include "llvm/Target/TargetMachine.h"			#include "llvm/Target/TargetMachine.h"
	#include "llvm/Transforms/AggressiveInstCombine/AggressiveInstCombine.h"			#include "llvm/Transforms/AggressiveInstCombine/AggressiveInstCombine.h"
	#include "llvm/Transforms/Coroutines/CoroEarly.h"			#include "llvm/Transforms/Coroutines/CoroEarly.h"
				#include "llvm/Transforms/Coroutines/CoroSplit.h"
	#include "llvm/Transforms/IPO/AlwaysInliner.h"			#include "llvm/Transforms/IPO/AlwaysInliner.h"
	#include "llvm/Transforms/IPO/ArgumentPromotion.h"			#include "llvm/Transforms/IPO/ArgumentPromotion.h"
	#include "llvm/Transforms/IPO/Attributor.h"			#include "llvm/Transforms/IPO/Attributor.h"
	#include "llvm/Transforms/IPO/CalledValuePropagation.h"			#include "llvm/Transforms/IPO/CalledValuePropagation.h"
	#include "llvm/Transforms/IPO/ConstantMerge.h"			#include "llvm/Transforms/IPO/ConstantMerge.h"
	#include "llvm/Transforms/IPO/CrossDSOCFI.h"			#include "llvm/Transforms/IPO/CrossDSOCFI.h"
	#include "llvm/Transforms/IPO/DeadArgumentElimination.h"			#include "llvm/Transforms/IPO/DeadArgumentElimination.h"
	#include "llvm/Transforms/IPO/ElimAvailExtern.h"			#include "llvm/Transforms/IPO/ElimAvailExtern.h"
	▲ Show 20 Lines • Show All 2,393 Lines • Show Last 20 Lines

llvm/lib/Passes/PassRegistry.def

	Show First 20 Lines • Show All 105 Lines • ▼ Show 20 Lines
	#define CGSCC_PASS(NAME, CREATE_PASS)			#define CGSCC_PASS(NAME, CREATE_PASS)
	#endif			#endif
	CGSCC_PASS("argpromotion", ArgumentPromotionPass())			CGSCC_PASS("argpromotion", ArgumentPromotionPass())
	CGSCC_PASS("invalidate<all>", InvalidateAllAnalysesPass())			CGSCC_PASS("invalidate<all>", InvalidateAllAnalysesPass())
	CGSCC_PASS("function-attrs", PostOrderFunctionAttrsPass())			CGSCC_PASS("function-attrs", PostOrderFunctionAttrsPass())
	CGSCC_PASS("attributor-cgscc", AttributorCGSCCPass())			CGSCC_PASS("attributor-cgscc", AttributorCGSCCPass())
	CGSCC_PASS("inline", InlinerPass())			CGSCC_PASS("inline", InlinerPass())
	CGSCC_PASS("openmpopt", OpenMPOptPass())			CGSCC_PASS("openmpopt", OpenMPOptPass())
				CGSCC_PASS("coro-split", CoroSplitPass())
	CGSCC_PASS("no-op-cgscc", NoOpCGSCCPass())			CGSCC_PASS("no-op-cgscc", NoOpCGSCCPass())
	#undef CGSCC_PASS			#undef CGSCC_PASS

	#ifndef FUNCTION_ANALYSIS			#ifndef FUNCTION_ANALYSIS
	#define FUNCTION_ANALYSIS(NAME, CREATE_PASS)			#define FUNCTION_ANALYSIS(NAME, CREATE_PASS)
	#endif			#endif
	FUNCTION_ANALYSIS("aa", AAManager())			FUNCTION_ANALYSIS("aa", AAManager())
	FUNCTION_ANALYSIS("assumptions", AssumptionAnalysis())			FUNCTION_ANALYSIS("assumptions", AssumptionAnalysis())
	▲ Show 20 Lines • Show All 217 Lines • Show Last 20 Lines

llvm/lib/Transforms/Coroutines/CoroSplit.cpp

Show All 10 Lines
// We present a coroutine to an LLVM as an ordinary function with suspension		// We present a coroutine to an LLVM as an ordinary function with suspension
// points marked up with intrinsics. We let the optimizer party on the coroutine		// points marked up with intrinsics. We let the optimizer party on the coroutine
// as a single function for as long as possible. Shortly before the coroutine is		// as a single function for as long as possible. Shortly before the coroutine is
// eligible to be inlined into its callers, we split up the coroutine into parts		// eligible to be inlined into its callers, we split up the coroutine into parts
// corresponding to an initial, resume and destroy invocations of the coroutine,		// corresponding to an initial, resume and destroy invocations of the coroutine,
// add them to the current SCC and restart the IPO pipeline to optimize the		// add them to the current SCC and restart the IPO pipeline to optimize the
// coroutine subfunctions we extracted before proceeding to the caller of the		// coroutine subfunctions we extracted before proceeding to the caller of the
// coroutine.		// coroutine.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
jdoerfertUnsubmitted Not Done Reply Inline Actions Drive by: Isn't this accurate anymore? Should we replace it instead of removing it? I like file comments that explain what is going on. jdoerfert: Drive by: Isn't this accurate anymore? Should we replace it instead of removing it? I like file…

		#include "llvm/Transforms/Coroutines/CoroSplit.h"
#include "CoroInstr.h"		#include "CoroInstr.h"
#include "CoroInternal.h"		#include "CoroInternal.h"
#include "llvm/ADT/DenseMap.h"		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/Twine.h"		#include "llvm/ADT/Twine.h"
#include "llvm/Analysis/CallGraph.h"		#include "llvm/Analysis/CallGraph.h"
Show All 25 Lines
#include "llvm/InitializePasses.h"		#include "llvm/InitializePasses.h"
#include "llvm/Pass.h"		#include "llvm/Pass.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/PrettyStackTrace.h"		#include "llvm/Support/PrettyStackTrace.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include "llvm/Transforms/Scalar.h"		#include "llvm/Transforms/Scalar.h"
#include "llvm/Transforms/Utils/BasicBlockUtils.h"		#include "llvm/Transforms/Utils/BasicBlockUtils.h"
		#include "llvm/Transforms/Utils/CallGraphUpdater.h"
#include "llvm/Transforms/Utils/Cloning.h"		#include "llvm/Transforms/Utils/Cloning.h"
#include "llvm/Transforms/Utils/Local.h"		#include "llvm/Transforms/Utils/Local.h"
#include "llvm/Transforms/Utils/ValueMapper.h"		#include "llvm/Transforms/Utils/ValueMapper.h"
#include <cassert>		#include <cassert>
#include <cstddef>		#include <cstddef>
#include <cstdint>		#include <cstdint>
#include <initializer_list>		#include <initializer_list>
#include <iterator>		#include <iterator>
▲ Show 20 Lines • Show All 1,267 Lines • ▼ Show 20 Lines	public:
void print(raw_ostream &OS) const override {		void print(raw_ostream &OS) const override {
OS << "While splitting coroutine ";		OS << "While splitting coroutine ";
F.printAsOperand(OS, /print type/ false, F.getParent());		F.printAsOperand(OS, /print type/ false, F.getParent());
OS << "\n";		OS << "\n";
}		}
};		};
}		}

static void splitCoroutine(Function &F, coro::Shape &Shape,		static coro::Shape splitCoroutine(Function &F,
jdoerfertUnsubmitted Not Done Reply Inline Actions Drive-by: Changes to these functions `splitCoroutine` seem NFC to me. If that is the case they can/should be committed beforehand as such (w/o review). jdoerfert: Drive-by: Changes to these functions `splitCoroutine` seem NFC to me. If that is the case they…
SmallVectorImpl<Function *> &Clones) {		SmallVectorImpl<Function *> &Clones) {
switch (Shape.ABI) {
case coro::ABI::Switch:
return splitSwitchCoroutine(F, Shape, Clones);
case coro::ABI::Retcon:
case coro::ABI::RetconOnce:
return splitRetconCoroutine(F, Shape, Clones);
}
llvm_unreachable("bad ABI kind");
}

static void splitCoroutine(Function &F, CallGraph &CG, CallGraphSCC &SCC) {
PrettyStackTraceFunction prettyStackTrace(F);		PrettyStackTraceFunction prettyStackTrace(F);

// The suspend-crossing algorithm in buildCoroutineFrame get tripped		// The suspend-crossing algorithm in buildCoroutineFrame get tripped
// up by uses in unreachable blocks, so remove them as a first pass.		// up by uses in unreachable blocks, so remove them as a first pass.
removeUnreachableBlocks(F);		removeUnreachableBlocks(F);

coro::Shape Shape(F);		coro::Shape Shape(F);
if (!Shape.CoroBegin)		if (!Shape.CoroBegin)
return;		return Shape;

simplifySuspendPoints(Shape);		simplifySuspendPoints(Shape);
buildCoroutineFrame(F, Shape);		buildCoroutineFrame(F, Shape);
replaceFrameSize(Shape);		replaceFrameSize(Shape);

SmallVector<Function*, 4> Clones;

// If there are no suspend points, no split required, just remove		// If there are no suspend points, no split required, just remove
// the allocation and deallocation blocks, they are not needed.		// the allocation and deallocation blocks, they are not needed.
if (Shape.CoroSuspends.empty()) {		if (Shape.CoroSuspends.empty()) {
handleNoSuspendCoroutine(Shape);		handleNoSuspendCoroutine(Shape);
} else {		} else {
splitCoroutine(F, Shape, Clones);		switch (Shape.ABI) {
		case coro::ABI::Switch:
		splitSwitchCoroutine(F, Shape, Clones);
		break;
		case coro::ABI::Retcon:
		case coro::ABI::RetconOnce:
		splitRetconCoroutine(F, Shape, Clones);
		break;
		}
}		}

// Replace all the swifterror operations in the original function.		// Replace all the swifterror operations in the original function.
// This invalidates SwiftErrorOps in the Shape.		// This invalidates SwiftErrorOps in the Shape.
replaceSwiftErrorOps(F, Shape, nullptr);		replaceSwiftErrorOps(F, Shape, nullptr);

		return Shape;
		}

		static void
		updateCallGraphAfterCoroutineSplit(Function &F, const coro::Shape &Shape,
		const SmallVectorImpl<Function *> &Clones,
		CallGraph &CG, CallGraphSCC &SCC) {
		if (!Shape.CoroBegin)
		return;

removeCoroEnds(Shape, &CG);		removeCoroEnds(Shape, &CG);
postSplitCleanup(F);		postSplitCleanup(F);

// Update call graph and add the functions we created to the SCC.		// Update call graph and add the functions we created to the SCC.
coro::updateCallGraph(F, Clones, CG, SCC);		coro::updateCallGraph(F, Clones, CG, SCC);
}		}

		static void updateCallGraphAfterCoroutineSplit(
		LazyCallGraph::Node &N, const coro::Shape &Shape,
		const SmallVectorImpl<Function *> &Clones, LazyCallGraph::SCC &C,
		LazyCallGraph &CG, CGSCCAnalysisManager &AM, CGSCCUpdateResult &UR) {
		if (!Shape.CoroBegin)
		return;

		for (llvm::CoroEndInst *End : Shape.CoroEnds) {
		auto &Context = End->getContext();
		End->replaceAllUsesWith(ConstantInt::getFalse(Context));
		End->eraseFromParent();
		}

		postSplitCleanup(N.getFunction());

		// To insert the newly created coroutine funclets 'f.resume', 'f.destroy', and
		// 'f.cleanup' into the same SCC as the coroutine 'f' they were outlined from,
		// we make use of the CallGraphUpdater class, which can modify the internal
		// state of the LazyCallGraph.
		wenleiUnsubmitted Done Reply Inline Actions Seems `LazyCallGraph` wanted every possible (direct or indirect) references to be modeled upfront, and there's check/asserts for that. This change effectively bypasses that. Is slightly less optimal CGSCC order the only consequence of not following that rule? With coro-split, we do alter the underlying CG, is it safe to populate CG Node without other bookkeeping like informing `CGSCCUpdateResult`? I'm not familiar with the assumptions and constraints about how CG should be updated, but asking because this `populate()` is never called outside of call graph construction, and I'm guessing it's a public function only because `LazyCallGraphTest` needed it.. wenlei: Seems `LazyCallGraph` wanted every possible (direct or indirect) references to be modeled…
		for (Function *Clone : Clones)
		CG.addNewFunctionIntoRefSCC(*Clone, C.getOuterRefSCC());

		// We've inserted instructions into coroutine 'f' that reference the three new
		// coroutine funclets. We must now update the call graph so that reference
		// edges between 'f' and its funclets are added to it. LazyCallGraph only
		// allows CGSCC passes to insert "trivial" reference edges. We've ensured
		// above, by inserting the funclets into the same SCC as the corutine, that
		// the edges are trivial.
		//
		// N.B.: If we didn't update the call graph here, a CGSCCToFunctionPassAdaptor
		// later in this CGSCC pass pipeline may be run, triggering a call graph
		// update of its own. Function passes run by the adaptor are not permitted to
		// add new edges of any kind to the graph, and the new edges inserted by this
		// pass would be misattributed to that unrelated function pass.
		updateCGAndAnalysisManagerForCGSCCPass(CG, C, N, AM, UR);
		}
		wenleiUnsubmitted Not Done Reply Inline Actions Now that we have CallGraphUpdater, can we use `CGUpdater.reanalyzeFunction(N.getFunction())` instead? wenlei: Now that we have CallGraphUpdater, can we use `CGUpdater.reanalyzeFunction(N.getFunction())`…
		modocacheAuthorUnsubmitted Not Done Reply Inline Actions Yeah, I considered that as well. I opted not to because it's just extra steps: We grab node's function via `LazyCallGraph::Node::getFunction`, just to have `CallGraphUpdater::reanalyzeFunction` call `LazyCallGraph::get` to lookup the node in the graph, or create one if it's not in the graph. But we know this node in the graph, and we already have a reference to it, so why look it up? `CallGraphUpdater::reanalyzeFunction` then looks up the SCC, which we also already have a reference to. As far as I can tell there's no way `CallGraphUpdater::registerOutlinedFunction` above would have split the SCC, so there's no reason to re-lookup an SCC we already have a reference to. And finally, the call to `updateCGAndAnalysisManagerForCGSCCPass`. I figured, why not just do (3)? That being said, I do think in future patches there's room to consolidate the legacy pass and the new pass's interactions with the call graph. My personal preference would be to wait until D70927 is merged to do that, though. modocache: Yeah, I considered that as well. I opted not to because it's just extra steps: 1. We grab…
		wenleiUnsubmitted Not Done Reply Inline Actions yeah, there's a bit of abstraction overhead for the CallGraphUpdater layer over calling `updateCGAndAnalysisManagerForCGSCCPass`. Technically `CallGraphUpdater::reanalyzeFunction` is not really needed anywhere as we can almost always call `updateCGAndAnalysisManagerForCGSCCPass` directly, but I thought communicating through `CallGraphUpdater` is cleaner. I don't have a strong opinion on this though - @jdoerfert may want to weigh in on this. wenlei: yeah, there's a bit of abstraction overhead for the CallGraphUpdater layer over calling…
		jdoerfertUnsubmitted Not Done Reply Inline Actions If you are in a new PM only code region, and the LazyCall graph helpers are sufficient, I don't see any reason not to call them directly. The `CallGraphUpdater` comes in handy if you are in generic code that needs to work in both PMs, or if it has abstractions that make your live easier. jdoerfert: If you are in a new PM only code region, and the LazyCall graph helpers are sufficient, I don't…

// When we see the coroutine the first time, we insert an indirect call to a		// When we see the coroutine the first time, we insert an indirect call to a
// devirt trigger function and mark the coroutine that it is now ready for		// devirt trigger function and mark the coroutine that it is now ready for
// split.		// split.
static void prepareForSplit(Function &F, CallGraph &CG) {		static void prepareForSplit(Function &F, CallGraph &CG) {
Module &M = *F.getParent();		Module &M = *F.getParent();
LLVMContext &Context = F.getContext();		LLVMContext &Context = F.getContext();
#ifndef NDEBUG		#ifndef NDEBUG
Function *DevirtFn = M.getFunction(CORO_DEVIRT_TRIGGER_FN);		Function *DevirtFn = M.getFunction(CORO_DEVIRT_TRIGGER_FN);
▲ Show 20 Lines • Show All 46 Lines • ▼ Show 20 Lines	static void createDevirtTriggerFunc(CallGraph &CG, CallGraphSCC &SCC) {
SCC.initialize(Nodes);		SCC.initialize(Nodes);
}		}

/// Replace a call to llvm.coro.prepare.retcon.		/// Replace a call to llvm.coro.prepare.retcon.
static void replacePrepare(CallInst *Prepare, CallGraph &CG) {		static void replacePrepare(CallInst *Prepare, CallGraph &CG) {
auto CastFn = Prepare->getArgOperand(0); // as an i8*		auto CastFn = Prepare->getArgOperand(0); // as an i8*
auto Fn = CastFn->stripPointerCasts(); // as its original type		auto Fn = CastFn->stripPointerCasts(); // as its original type

// Find call graph nodes for the preparation.		// Find call graph nodes for the preparation.
		wenleiUnsubmitted Done Reply Inline Actions This doesn't seem necessary as DevirtFn doesn't contain any calls. wenlei: This doesn't seem necessary as DevirtFn doesn't contain any calls.
CallGraphNode PrepareUserNode = nullptr, FnNode = nullptr;		CallGraphNode PrepareUserNode = nullptr, FnNode = nullptr;
if (auto ConcreteFn = dyn_cast<Function>(Fn)) {		if (auto ConcreteFn = dyn_cast<Function>(Fn)) {
PrepareUserNode = CG[Prepare->getFunction()];		PrepareUserNode = CG[Prepare->getFunction()];
FnNode = CG[ConcreteFn];		FnNode = CG[ConcreteFn];
}		}

// Attempt to peephole this pattern:		// Attempt to peephole this pattern:
// %0 = bitcast [[TYPE]] @some_function to i8*		// %0 = bitcast [[TYPE]] @some_function to i8*
▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	for (auto PI = PrepareFn->use_begin(), PE = PrepareFn->use_end();
auto *Prepare = cast<CallInst>((PI++)->getUser());		auto *Prepare = cast<CallInst>((PI++)->getUser());
replacePrepare(Prepare, CG);		replacePrepare(Prepare, CG);
Changed = true;		Changed = true;
}		}

return Changed;		return Changed;
}		}

//===----------------------------------------------------------------------===//		static bool declaresCoroSplitIntrinsics(const Module &M) {
// Top Level Driver		return coro::declaresIntrinsics(
//===----------------------------------------------------------------------===//		M, {"llvm.coro.begin", "llvm.coro.prepare.retcon"});
		}

		PreservedAnalyses CoroSplitPass::run(LazyCallGraph::SCC &C,
		CGSCCAnalysisManager &AM,
		LazyCallGraph &CG, CGSCCUpdateResult &UR) {
		// NB: One invariant of a valid LazyCallGraph::SCC is that it must contain a
		// non-zero number of nodes, so we assume that here and grab the first
		// node's function's module.
		Module &M = *C.begin()->getFunction().getParent();
		if (!declaresCoroSplitIntrinsics(M))
		return PreservedAnalyses::all();

		// Check for uses of llvm.coro.prepare.retcon.
		const auto *PrepareFn = M.getFunction("llvm.coro.prepare.retcon");
		if (PrepareFn && PrepareFn->use_empty())
		PrepareFn = nullptr;

		// Find coroutines for processing.
		SmallVector<LazyCallGraph::Node *, 4> Coroutines;
		for (LazyCallGraph::Node &N : C)
		if (N.getFunction().hasFnAttribute(CORO_PRESPLIT_ATTR))
		Coroutines.push_back(&N);

		if (Coroutines.empty() && !PrepareFn)
		return PreservedAnalyses::all();

		if (Coroutines.empty())
		llvm_unreachable("new pass manager cannot yet handle "
		wenleiUnsubmitted Done Reply Inline Actions nit: I would say "new pass manager" instead of "experimental pass manager".. it's not that experimental at this moment. wenlei: nit: I would say "new pass manager" instead of "experimental pass manager".. it's not that…
		modocacheAuthorUnsubmitted Done Reply Inline Actions Oh, good catch! I agree, will do. modocache: Oh, good catch! I agree, will do.
		"'llvm.coro.prepare.retcon'");

		// Split all the coroutines.
		for (LazyCallGraph::Node *N : Coroutines) {
		Function &F = N->getFunction();
		Attribute Attr = F.getFnAttribute(CORO_PRESPLIT_ATTR);
		StringRef Value = Attr.getValueAsString();
		LLVM_DEBUG(dbgs() << "CoroSplit: Processing coroutine '" << F.getName()
		<< "' state: " << Value << "\n");
		if (Value == UNPREPARED_FOR_SPLIT) {
		// Enqueue a second iteration of the CGSCC pipeline.
		// N.B.:
		// The CoroSplitLegacy pass "triggers" a restart of the CGSCC pass
		// pipeline by inserting an indirect function call that the
		// CoroElideLegacy pass then replaces with a direct function call. The
		// legacy CGSCC pipeline's implicit behavior was as if wrapped in the new
		// pass manager abstraction DevirtSCCRepeatedPass.
		//
		// This pass does not need to "trigger" another run of the pipeline.
		// Instead, it simply enqueues the same RefSCC onto the pipeline's
		// worklist.
		UR.CWorklist.insert(&C);
		wenleiUnsubmitted Done Reply Inline Actions Is it necessary to request RefSCC to be reprocessed? I thought `UR.CWorklist.insert(&C)` should be enough.. wenlei: Is it necessary to request RefSCC to be reprocessed? I thought `UR.CWorklist.insert(&C)` should…
		modocacheAuthorUnsubmitted Done Reply Inline Actions Will do! I was trying to do whatever seemed most similar to the legacy pass manager's repeater, I wasn't sure whether that was the outer repeating loop over RefSCCs, or the inner SCC loop. modocache: Will do! I was trying to do whatever seemed most similar to the legacy pass manager's repeater…
		modocacheAuthorUnsubmitted Done Reply Inline Actions @wenlei Now that I've updated D72226 to outline the funclets into the same RefSCC (so, not the same SCC) as the coroutine, I think we now may actually want to re-enqueue the entire RefSCC. Thoughts? modocache: @wenlei Now that I've updated D72226 to outline the funclets into the same RefSCC (so, not…
		wenleiUnsubmitted Not Done Reply Inline Actions Re-enqueue is for rerun the 2nd coro-split pass, which I though is orthogonal to the actual outlining where funclet is now created as new SCC in the same RefSCC. So as long as we get to rerun 2nd coro-split on those functions I think we should be fine, and re-enqueue SCC does that. But if we re-enqueue the RefSCC, it's functionally correct as well, just we might rerun pipeline for some extra SCCs in the same RefSCC while it's not really needed.. wenlei: Re-enqueue is for rerun the 2nd coro-split pass, which I though is orthogonal to the actual…
		F.addFnAttr(CORO_PRESPLIT_ATTR, PREPARED_FOR_SPLIT);
		continue;
		}
		F.removeFnAttr(CORO_PRESPLIT_ATTR);

		SmallVector<Function *, 4> Clones;
		const coro::Shape Shape = splitCoroutine(F, Clones);
		updateCallGraphAfterCoroutineSplit(*N, Shape, Clones, C, CG, AM, UR);
		}

		if (PrepareFn)
		llvm_unreachable("new pass manager cannot yet handle "
		"'llvm.coro.prepare.retcon'");

		return PreservedAnalyses::none();
		}

namespace {		namespace {

		// We present a coroutine to LLVM as an ordinary function with suspension
		// points marked up with intrinsics. We let the optimizer party on the coroutine
		// as a single function for as long as possible. Shortly before the coroutine is
		// eligible to be inlined into its callers, we split up the coroutine into parts
		// corresponding to initial, resume and destroy invocations of the coroutine,
		// add them to the current SCC and restart the IPO pipeline to optimize the
		// coroutine subfunctions we extracted before proceeding to the caller of the
		// coroutine.
struct CoroSplitLegacy : public CallGraphSCCPass {		struct CoroSplitLegacy : public CallGraphSCCPass {
static char ID; // Pass identification, replacement for typeid		static char ID; // Pass identification, replacement for typeid

CoroSplitLegacy() : CallGraphSCCPass(ID) {		CoroSplitLegacy() : CallGraphSCCPass(ID) {
initializeCoroSplitLegacyPass(*PassRegistry::getPassRegistry());		initializeCoroSplitLegacyPass(*PassRegistry::getPassRegistry());
}		}

bool Run = false;		bool Run = false;

// A coroutine is identified by the presence of coro.begin intrinsic, if		// A coroutine is identified by the presence of coro.begin intrinsic, if
// we don't have any, this pass has nothing to do.		// we don't have any, this pass has nothing to do.
bool doInitialization(CallGraph &CG) override {		bool doInitialization(CallGraph &CG) override {
Run = coro::declaresIntrinsics(CG.getModule(),		Run = declaresCoroSplitIntrinsics(CG.getModule());
{"llvm.coro.begin",
"llvm.coro.prepare.retcon"});
return CallGraphSCCPass::doInitialization(CG);		return CallGraphSCCPass::doInitialization(CG);
}		}

bool runOnSCC(CallGraphSCC &SCC) override {		bool runOnSCC(CallGraphSCC &SCC) override {
if (!Run)		if (!Run)
return false;		return false;

// Check for uses of llvm.coro.prepare.retcon.		// Check for uses of llvm.coro.prepare.retcon.
Show All 25 Lines	for (Function *F : Coroutines) {
StringRef Value = Attr.getValueAsString();		StringRef Value = Attr.getValueAsString();
LLVM_DEBUG(dbgs() << "CoroSplit: Processing coroutine '" << F->getName()		LLVM_DEBUG(dbgs() << "CoroSplit: Processing coroutine '" << F->getName()
<< "' state: " << Value << "\n");		<< "' state: " << Value << "\n");
if (Value == UNPREPARED_FOR_SPLIT) {		if (Value == UNPREPARED_FOR_SPLIT) {
prepareForSplit(*F, CG);		prepareForSplit(*F, CG);
continue;		continue;
}		}
F->removeFnAttr(CORO_PRESPLIT_ATTR);		F->removeFnAttr(CORO_PRESPLIT_ATTR);
splitCoroutine(*F, CG, SCC);
		SmallVector<Function *, 4> Clones;
		const coro::Shape Shape = splitCoroutine(*F, Clones);
		updateCallGraphAfterCoroutineSplit(*F, Shape, Clones, CG, SCC);
}		}

if (PrepareFn)		if (PrepareFn)
replaceAllPrepares(PrepareFn, CG);		replaceAllPrepares(PrepareFn, CG);

return true;		return true;
}		}

Show All 22 Lines

llvm/test/Transforms/Coroutines/coro-alloc-with-param-O0.ll

This file was copied from llvm/test/Transforms/Coroutines/coro-alloc-with-param.ll.

; Check that we can handle the case when both alloc function and		; Check that we can handle the case when both alloc function and
; the user body consume the same argument.		; the user body consume the same argument.
; RUN: opt < %s -coro-split -S \| FileCheck %s		; RUN: opt < %s -coro-split -S \| FileCheck %s
		; RUN: opt < %s -passes=coro-split -S \| FileCheck %s
; using this directly (as it would happen under -O2)
define i8* @f_direct(i64 %this) "coroutine.presplit"="1" {
entry:
%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
%size = call i32 @llvm.coro.size.i32()
%alloc = call i8* @myAlloc(i64 %this, i32 %size)
%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)
%0 = call i8 @llvm.coro.suspend(token none, i1 false)
switch i8 %0, label %suspend [i8 0, label %resume
i8 1, label %cleanup]
resume:
call void @print2(i64 %this)
br label %cleanup

cleanup:
%mem = call i8* @llvm.coro.free(token %id, i8* %hdl)
call void @free(i8* %mem)
br label %suspend
suspend:
call i1 @llvm.coro.end(i8* %hdl, i1 0)
ret i8* %hdl
}

; using copy of this (as it would happen under -O0)		; using copy of this (as it would happen under -O0)
define i8* @f_copy(i64 %this_arg) "coroutine.presplit"="1" {		define i8* @f_copy(i64 %this_arg) "coroutine.presplit"="1" {
entry:		entry:
%this.addr = alloca i64		%this.addr = alloca i64
store i64 %this_arg, i64* %this.addr		store i64 %this_arg, i64* %this.addr
%this = load i64, i64* %this.addr		%this = load i64, i64* %this.addr
%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)		%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
Show All 12 Lines	cleanup:
call void @free(i8* %mem)		call void @free(i8* %mem)
br label %suspend		br label %suspend
suspend:		suspend:
call i1 @llvm.coro.end(i8* %hdl, i1 0)		call i1 @llvm.coro.end(i8* %hdl, i1 0)
ret i8* %hdl		ret i8* %hdl
}		}

; See if %this was added to the frame		; See if %this was added to the frame
; CHECK: %f_direct.Frame = type { void (%f_direct.Frame), void (%f_direct.Frame), i1, i1, i64 }
; CHECK: %f_copy.Frame = type { void (%f_copy.Frame), void (%f_copy.Frame), i1, i1, i64 }		; CHECK: %f_copy.Frame = type { void (%f_copy.Frame), void (%f_copy.Frame), i1, i1, i64 }

; See that %this is spilled into the frame		; See that %this is spilled into the frame
; CHECK-LABEL: define i8* @f_direct(i64 %this)
; CHECK: %this.spill.addr = getelementptr inbounds %f_direct.Frame, %f_direct.Frame* %FramePtr, i32 0, i32 4
; CHECK: store i64 %this, i64* %this.spill.addr
; CHECK: ret i8* %hdl

; See that %this is spilled into the frame
; CHECK-LABEL: define i8* @f_copy(i64 %this_arg)		; CHECK-LABEL: define i8* @f_copy(i64 %this_arg)
; CHECK: %this.spill.addr = getelementptr inbounds %f_copy.Frame, %f_copy.Frame* %FramePtr, i32 0, i32 4		; CHECK: %this.spill.addr = getelementptr inbounds %f_copy.Frame, %f_copy.Frame* %FramePtr, i32 0, i32 4
; CHECK: store i64 %this_arg, i64* %this.spill.addr		; CHECK: store i64 %this_arg, i64* %this.spill.addr
; CHECK: ret i8* %hdl		; CHECK: ret i8* %hdl

; See that %this was loaded from the frame		; See that %this was loaded from the frame
; CHECK-LABEL: @f_direct.resume(
; CHECK: %this.reload = load i64, i64* %this.reload.addr
; CHECK: call void @print2(i64 %this.reload)
; CHECK: ret void

; See that %this was loaded from the frame
; CHECK-LABEL: @f_copy.resume(		; CHECK-LABEL: @f_copy.resume(
; CHECK: %this.reload = load i64, i64* %this.reload.addr		; CHECK: %this.reload = load i64, i64* %this.reload.addr
; CHECK: call void @print2(i64 %this.reload)		; CHECK: call void @print2(i64 %this.reload)
; CHECK: ret void		; CHECK: ret void

declare i8* @llvm.coro.free(token, i8*)		declare i8* @llvm.coro.free(token, i8*)
declare i32 @llvm.coro.size.i32()		declare i32 @llvm.coro.size.i32()
declare i8 @llvm.coro.suspend(token, i1)		declare i8 @llvm.coro.suspend(token, i1)
Show All 12 Lines

llvm/test/Transforms/Coroutines/coro-alloc-with-param-O2.ll

This file was moved from llvm/test/Transforms/Coroutines/coro-alloc-with-param.ll.

	; Check that we can handle the case when both alloc function and			; Check that we can handle the case when both alloc function and
	; the user body consume the same argument.			; the user body consume the same argument.
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	; using this directly (as it would happen under -O2)			; using this directly (as it would happen under -O2)
	define i8* @f_direct(i64 %this) "coroutine.presplit"="1" {			define i8* @f_direct(i64 %this) "coroutine.presplit"="1" {
	entry:			entry:
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%size = call i32 @llvm.coro.size.i32()			%size = call i32 @llvm.coro.size.i32()
	%alloc = call i8* @myAlloc(i64 %this, i32 %size)			%alloc = call i8* @myAlloc(i64 %this, i32 %size)
	%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)			%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)
	%0 = call i8 @llvm.coro.suspend(token none, i1 false)			%0 = call i8 @llvm.coro.suspend(token none, i1 false)
	switch i8 %0, label %suspend [i8 0, label %resume			switch i8 %0, label %suspend [i8 0, label %resume
	i8 1, label %cleanup]			i8 1, label %cleanup]
	resume:			resume:
	call void @print2(i64 %this)			call void @print2(i64 %this)
	br label %cleanup			br label %cleanup

	cleanup:			cleanup:
	%mem = call i8* @llvm.coro.free(token %id, i8* %hdl)			%mem = call i8* @llvm.coro.free(token %id, i8* %hdl)
	call void @free(i8* %mem)			call void @free(i8* %mem)
	br label %suspend			br label %suspend
	suspend:			suspend:
	call i1 @llvm.coro.end(i8* %hdl, i1 0)			call i1 @llvm.coro.end(i8* %hdl, i1 0)
	ret i8* %hdl			ret i8* %hdl
	}			}

	; using copy of this (as it would happen under -O0)
	define i8* @f_copy(i64 %this_arg) "coroutine.presplit"="1" {
	entry:
	%this.addr = alloca i64
	store i64 %this_arg, i64* %this.addr
	%this = load i64, i64* %this.addr
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%size = call i32 @llvm.coro.size.i32()
	%alloc = call i8* @myAlloc(i64 %this, i32 %size)
	%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)
	%0 = call i8 @llvm.coro.suspend(token none, i1 false)
	switch i8 %0, label %suspend [i8 0, label %resume
	i8 1, label %cleanup]
	resume:
	call void @print2(i64 %this)
	br label %cleanup

	cleanup:
	%mem = call i8* @llvm.coro.free(token %id, i8* %hdl)
	call void @free(i8* %mem)
	br label %suspend
	suspend:
	call i1 @llvm.coro.end(i8* %hdl, i1 0)
	ret i8* %hdl
	}

	; See if %this was added to the frame			; See if %this was added to the frame
	; CHECK: %f_direct.Frame = type { void (%f_direct.Frame), void (%f_direct.Frame), i1, i1, i64 }			; CHECK: %f_direct.Frame = type { void (%f_direct.Frame), void (%f_direct.Frame), i1, i1, i64 }
	; CHECK: %f_copy.Frame = type { void (%f_copy.Frame), void (%f_copy.Frame), i1, i1, i64 }

	; See that %this is spilled into the frame			; See that %this is spilled into the frame
	; CHECK-LABEL: define i8* @f_direct(i64 %this)			; CHECK-LABEL: define i8* @f_direct(i64 %this)
	; CHECK: %this.spill.addr = getelementptr inbounds %f_direct.Frame, %f_direct.Frame* %FramePtr, i32 0, i32 4			; CHECK: %this.spill.addr = getelementptr inbounds %f_direct.Frame, %f_direct.Frame* %FramePtr, i32 0, i32 4
	; CHECK: store i64 %this, i64* %this.spill.addr			; CHECK: store i64 %this, i64* %this.spill.addr
	; CHECK: ret i8* %hdl			; CHECK: ret i8* %hdl

	; See that %this is spilled into the frame
	; CHECK-LABEL: define i8* @f_copy(i64 %this_arg)
	; CHECK: %this.spill.addr = getelementptr inbounds %f_copy.Frame, %f_copy.Frame* %FramePtr, i32 0, i32 4
	; CHECK: store i64 %this_arg, i64* %this.spill.addr
	; CHECK: ret i8* %hdl

	; See that %this was loaded from the frame			; See that %this was loaded from the frame
	; CHECK-LABEL: @f_direct.resume(			; CHECK-LABEL: @f_direct.resume(
	; CHECK: %this.reload = load i64, i64* %this.reload.addr			; CHECK: %this.reload = load i64, i64* %this.reload.addr
	; CHECK: call void @print2(i64 %this.reload)			; CHECK: call void @print2(i64 %this.reload)
	; CHECK: ret void			; CHECK: ret void

	; See that %this was loaded from the frame
	; CHECK-LABEL: @f_copy.resume(
	; CHECK: %this.reload = load i64, i64* %this.reload.addr
	; CHECK: call void @print2(i64 %this.reload)
	; CHECK: ret void

	declare i8* @llvm.coro.free(token, i8*)			declare i8* @llvm.coro.free(token, i8*)
	declare i32 @llvm.coro.size.i32()			declare i32 @llvm.coro.size.i32()
	declare i8 @llvm.coro.suspend(token, i1)			declare i8 @llvm.coro.suspend(token, i1)
	declare void @llvm.coro.resume(i8*)			declare void @llvm.coro.resume(i8*)
	declare void @llvm.coro.destroy(i8*)			declare void @llvm.coro.destroy(i8*)

	declare token @llvm.coro.id(i32, i8, i8, i8*)			declare token @llvm.coro.id(i32, i8, i8, i8*)
	declare i1 @llvm.coro.alloc(token)			declare i1 @llvm.coro.alloc(token)
	declare i8* @llvm.coro.begin(token, i8*)			declare i8* @llvm.coro.begin(token, i8*)
	declare i1 @llvm.coro.end(i8*, i1)			declare i1 @llvm.coro.end(i8*, i1)

	declare noalias i8* @myAlloc(i64, i32)			declare noalias i8* @myAlloc(i64, i32)
	declare double @print(double)			declare double @print(double)
	declare void @print2(i64)			declare void @print2(i64)
	declare void @free(i8*)			declare void @free(i8*)

llvm/test/Transforms/Coroutines/coro-alloc-with-param.ll

This file was deleted after being copied to llvm/test/Transforms/Coroutines/coro-alloc-with-param-O0.ll, llvm/test/Transforms/Coroutines/coro-alloc-with-param-O2.ll.

The contents of this file were not changed.

llvm/test/Transforms/Coroutines/coro-catchswitch.ll

	; Verifies that we can insert the spill for a PHI preceding the catchswitch			; Verifies that we can insert the spill for a PHI preceding the catchswitch
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"			target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"
	target triple = "i686-pc-windows-msvc"			target triple = "i686-pc-windows-msvc"

	; CHECK-LABEL: define void @f(			; CHECK-LABEL: define void @f(
	define void @f(i1 %cond) "coroutine.presplit"="1" personality i32 0 {			define void @f(i1 %cond) "coroutine.presplit"="1" personality i32 0 {
	entry:			entry:
	%id = call token @llvm.coro.id(i32 8, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 8, i8* null, i8* null, i8* null)
	▲ Show 20 Lines • Show All 78 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-debug.ll

	; Tests that debug information is sane after coro-split			; Tests that debug information is sane after coro-split
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	source_filename = "simple-repro.c"			source_filename = "simple-repro.c"
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; Function Attrs: noinline nounwind			; Function Attrs: noinline nounwind
	define i8* @f(i32 %x) #0 !dbg !6 {			define i8* @f(i32 %x) #0 !dbg !6 {
	entry:			entry:
	▲ Show 20 Lines • Show All 132 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-eh-aware-edge-split-00.ll

This file was added.

				; Check that we can handle edge splits leading into a landingpad
				; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				; CHECK-LABEL: define internal fastcc void @f.resume(
				define void @f(i1 %cond) "coroutine.presplit"="1" personality i32 0 {
				entry:
				%id = call token @llvm.coro.id(i32 16, i8* null, i8* null, i8* null)
				%size = tail call i64 @llvm.coro.size.i64()
				%alloc = call i8* @malloc(i64 %size)
				%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)
				%sp = call i8 @llvm.coro.suspend(token none, i1 false)
				switch i8 %sp, label %coro.ret [
				i8 0, label %resume
				i8 1, label %cleanup
				]

				resume:
				br i1 %cond, label %invoke1, label %invoke2

				invoke1:
				invoke void @may_throw1()
				to label %unreach unwind label %pad.with.phi
				invoke2:
				invoke void @may_throw2()
				to label %unreach unwind label %pad.with.phi

				; Verify that we cloned landing pad on every edge and inserted a reload of the spilled value

				; CHECK: pad.with.phi.from.invoke2:
				; CHECK: %0 = landingpad { i8*, i32 }
				; CHECK: catch i8* null
				; CHECK: br label %pad.with.phi

				; CHECK: pad.with.phi.from.invoke1:
				; CHECK: %1 = landingpad { i8*, i32 }
				; CHECK: catch i8* null
				; CHECK: br label %pad.with.phi

				; CHECK: pad.with.phi:
				; CHECK: %val = phi i32 [ 0, %pad.with.phi.from.invoke1 ], [ 1, %pad.with.phi.from.invoke2 ]
				; CHECK: %lp = phi { i8*, i32 } [ %0, %pad.with.phi.from.invoke2 ], [ %1, %pad.with.phi.from.invoke1 ]
				; CHECK: %exn = extractvalue { i8*, i32 } %lp, 0
				; CHECK: call i8* @__cxa_begin_catch(i8* %exn)
				; CHECK: call void @use_val(i32 %val)
				; CHECK: call void @__cxa_end_catch()
				; CHECK: call void @free(i8* %vFrame)
				; CHECK: ret void

				pad.with.phi:
				%val = phi i32 [ 0, %invoke1 ], [ 1, %invoke2 ]
				%lp = landingpad { i8*, i32 }
				catch i8* null
				%exn = extractvalue { i8*, i32 } %lp, 0
				call i8* @__cxa_begin_catch(i8* %exn)
				call void @use_val(i32 %val)
				call void @__cxa_end_catch()
				br label %cleanup

				cleanup: ; preds = %invoke.cont15, %if.else, %if.then, %ehcleanup21, %init.suspend
				%mem = call i8* @llvm.coro.free(token %id, i8* %hdl)
				call void @free(i8* %mem)
				br label %coro.ret

				coro.ret:
				call i1 @llvm.coro.end(i8* null, i1 false)
				ret void

				unreach:
				unreachable
				}

				; Function Attrs: argmemonly nounwind readonly
				declare token @llvm.coro.id(i32, i8* readnone, i8* nocapture readonly, i8*)
				declare noalias i8* @malloc(i64)
				declare i64 @llvm.coro.size.i64()
				declare i8* @llvm.coro.begin(token, i8* writeonly)

				; Function Attrs: nounwind
				declare token @llvm.coro.save(i8*)
				declare i8 @llvm.coro.suspend(token, i1)

				; Function Attrs: argmemonly nounwind
				declare void @may_throw1()
				declare void @may_throw2()

				declare i8* @__cxa_begin_catch(i8*)

				declare void @use_val(i32)
				declare void @__cxa_end_catch()

				; Function Attrs: nounwind
				declare i1 @llvm.coro.end(i8*, i1)
				declare void @free(i8*)
				declare i8* @llvm.coro.free(token, i8* nocapture readonly)

llvm/test/Transforms/Coroutines/coro-eh-aware-edge-split-01.ll

This file was added.

				; Check that we can handle edge splits leading into a landingpad
				; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				; CHECK-LABEL: define internal fastcc void @g.resume(
				define void @g(i1 %cond, i32 %x, i32 %y) "coroutine.presplit"="1" personality i32 0 {
				entry:
				%id = call token @llvm.coro.id(i32 16, i8* null, i8* null, i8* null)
				%size = tail call i64 @llvm.coro.size.i64()
				%alloc = call i8* @malloc(i64 %size)
				%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)
				%sp = call i8 @llvm.coro.suspend(token none, i1 false)
				switch i8 %sp, label %coro.ret [
				i8 0, label %resume
				i8 1, label %cleanup
				]

				resume:
				br i1 %cond, label %invoke1, label %invoke2

				invoke1:
				invoke void @may_throw1()
				to label %unreach unwind label %pad.with.phi
				invoke2:
				invoke void @may_throw2()
				to label %unreach unwind label %pad.with.phi

				; Verify that we created cleanuppads on every edge and inserted a reload of the spilled value

				; CHECK: pad.with.phi.from.invoke2:
				; CHECK: %0 = cleanuppad within none []
				; CHECK: %y.reload.addr = getelementptr inbounds %g.Frame, %g.Frame* %FramePtr, i32 0, i32 6
				; CHECK: %y.reload = load i32, i32* %y.reload.addr
				; CHECK: cleanupret from %0 unwind label %pad.with.phi

				; CHECK: pad.with.phi.from.invoke1:
				; CHECK: %1 = cleanuppad within none []
				; CHECK: %x.reload.addr = getelementptr inbounds %g.Frame, %g.Frame* %FramePtr, i32 0, i32 5
				; CHECK: %x.reload = load i32, i32* %x.reload.addr
				; CHECK: cleanupret from %1 unwind label %pad.with.phi

				; CHECK: pad.with.phi:
				; CHECK: %val = phi i32 [ %x.reload, %pad.with.phi.from.invoke1 ], [ %y.reload, %pad.with.phi.from.invoke2 ]
				; CHECK: %tok = cleanuppad within none []
				; CHECK: call void @use_val(i32 %val)
				; CHECK: cleanupret from %tok unwind to caller

				pad.with.phi:
				%val = phi i32 [ %x, %invoke1 ], [ %y, %invoke2 ]
				%tok = cleanuppad within none []
				call void @use_val(i32 %val)
				cleanupret from %tok unwind to caller

				cleanup: ; preds = %invoke.cont15, %if.else, %if.then, %ehcleanup21, %init.suspend
				%mem = call i8* @llvm.coro.free(token %id, i8* %hdl)
				call void @free(i8* %mem)
				br label %coro.ret

				coro.ret:
				call i1 @llvm.coro.end(i8* null, i1 false)
				ret void

				unreach:
				unreachable
				}

				; Function Attrs: argmemonly nounwind readonly
				declare token @llvm.coro.id(i32, i8* readnone, i8* nocapture readonly, i8*)
				declare noalias i8* @malloc(i64)
				declare i64 @llvm.coro.size.i64()
				declare i8* @llvm.coro.begin(token, i8* writeonly)

				; Function Attrs: nounwind
				declare token @llvm.coro.save(i8*)
				declare i8 @llvm.coro.suspend(token, i1)

				; Function Attrs: argmemonly nounwind
				declare void @may_throw1()
				declare void @may_throw2()

				declare i8* @__cxa_begin_catch(i8*)

				declare void @use_val(i32)
				declare void @__cxa_end_catch()

				; Function Attrs: nounwind
				declare i1 @llvm.coro.end(i8*, i1)
				declare void @free(i8*)
				declare i8* @llvm.coro.free(token, i8* nocapture readonly)

llvm/test/Transforms/Coroutines/coro-eh-aware-edge-split-02.ll

This file was added.

				; Check that we can handle edge splits leading into a landingpad
				; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				; CHECK-LABEL: define internal fastcc void @h.resume(
				define void @h(i1 %cond, i32 %x, i32 %y) "coroutine.presplit"="1" personality i32 0 {
				entry:
				%id = call token @llvm.coro.id(i32 16, i8* null, i8* null, i8* null)
				%size = tail call i64 @llvm.coro.size.i64()
				%alloc = call i8* @malloc(i64 %size)
				%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)
				%sp = call i8 @llvm.coro.suspend(token none, i1 false)
				switch i8 %sp, label %coro.ret [
				i8 0, label %resume
				i8 1, label %cleanup
				]

				resume:
				br i1 %cond, label %invoke1, label %invoke2

				invoke1:
				invoke void @may_throw1()
				to label %coro.ret unwind label %pad.with.phi
				invoke2:
				invoke void @may_throw2()
				to label %coro.ret unwind label %pad.with.phi

				; Verify that we created cleanuppads on every edge and inserted a reload of the spilled value

				; CHECK: pad.with.phi.from.invoke2:
				; CHECK: %0 = cleanuppad within none []
				; CHECK: %y.reload.addr = getelementptr inbounds %h.Frame, %h.Frame* %FramePtr, i32 0, i32 6
				; CHECK: %y.reload = load i32, i32* %y.reload.addr
				; CHECK: cleanupret from %0 unwind label %pad.with.phi

				; CHECK: pad.with.phi.from.invoke1:
				; CHECK: %1 = cleanuppad within none []
				; CHECK: %x.reload.addr = getelementptr inbounds %h.Frame, %h.Frame* %FramePtr, i32 0, i32 5
				; CHECK: %x.reload = load i32, i32* %x.reload.addr
				; CHECK: cleanupret from %1 unwind label %pad.with.phi

				; CHECK: pad.with.phi:
				; CHECK: %val = phi i32 [ %x.reload, %pad.with.phi.from.invoke1 ], [ %y.reload, %pad.with.phi.from.invoke2 ]
				; CHECK: %switch = catchswitch within none [label %catch] unwind to caller
				pad.with.phi:
				%val = phi i32 [ %x, %invoke1 ], [ %y, %invoke2 ]
				%switch = catchswitch within none [label %catch] unwind to caller

				catch: ; preds = %catch.dispatch
				%pad = catchpad within %switch [i8* null, i32 64, i8* null]
				call void @use_val(i32 %val)
				catchret from %pad to label %coro.ret

				cleanup: ; preds = %invoke.cont15, %if.else, %if.then, %ehcleanup21, %init.suspend
				%mem = call i8* @llvm.coro.free(token %id, i8* %hdl)
				call void @free(i8* %mem)
				br label %coro.ret

				coro.ret:
				call i1 @llvm.coro.end(i8* null, i1 false)
				ret void
				}

				; Function Attrs: argmemonly nounwind readonly
				declare token @llvm.coro.id(i32, i8* readnone, i8* nocapture readonly, i8*)
				declare noalias i8* @malloc(i64)
				declare i64 @llvm.coro.size.i64()
				declare i8* @llvm.coro.begin(token, i8* writeonly)

				; Function Attrs: nounwind
				declare token @llvm.coro.save(i8*)
				declare i8 @llvm.coro.suspend(token, i1)

				; Function Attrs: argmemonly nounwind
				declare void @may_throw1()
				declare void @may_throw2()

				declare i8* @__cxa_begin_catch(i8*)

				declare void @use_val(i32)
				declare void @__cxa_end_catch()

				; Function Attrs: nounwind
				declare i1 @llvm.coro.end(i8*, i1)
				declare void @free(i8*)
				declare i8* @llvm.coro.free(token, i8* nocapture readonly)

llvm/test/Transforms/Coroutines/coro-eh-aware-edge-split.ll

This file was deleted.

	; Check that we can handle edge splits leading into a landingpad
	; RUN: opt < %s -coro-split -S \| FileCheck %s

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"

	; CHECK-LABEL: define internal fastcc void @f.resume(
	define void @f(i1 %cond) "coroutine.presplit"="1" personality i32 0 {
	entry:
	%id = call token @llvm.coro.id(i32 16, i8* null, i8* null, i8* null)
	%size = tail call i64 @llvm.coro.size.i64()
	%alloc = call i8* @malloc(i64 %size)
	%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)
	%sp = call i8 @llvm.coro.suspend(token none, i1 false)
	switch i8 %sp, label %coro.ret [
	i8 0, label %resume
	i8 1, label %cleanup
	]

	resume:
	br i1 %cond, label %invoke1, label %invoke2

	invoke1:
	invoke void @may_throw1()
	to label %unreach unwind label %pad.with.phi
	invoke2:
	invoke void @may_throw2()
	to label %unreach unwind label %pad.with.phi

	; Verify that we cloned landing pad on every edge and inserted a reload of the spilled value

	; CHECK: pad.with.phi.from.invoke2:
	; CHECK: %0 = landingpad { i8*, i32 }
	; CHECK: catch i8* null
	; CHECK: br label %pad.with.phi

	; CHECK: pad.with.phi.from.invoke1:
	; CHECK: %1 = landingpad { i8*, i32 }
	; CHECK: catch i8* null
	; CHECK: br label %pad.with.phi

	; CHECK: pad.with.phi:
	; CHECK: %val = phi i32 [ 0, %pad.with.phi.from.invoke1 ], [ 1, %pad.with.phi.from.invoke2 ]
	; CHECK: %lp = phi { i8*, i32 } [ %0, %pad.with.phi.from.invoke2 ], [ %1, %pad.with.phi.from.invoke1 ]
	; CHECK: %exn = extractvalue { i8*, i32 } %lp, 0
	; CHECK: call i8* @__cxa_begin_catch(i8* %exn)
	; CHECK: call void @use_val(i32 %val)
	; CHECK: call void @__cxa_end_catch()
	; CHECK: call void @free(i8* %vFrame)
	; CHECK: ret void

	pad.with.phi:
	%val = phi i32 [ 0, %invoke1 ], [ 1, %invoke2 ]
	%lp = landingpad { i8*, i32 }
	catch i8* null
	%exn = extractvalue { i8*, i32 } %lp, 0
	call i8* @__cxa_begin_catch(i8* %exn)
	call void @use_val(i32 %val)
	call void @__cxa_end_catch()
	br label %cleanup

	cleanup: ; preds = %invoke.cont15, %if.else, %if.then, %ehcleanup21, %init.suspend
	%mem = call i8* @llvm.coro.free(token %id, i8* %hdl)
	call void @free(i8* %mem)
	br label %coro.ret

	coro.ret:
	call i1 @llvm.coro.end(i8* null, i1 false)
	ret void

	unreach:
	unreachable
	}

	; CHECK-LABEL: define internal fastcc void @g.resume(
	define void @g(i1 %cond, i32 %x, i32 %y) "coroutine.presplit"="1" personality i32 0 {
	entry:
	%id = call token @llvm.coro.id(i32 16, i8* null, i8* null, i8* null)
	%size = tail call i64 @llvm.coro.size.i64()
	%alloc = call i8* @malloc(i64 %size)
	%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)
	%sp = call i8 @llvm.coro.suspend(token none, i1 false)
	switch i8 %sp, label %coro.ret [
	i8 0, label %resume
	i8 1, label %cleanup
	]

	resume:
	br i1 %cond, label %invoke1, label %invoke2

	invoke1:
	invoke void @may_throw1()
	to label %unreach unwind label %pad.with.phi
	invoke2:
	invoke void @may_throw2()
	to label %unreach unwind label %pad.with.phi

	; Verify that we created cleanuppads on every edge and inserted a reload of the spilled value

	; CHECK: pad.with.phi.from.invoke2:
	; CHECK: %0 = cleanuppad within none []
	; CHECK: %y.reload.addr = getelementptr inbounds %g.Frame, %g.Frame* %FramePtr, i32 0, i32 6
	; CHECK: %y.reload = load i32, i32* %y.reload.addr
	; CHECK: cleanupret from %0 unwind label %pad.with.phi

	; CHECK: pad.with.phi.from.invoke1:
	; CHECK: %1 = cleanuppad within none []
	; CHECK: %x.reload.addr = getelementptr inbounds %g.Frame, %g.Frame* %FramePtr, i32 0, i32 5
	; CHECK: %x.reload = load i32, i32* %x.reload.addr
	; CHECK: cleanupret from %1 unwind label %pad.with.phi

	; CHECK: pad.with.phi:
	; CHECK: %val = phi i32 [ %x.reload, %pad.with.phi.from.invoke1 ], [ %y.reload, %pad.with.phi.from.invoke2 ]
	; CHECK: %tok = cleanuppad within none []
	; CHECK: call void @use_val(i32 %val)
	; CHECK: cleanupret from %tok unwind to caller

	pad.with.phi:
	%val = phi i32 [ %x, %invoke1 ], [ %y, %invoke2 ]
	%tok = cleanuppad within none []
	call void @use_val(i32 %val)
	cleanupret from %tok unwind to caller

	cleanup: ; preds = %invoke.cont15, %if.else, %if.then, %ehcleanup21, %init.suspend
	%mem = call i8* @llvm.coro.free(token %id, i8* %hdl)
	call void @free(i8* %mem)
	br label %coro.ret

	coro.ret:
	call i1 @llvm.coro.end(i8* null, i1 false)
	ret void

	unreach:
	unreachable
	}

	; CHECK-LABEL: define internal fastcc void @h.resume(
	define void @h(i1 %cond, i32 %x, i32 %y) "coroutine.presplit"="1" personality i32 0 {
	entry:
	%id = call token @llvm.coro.id(i32 16, i8* null, i8* null, i8* null)
	%size = tail call i64 @llvm.coro.size.i64()
	%alloc = call i8* @malloc(i64 %size)
	%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)
	%sp = call i8 @llvm.coro.suspend(token none, i1 false)
	switch i8 %sp, label %coro.ret [
	i8 0, label %resume
	i8 1, label %cleanup
	]

	resume:
	br i1 %cond, label %invoke1, label %invoke2

	invoke1:
	invoke void @may_throw1()
	to label %coro.ret unwind label %pad.with.phi
	invoke2:
	invoke void @may_throw2()
	to label %coro.ret unwind label %pad.with.phi

	; Verify that we created cleanuppads on every edge and inserted a reload of the spilled value

	; CHECK: pad.with.phi.from.invoke2:
	; CHECK: %0 = cleanuppad within none []
	; CHECK: %y.reload.addr = getelementptr inbounds %h.Frame, %h.Frame* %FramePtr, i32 0, i32 6
	; CHECK: %y.reload = load i32, i32* %y.reload.addr
	; CHECK: cleanupret from %0 unwind label %pad.with.phi

	; CHECK: pad.with.phi.from.invoke1:
	; CHECK: %1 = cleanuppad within none []
	; CHECK: %x.reload.addr = getelementptr inbounds %h.Frame, %h.Frame* %FramePtr, i32 0, i32 5
	; CHECK: %x.reload = load i32, i32* %x.reload.addr
	; CHECK: cleanupret from %1 unwind label %pad.with.phi

	; CHECK: pad.with.phi:
	; CHECK: %val = phi i32 [ %x.reload, %pad.with.phi.from.invoke1 ], [ %y.reload, %pad.with.phi.from.invoke2 ]
	; CHECK: %switch = catchswitch within none [label %catch] unwind to caller
	pad.with.phi:
	%val = phi i32 [ %x, %invoke1 ], [ %y, %invoke2 ]
	%switch = catchswitch within none [label %catch] unwind to caller

	catch: ; preds = %catch.dispatch
	%pad = catchpad within %switch [i8* null, i32 64, i8* null]
	call void @use_val(i32 %val)
	catchret from %pad to label %coro.ret

	cleanup: ; preds = %invoke.cont15, %if.else, %if.then, %ehcleanup21, %init.suspend
	%mem = call i8* @llvm.coro.free(token %id, i8* %hdl)
	call void @free(i8* %mem)
	br label %coro.ret

	coro.ret:
	call i1 @llvm.coro.end(i8* null, i1 false)
	ret void
	}

	; Function Attrs: argmemonly nounwind readonly
	declare token @llvm.coro.id(i32, i8* readnone, i8* nocapture readonly, i8*)
	declare noalias i8* @malloc(i64)
	declare i64 @llvm.coro.size.i64()
	declare i8* @llvm.coro.begin(token, i8* writeonly)

	; Function Attrs: nounwind
	declare token @llvm.coro.save(i8*)
	declare i8 @llvm.coro.suspend(token, i1)

	; Function Attrs: argmemonly nounwind
	declare void @may_throw1()
	declare void @may_throw2()

	declare i8* @__cxa_begin_catch(i8*)

	declare void @use_val(i32)
	declare void @__cxa_end_catch()

	; Function Attrs: nounwind
	declare i1 @llvm.coro.end(i8*, i1)
	declare void @free(i8*)
	declare i8* @llvm.coro.free(token, i8* nocapture readonly)

llvm/test/Transforms/Coroutines/coro-frame-arrayalloca.ll

	; Check that we can handle spills of array allocas			; Check that we can handle spills of array allocas
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	declare void @consume.double.ptr(double*)			declare void @consume.double.ptr(double*)
	declare void @consume.i32.ptr(i32*)			declare void @consume.i32.ptr(i32*)

	define i8* @f() "coroutine.presplit"="1" {			define i8* @f() "coroutine.presplit"="1" {
	entry:			entry:
	%prefix = alloca double			%prefix = alloca double
	%data = alloca i32, i32 4			%data = alloca i32, i32 4
	▲ Show 20 Lines • Show All 62 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-frame-unreachable.ll

	; Check that coro-split doesn't choke on intrinsics in unreachable blocks			; Check that coro-split doesn't choke on intrinsics in unreachable blocks
	; RUN: opt < %s -coro-split -S			; RUN: opt < %s -coro-split -S
				; RUN: opt < %s -passes=coro-split -S

	define i8* @f(i1 %arg) "coroutine.presplit"="1" personality i32 0 {			define i8* @f(i1 %arg) "coroutine.presplit"="1" personality i32 0 {
	entry:			entry:
	%arg.addr = alloca i1			%arg.addr = alloca i1
	store i1 %arg, i1* %arg.addr			store i1 %arg, i1* %arg.addr
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%size = call i32 @llvm.coro.size.i32()			%size = call i32 @llvm.coro.size.i32()
	%alloc = call i8* @malloc(i32 %size)			%alloc = call i8* @malloc(i32 %size)
	Show All 40 Lines

llvm/test/Transforms/Coroutines/coro-frame.ll

	; Check that we can handle spills of the result of the invoke instruction			; Check that we can handle spills of the result of the invoke instruction
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	define i8* @f(i64 %this) "coroutine.presplit"="1" personality i32 0 {			define i8* @f(i64 %this) "coroutine.presplit"="1" personality i32 0 {
	entry:			entry:
	%this.addr = alloca i64			%this.addr = alloca i64
	store i64 %this, i64* %this.addr			store i64 %this, i64* %this.addr
	%this1 = load i64, i64* %this.addr			%this1 = load i64, i64* %this.addr
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%size = call i32 @llvm.coro.size.i32()			%size = call i32 @llvm.coro.size.i32()
	▲ Show 20 Lines • Show All 56 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-materialize.ll

	; Verifies that we materialize instruction across suspend points			; Verifies that we materialize instruction across suspend points
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	define i8* @f(i32 %n) "coroutine.presplit"="1" {			define i8* @f(i32 %n) "coroutine.presplit"="1" {
	entry:			entry:
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%size = call i32 @llvm.coro.size.i32()			%size = call i32 @llvm.coro.size.i32()
	%alloc = call i8* @malloc(i32 %size)			%alloc = call i8* @malloc(i32 %size)
	%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)			%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)

	▲ Show 20 Lines • Show All 42 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-padding.ll

	; Check that we will insert the correct padding if natural alignment of the			; Check that we will insert the correct padding if natural alignment of the
	; spilled data does not match the alignment specified in alloca instruction.			; spilled data does not match the alignment specified in alloca instruction.
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	%PackedStruct = type <{ i64 }>			%PackedStruct = type <{ i64 }>

	declare void @consume(%PackedStruct*)			declare void @consume(%PackedStruct*)

	define i8* @f() "coroutine.presplit"="1" {			define i8* @f() "coroutine.presplit"="1" {
	entry:			entry:
	%data = alloca %PackedStruct, align 8			%data = alloca %PackedStruct, align 8
	▲ Show 20 Lines • Show All 50 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-param-copy.ll

	; Check that we create copy the data from the alloca into the coroutine			; Check that we create copy the data from the alloca into the coroutine
	; frame slot if it was written to.			; frame slot if it was written to.
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	define i8* @f() "coroutine.presplit"="1" {			define i8* @f() "coroutine.presplit"="1" {
	entry:			entry:
	%x.addr = alloca i64			%x.addr = alloca i64
	call void @use(i64* %x.addr) ; might write to %x			call void @use(i64* %x.addr) ; might write to %x
	%y.addr = alloca i64			%y.addr = alloca i64
	%y = load i64, i64* %y.addr ; cannot modify the value, don't need to copy			%y = load i64, i64* %y.addr ; cannot modify the value, don't need to copy
	call void @print(i64 %y)			call void @print(i64 %y)
	▲ Show 20 Lines • Show All 58 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-spill-after-phi.ll

	; Verifies that we insert spills of PHI instruction _after) all PHI Nodes			; Verifies that we insert spills of PHI instruction _after) all PHI Nodes
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	define i8* @f(i1 %n) "coroutine.presplit"="1" {			define i8* @f(i1 %n) "coroutine.presplit"="1" {
	entry:			entry:
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%size = call i32 @llvm.coro.size.i32()			%size = call i32 @llvm.coro.size.i32()
	%alloc = call i8* @malloc(i32 %size)			%alloc = call i8* @malloc(i32 %size)
	%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)			%hdl = call i8* @llvm.coro.begin(token %id, i8* %alloc)
	br i1 %n, label %begin, label %alt			br i1 %n, label %begin, label %alt
	▲ Show 20 Lines • Show All 50 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-spill-corobegin.ll

	; Check that we can spills coro.begin from an inlined inner coroutine.			; Check that we can spills coro.begin from an inlined inner coroutine.
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	%g.Frame = type { void (%g.Frame), void (%g.Frame), i32, i1, i32 }			%g.Frame = type { void (%g.Frame), void (%g.Frame), i32, i1, i32 }

	@g.resumers = private constant [3 x void (%g.Frame)] [void (%g.Frame) @g.dummy, void (%g.Frame) @g.dummy, void (%g.Frame) @g.dummy]			@g.resumers = private constant [3 x void (%g.Frame)] [void (%g.Frame) @g.dummy, void (%g.Frame) @g.dummy, void (%g.Frame) @g.dummy]

	declare void @g.dummy(%g.Frame*)			declare void @g.dummy(%g.Frame*)

	define i8* @f() "coroutine.presplit"="1" {			define i8* @f() "coroutine.presplit"="1" {
	▲ Show 20 Lines • Show All 59 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-split-00.ll

	; Tests that coro-split pass splits the coroutine into f, f.resume and f.destroy			; Tests that coro-split pass splits the coroutine into f, f.resume and f.destroy
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	define i8* @f() "coroutine.presplit"="1" {			define i8* @f() "coroutine.presplit"="1" {
	entry:			entry:
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%need.alloc = call i1 @llvm.coro.alloc(token %id)			%need.alloc = call i1 @llvm.coro.alloc(token %id)
	br i1 %need.alloc, label %dyn.alloc, label %begin			br i1 %need.alloc, label %dyn.alloc, label %begin

	dyn.alloc:			dyn.alloc:
	▲ Show 20 Lines • Show All 69 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-split-02.ll

	; Tests that coro-split can handle the case when a code after coro.suspend uses			; Tests that coro-split can handle the case when a code after coro.suspend uses
	; a value produces between coro.save and coro.suspend (%Result.i19)			; a value produces between coro.save and coro.suspend (%Result.i19)
	; and checks whether stray coro.saves are properly removed			; and checks whether stray coro.saves are properly removed
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	%"struct.std::coroutine_handle" = type { i8* }			%"struct.std::coroutine_handle" = type { i8* }
	%"struct.std::coroutine_handle.0" = type { %"struct.std::coroutine_handle" }			%"struct.std::coroutine_handle.0" = type { %"struct.std::coroutine_handle" }
	%"struct.lean_future<int>::Awaiter" = type { i32, %"struct.std::coroutine_handle.0" }			%"struct.lean_future<int>::Awaiter" = type { i32, %"struct.std::coroutine_handle.0" }

	declare i8* @malloc(i64)			declare i8* @malloc(i64)
	declare void @print(i32)			declare void @print(i32)

	▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-split-alloc.ll

	; Tests that coro-split passes initialized values to coroutine frame allocator.			; Tests that coro-split passes initialized values to coroutine frame allocator.
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	define i8* @f(i32 %argument) "coroutine.presplit"="1" {			define i8* @f(i32 %argument) "coroutine.presplit"="1" {
	entry:			entry:
	%argument.addr = alloca i32, align 4			%argument.addr = alloca i32, align 4
	%incremented = add i32 %argument, 1			%incremented = add i32 %argument, 1
	store i32 %incremented, i32* %argument.addr, align 4			store i32 %incremented, i32* %argument.addr, align 4
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%need.alloc = call i1 @llvm.coro.alloc(token %id)			%need.alloc = call i1 @llvm.coro.alloc(token %id)
	▲ Show 20 Lines • Show All 54 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-split-dbg.ll

	; Make sure that coro-split correctly deals with debug information.			; Make sure that coro-split correctly deals with debug information.
	; The test here is simply that it does not result in bad IR that will crash opt.			; The test here is simply that it does not result in bad IR that will crash opt.
	; RUN: opt < %s -coro-split -disable-output			; RUN: opt < %s -coro-split -disable-output
				; RUN: opt < %s -passes=coro-split -disable-output
	source_filename = "coro.c"			source_filename = "coro.c"
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; Function Attrs: nounwind readnone			; Function Attrs: nounwind readnone
	declare void @llvm.dbg.declare(metadata, metadata, metadata) #1			declare void @llvm.dbg.declare(metadata, metadata, metadata) #1

	declare void @bar(...) local_unnamed_addr #2			declare void @bar(...) local_unnamed_addr #2
	▲ Show 20 Lines • Show All 108 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-split-eh-00.ll

This file was copied from llvm/test/Transforms/Coroutines/coro-split-eh.ll.

	; Tests that coro-split removes cleanup code after coro.end in resume functions			; Tests that coro-split removes cleanup code after coro.end in resume functions
	; and retains it in the start function.			; and retains it in the start function.
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	define i8* @f(i1 %val) "coroutine.presplit"="1" personality i32 3 {			define i8* @f(i1 %val) "coroutine.presplit"="1" personality i32 3 {
	entry:			entry:
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%hdl = call i8* @llvm.coro.begin(token %id, i8* null)			%hdl = call i8* @llvm.coro.begin(token %id, i8* null)
	call void @print(i32 0)			call void @print(i32 0)
	br i1 %val, label %resume, label %susp			br i1 %val, label %resume, label %susp

	Show All 36 Lines

	; CHECK: lpad:			; CHECK: lpad:
	; CHECK-NEXT: %lpval = landingpad { i8*, i32 }			; CHECK-NEXT: %lpval = landingpad { i8*, i32 }
	; CHECK-NEXT: cleanup			; CHECK-NEXT: cleanup
	; CHECK-NEXT: call void @print(i32 2)			; CHECK-NEXT: call void @print(i32 2)
	; CHECK-NEXT: call void @print(i32 3)			; CHECK-NEXT: call void @print(i32 3)
	; CHECK-NEXT: resume { i8*, i32 } %lpval			; CHECK-NEXT: resume { i8*, i32 } %lpval

	define i8* @f2(i1 %val) "coroutine.presplit"="1" personality i32 4 {
	entry:
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%hdl = call i8* @llvm.coro.begin(token %id, i8* null)
	call void @print(i32 0)
	br i1 %val, label %resume, label %susp

	susp:
	%0 = call i8 @llvm.coro.suspend(token none, i1 false)
	switch i8 %0, label %suspend [i8 0, label %resume
	i8 1, label %suspend]
	resume:
	invoke void @print(i32 1) to label %suspend unwind label %lpad

	suspend:
	call i1 @llvm.coro.end(i8* %hdl, i1 0)
	call void @print(i32 0) ; should not be present in f.resume
	ret i8* %hdl

	lpad:
	%tok = cleanuppad within none []
	call void @print(i32 2)
	%unused = call i1 @llvm.coro.end(i8* null, i1 true) [ "funclet"(token %tok) ]
	cleanupret from %tok unwind label %cleanup.cont

	cleanup.cont:
	%tok2 = cleanuppad within none []
	call void @print(i32 3) ; should not be present in f.resume
	cleanupret from %tok2 unwind to caller
	}

	; Verify that start function contains both print calls the one before and after coro.end
	; CHECK-LABEL: define i8* @f2(
	; CHECK: invoke void @print(i32 1)
	; CHECK: to label %AfterCoroEnd unwind label %lpad

	; CHECK: AfterCoroEnd:
	; CHECK: call void @print(i32 0)
	; CHECK: ret i8* %hdl

	; CHECK: lpad:
	; CHECK-NEXT: %tok = cleanuppad within none []
	; CHECK-NEXT: call void @print(i32 2)
	; CHECK-NEXT: call void @print(i32 3)
	; CHECK-NEXT: cleanupret from %tok unwind to caller

	; VERIFY Resume Parts			; VERIFY Resume Parts

	; Verify that resume function does not contains both print calls appearing after coro.end			; Verify that resume function does not contains both print calls appearing after coro.end
	; CHECK-LABEL: define internal fastcc void @f.resume			; CHECK-LABEL: define internal fastcc void @f.resume
	; CHECK: invoke void @print(i32 1)			; CHECK: invoke void @print(i32 1)
	; CHECK: to label %CoroEnd unwind label %lpad			; CHECK: to label %CoroEnd unwind label %lpad

	; CHECK: CoroEnd:			; CHECK: CoroEnd:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void

	; CHECK: lpad:			; CHECK: lpad:
	; CHECK-NEXT: %lpval = landingpad { i8*, i32 }			; CHECK-NEXT: %lpval = landingpad { i8*, i32 }
	; CHECK-NEXT: cleanup			; CHECK-NEXT: cleanup
	; CHECK-NEXT: call void @print(i32 2)			; CHECK-NEXT: call void @print(i32 2)
	; CHECK-NEXT: resume { i8*, i32 } %lpval			; CHECK-NEXT: resume { i8*, i32 } %lpval

	; Verify that resume function does not contains both print calls appearing after coro.end
	; CHECK-LABEL: define internal fastcc void @f2.resume
	; CHECK: invoke void @print(i32 1)
	; CHECK: to label %CoroEnd unwind label %lpad

	; CHECK: CoroEnd:
	; CHECK-NEXT: ret void

	; CHECK: lpad:
	; CHECK-NEXT: %tok = cleanuppad within none []
	; CHECK-NEXT: call void @print(i32 2)
	; CHECK-NEXT: cleanupret from %tok unwind to caller

	declare i8* @llvm.coro.free(token, i8*)			declare i8* @llvm.coro.free(token, i8*)
	declare i32 @llvm.coro.size.i32()			declare i32 @llvm.coro.size.i32()
	declare i8 @llvm.coro.suspend(token, i1)			declare i8 @llvm.coro.suspend(token, i1)
	declare void @llvm.coro.resume(i8*)			declare void @llvm.coro.resume(i8*)
	declare void @llvm.coro.destroy(i8*)			declare void @llvm.coro.destroy(i8*)

	declare token @llvm.coro.id(i32, i8, i8, i8*)			declare token @llvm.coro.id(i32, i8, i8, i8*)
	declare i8* @llvm.coro.alloc(token)			declare i8* @llvm.coro.alloc(token)
	declare i8* @llvm.coro.begin(token, i8*)			declare i8* @llvm.coro.begin(token, i8*)
	declare i1 @llvm.coro.end(i8*, i1)			declare i1 @llvm.coro.end(i8*, i1)

	declare noalias i8* @malloc(i32)			declare noalias i8* @malloc(i32)
	declare void @print(i32)			declare void @print(i32)
	declare void @free(i8*)			declare void @free(i8*)

llvm/test/Transforms/Coroutines/coro-split-eh-01.ll

This file was moved from llvm/test/Transforms/Coroutines/coro-split-eh.ll.

	; Tests that coro-split removes cleanup code after coro.end in resume functions			; Tests that coro-split removes cleanup code after coro.end in resume functions
	; and retains it in the start function.			; and retains it in the start function.
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s
	define i8* @f(i1 %val) "coroutine.presplit"="1" personality i32 3 {
	entry:
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%hdl = call i8* @llvm.coro.begin(token %id, i8* null)
	call void @print(i32 0)
	br i1 %val, label %resume, label %susp

	susp:
	%0 = call i8 @llvm.coro.suspend(token none, i1 false)
	switch i8 %0, label %suspend [i8 0, label %resume
	i8 1, label %suspend]
	resume:
	invoke void @print(i32 1) to label %suspend unwind label %lpad

	suspend:
	call i1 @llvm.coro.end(i8* %hdl, i1 0)
	call void @print(i32 0) ; should not be present in f.resume
	ret i8* %hdl

	lpad:
	%lpval = landingpad { i8*, i32 }
	cleanup

	call void @print(i32 2)
	%need.resume = call i1 @llvm.coro.end(i8* null, i1 true)
	br i1 %need.resume, label %eh.resume, label %cleanup.cont

	cleanup.cont:
	call void @print(i32 3) ; should not be present in f.resume
	br label %eh.resume

	eh.resume:
	resume { i8*, i32 } %lpval
	}

	; Verify that start function contains both print calls the one before and after coro.end
	; CHECK-LABEL: define i8* @f(
	; CHECK: invoke void @print(i32 1)
	; CHECK: to label %AfterCoroEnd unwind label %lpad

	; CHECK: AfterCoroEnd:
	; CHECK: call void @print(i32 0)
	; CHECK: ret i8* %hdl

	; CHECK: lpad:
	; CHECK-NEXT: %lpval = landingpad { i8*, i32 }
	; CHECK-NEXT: cleanup
	; CHECK-NEXT: call void @print(i32 2)
	; CHECK-NEXT: call void @print(i32 3)
	; CHECK-NEXT: resume { i8*, i32 } %lpval

	define i8* @f2(i1 %val) "coroutine.presplit"="1" personality i32 4 {			define i8* @f2(i1 %val) "coroutine.presplit"="1" personality i32 4 {
	entry:			entry:
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%hdl = call i8* @llvm.coro.begin(token %id, i8* null)			%hdl = call i8* @llvm.coro.begin(token %id, i8* null)
	call void @print(i32 0)			call void @print(i32 0)
	br i1 %val, label %resume, label %susp			br i1 %val, label %resume, label %susp

	susp:			susp:
	%0 = call i8 @llvm.coro.suspend(token none, i1 false)			%0 = call i8 @llvm.coro.suspend(token none, i1 false)
	switch i8 %0, label %suspend [i8 0, label %resume			switch i8 %0, label %suspend [i8 0, label %resume
	i8 1, label %suspend]			i8 1, label %suspend]
	resume:			resume:
	invoke void @print(i32 1) to label %suspend unwind label %lpad			invoke void @print(i32 1) to label %suspend unwind label %lpad

	suspend:			suspend:
	call i1 @llvm.coro.end(i8* %hdl, i1 0)			call i1 @llvm.coro.end(i8* %hdl, i1 0)
	call void @print(i32 0) ; should not be present in f.resume			call void @print(i32 0) ; should not be present in f.resume
	ret i8* %hdl			ret i8* %hdl

	lpad:			lpad:
	%tok = cleanuppad within none []			%tok = cleanuppad within none []
	call void @print(i32 2)			call void @print(i32 2)
	%unused = call i1 @llvm.coro.end(i8* null, i1 true) [ "funclet"(token %tok) ]			%unused = call i1 @llvm.coro.end(i8* null, i1 true) [ "funclet"(token %tok) ]
	cleanupret from %tok unwind label %cleanup.cont			cleanupret from %tok unwind label %cleanup.cont

	cleanup.cont:			cleanup.cont:
	%tok2 = cleanuppad within none []			%tok2 = cleanuppad within none []
	call void @print(i32 3) ; should not be present in f.resume			call void @print(i32 3) ; should not be present in f.resume
	cleanupret from %tok2 unwind to caller			cleanupret from %tok2 unwind to caller
	}			}

	; Verify that start function contains both print calls the one before and after coro.end			; Verify that start function contains both print calls the one before and after coro.end
	; CHECK-LABEL: define i8* @f2(			; CHECK-LABEL: define i8* @f2(
	; CHECK: invoke void @print(i32 1)			; CHECK: invoke void @print(i32 1)
	; CHECK: to label %AfterCoroEnd unwind label %lpad			; CHECK: to label %AfterCoroEnd unwind label %lpad

	; CHECK: AfterCoroEnd:			; CHECK: AfterCoroEnd:
	; CHECK: call void @print(i32 0)			; CHECK: call void @print(i32 0)
	; CHECK: ret i8* %hdl			; CHECK: ret i8* %hdl

	; CHECK: lpad:			; CHECK: lpad:
	; CHECK-NEXT: %tok = cleanuppad within none []			; CHECK-NEXT: %tok = cleanuppad within none []
	; CHECK-NEXT: call void @print(i32 2)			; CHECK-NEXT: call void @print(i32 2)
	; CHECK-NEXT: call void @print(i32 3)			; CHECK-NEXT: call void @print(i32 3)
	; CHECK-NEXT: cleanupret from %tok unwind to caller			; CHECK-NEXT: cleanupret from %tok unwind to caller

	; VERIFY Resume Parts			; VERIFY Resume Parts

	; Verify that resume function does not contains both print calls appearing after coro.end			; Verify that resume function does not contains both print calls appearing after coro.end
	; CHECK-LABEL: define internal fastcc void @f.resume
	; CHECK: invoke void @print(i32 1)
	; CHECK: to label %CoroEnd unwind label %lpad

	; CHECK: CoroEnd:
	; CHECK-NEXT: ret void

	; CHECK: lpad:
	; CHECK-NEXT: %lpval = landingpad { i8*, i32 }
	; CHECK-NEXT: cleanup
	; CHECK-NEXT: call void @print(i32 2)
	; CHECK-NEXT: resume { i8*, i32 } %lpval

	; Verify that resume function does not contains both print calls appearing after coro.end
	; CHECK-LABEL: define internal fastcc void @f2.resume			; CHECK-LABEL: define internal fastcc void @f2.resume
	; CHECK: invoke void @print(i32 1)			; CHECK: invoke void @print(i32 1)
	; CHECK: to label %CoroEnd unwind label %lpad			; CHECK: to label %CoroEnd unwind label %lpad

	; CHECK: CoroEnd:			; CHECK: CoroEnd:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void

	; CHECK: lpad:			; CHECK: lpad:
	; CHECK-NEXT: %tok = cleanuppad within none []			; CHECK-NEXT: %tok = cleanuppad within none []
	; CHECK-NEXT: call void @print(i32 2)			; CHECK-NEXT: call void @print(i32 2)
	; CHECK-NEXT: cleanupret from %tok unwind to caller			; CHECK-NEXT: cleanupret from %tok unwind to caller

	declare i8* @llvm.coro.free(token, i8*)			declare i8* @llvm.coro.free(token, i8*)
	declare i32 @llvm.coro.size.i32()			declare i32 @llvm.coro.size.i32()
	declare i8 @llvm.coro.suspend(token, i1)			declare i8 @llvm.coro.suspend(token, i1)
	declare void @llvm.coro.resume(i8*)			declare void @llvm.coro.resume(i8*)
	declare void @llvm.coro.destroy(i8*)			declare void @llvm.coro.destroy(i8*)

	declare token @llvm.coro.id(i32, i8, i8, i8*)			declare token @llvm.coro.id(i32, i8, i8, i8*)
	declare i8* @llvm.coro.alloc(token)			declare i8* @llvm.coro.alloc(token)
	declare i8* @llvm.coro.begin(token, i8*)			declare i8* @llvm.coro.begin(token, i8*)
	declare i1 @llvm.coro.end(i8*, i1)			declare i1 @llvm.coro.end(i8*, i1)

	declare noalias i8* @malloc(i32)			declare noalias i8* @malloc(i32)
	declare void @print(i32)			declare void @print(i32)
	declare void @free(i8*)			declare void @free(i8*)

llvm/test/Transforms/Coroutines/coro-split-eh.ll

This file was deleted after being copied to llvm/test/Transforms/Coroutines/coro-split-eh-00.ll, llvm/test/Transforms/Coroutines/coro-split-eh-01.ll.

The contents of this file were not changed.

llvm/test/Transforms/Coroutines/coro-split-hidden.ll

	; Tests that coro-split can convert functions with hidden visibility.			; Tests that coro-split can convert functions with hidden visibility.
	; These may be generated by a frontend such as Clang, when inlining with			; These may be generated by a frontend such as Clang, when inlining with
	; '-fvisibility-inlines-hidden'.			; '-fvisibility-inlines-hidden'.
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	define hidden i8* @f() "coroutine.presplit"="1" {			define hidden i8* @f() "coroutine.presplit"="1" {
	entry:			entry:
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%need.alloc = call i1 @llvm.coro.alloc(token %id)			%need.alloc = call i1 @llvm.coro.alloc(token %id)
	br i1 %need.alloc, label %dyn.alloc, label %begin			br i1 %need.alloc, label %dyn.alloc, label %begin

	dyn.alloc:			dyn.alloc:
	▲ Show 20 Lines • Show All 69 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-split-musttail.ll

	; Tests that coro-split will convert coro.resume followed by a suspend to a			; Tests that coro-split will convert coro.resume followed by a suspend to a
	; musttail call.			; musttail call.
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	define void @f() #0 {			define void @f() #0 {
	entry:			entry:
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%alloc = call i8* @malloc(i64 16) #3			%alloc = call i8* @malloc(i64 16) #3
	%vFrame = call noalias nonnull i8* @llvm.coro.begin(token %id, i8* %alloc)			%vFrame = call noalias nonnull i8* @llvm.coro.begin(token %id, i8* %alloc)

	%save = call token @llvm.coro.save(i8* null)			%save = call token @llvm.coro.save(i8* null)
	▲ Show 20 Lines • Show All 54 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/coro-split-musttail1.ll

	; Tests that coro-split will convert coro.resume followed by a suspend to a			; Tests that coro-split will convert coro.resume followed by a suspend to a
	; musttail call.			; musttail call.
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	define void @f() #0 {			define void @f() #0 {
	entry:			entry:
	%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)			%id = call token @llvm.coro.id(i32 0, i8* null, i8* null, i8* null)
	%alloc = call i8* @malloc(i64 16) #3			%alloc = call i8* @malloc(i64 16) #3
	%vFrame = call noalias nonnull i8* @llvm.coro.begin(token %id, i8* %alloc)			%vFrame = call noalias nonnull i8* @llvm.coro.begin(token %id, i8* %alloc)

	%save = call token @llvm.coro.save(i8* null)			%save = call token @llvm.coro.save(i8* null)
	▲ Show 20 Lines • Show All 93 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/no-suspend.ll

	; Test no suspend coroutines			; Test no suspend coroutines
	; RUN: opt < %s -coro-split -S \| FileCheck %s			; RUN: opt < %s -coro-split -S \| FileCheck %s
				; RUN: opt < %s -passes=coro-split -S \| FileCheck %s

	; Coroutine with no-suspends will turn into:			; Coroutine with no-suspends will turn into:
	;			;
	; CHECK-LABEL: define void @no_suspends(			; CHECK-LABEL: define void @no_suspends(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: alloca			; CHECK-NEXT: alloca
	; CHECK-NEXT: bitcast			; CHECK-NEXT: bitcast
	; CHECK-NEXT: call void @print(i32 %n)			; CHECK-NEXT: call void @print(i32 %n)
	▲ Show 20 Lines • Show All 370 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/restart-trigger.ll

	; Verifies that restart trigger forces IPO pipelines restart and the same			; Verifies that the restart trigger that is used by legacy coroutine passes
	; coroutine is looked at by CoroSplit pass twice.			; forces the legacy pass manager to restart IPO pipelines, thereby causing the
				; same coroutine to be looked at by CoroSplit pass twice.
	; REQUIRES: asserts			; REQUIRES: asserts
	; RUN: opt < %s -S -O0 -enable-coroutines -debug-only=coro-split 2>&1 \| FileCheck %s			; RUN: opt < %s -S -O0 -enable-coroutines -debug-only=coro-split 2>&1 \| FileCheck %s
	; RUN: opt < %s -S -O1 -enable-coroutines -debug-only=coro-split 2>&1 \| FileCheck %s			; RUN: opt < %s -S -O1 -enable-coroutines -debug-only=coro-split 2>&1 \| FileCheck %s

	; CHECK: CoroSplit: Processing coroutine 'f' state: 0			; CHECK: CoroSplit: Processing coroutine 'f' state: 0
	; CHECK-NEXT: CoroSplit: Processing coroutine 'f' state: 1			; CHECK-NEXT: CoroSplit: Processing coroutine 'f' state: 1

	define void @f() {			define void @f() {
	Show All 33 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[Coroutines][2/6] New pass manager: coro-splitClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 245077

llvm/include/llvm/Transforms/Coroutines/CoroSplit.h

llvm/lib/Passes/PassBuilder.cpp

llvm/lib/Passes/PassRegistry.def

llvm/lib/Transforms/Coroutines/CoroSplit.cpp

llvm/test/Transforms/Coroutines/coro-alloc-with-param-O0.ll

llvm/test/Transforms/Coroutines/coro-alloc-with-param-O2.ll

llvm/test/Transforms/Coroutines/coro-alloc-with-param.ll

llvm/test/Transforms/Coroutines/coro-catchswitch.ll

llvm/test/Transforms/Coroutines/coro-debug.ll

llvm/test/Transforms/Coroutines/coro-eh-aware-edge-split-00.ll

llvm/test/Transforms/Coroutines/coro-eh-aware-edge-split-01.ll

llvm/test/Transforms/Coroutines/coro-eh-aware-edge-split-02.ll

llvm/test/Transforms/Coroutines/coro-eh-aware-edge-split.ll

llvm/test/Transforms/Coroutines/coro-frame-arrayalloca.ll

llvm/test/Transforms/Coroutines/coro-frame-unreachable.ll

llvm/test/Transforms/Coroutines/coro-frame.ll

llvm/test/Transforms/Coroutines/coro-materialize.ll

llvm/test/Transforms/Coroutines/coro-padding.ll

llvm/test/Transforms/Coroutines/coro-param-copy.ll

llvm/test/Transforms/Coroutines/coro-spill-after-phi.ll

llvm/test/Transforms/Coroutines/coro-spill-corobegin.ll

llvm/test/Transforms/Coroutines/coro-split-00.ll

llvm/test/Transforms/Coroutines/coro-split-02.ll

llvm/test/Transforms/Coroutines/coro-split-alloc.ll

llvm/test/Transforms/Coroutines/coro-split-dbg.ll

llvm/test/Transforms/Coroutines/coro-split-eh-00.ll

llvm/test/Transforms/Coroutines/coro-split-eh-01.ll

llvm/test/Transforms/Coroutines/coro-split-eh.ll

llvm/test/Transforms/Coroutines/coro-split-hidden.ll

llvm/test/Transforms/Coroutines/coro-split-musttail.ll

llvm/test/Transforms/Coroutines/coro-split-musttail1.ll

llvm/test/Transforms/Coroutines/no-suspend.ll

llvm/test/Transforms/Coroutines/restart-trigger.ll

[Coroutines][2/6] New pass manager: coro-split
ClosedPublic