This is an archive of the discontinued LLVM Phabricator instance.

Differential D90772

[Coroutines] Add missing llvm.dbg.declare's to cover more allocas
ClosedPublic

Authored by bruno on Nov 4 2020, 10:00 AM.

Details

Reviewers

jmorse
lxfind
aprantl
GorNishanov
vsk

Commits

rGdc14542a71f8: [Coroutines] Add missing llvm.dbg.declare's to cover for more allocas

Summary

Tracking local variables across suspend points is still somewhat incomplete. Consider this coroutine:

// Complete code here: https://gist.github.com/bcardosolopes/a992950bdfc66ab4ce6dc0c75920f4ef
resumable foo() {
  int x[10] = {};
  int a = 3;
  co_await std::experimental::suspend_always();
  a++;
  x[0] = 1;
  a += 2;
  x[1] = 2;
  a += 3;
  x[2] = 3;
}

lldb can't print a or x if they turn out to be allocas during CoroSplit (which happens when this code is built with -O0 against ToT):

* thread #1, queue = 'com.apple.main-thread', stop reason = step over
    frame #0: 0x0000000100003729 main-noprint`foo() at main-noprint.cpp:43:5
   40     co_await std::experimental::suspend_always();
   41     a++;
   42     x[0] = 1;
-> 43     a += 2;
   44     x[1] = 2;
   45     a += 3;
   46     x[2] = 3;
(lldb) p x
error: <user expression 21>:1:1: use of undeclared identifier 'x'
x
^

The generated IR contains an llvm.dbg.declare for x in its initialization basic block. However, even though this BB dominates all other BBs where x is manipulated, that doesn't seem to be enough debug info for the debugger to be happy. By adding extra llvm.dbg.declares in these BBs, lldb prints x successfully. Is this how llvm.dbg.declare is supposed to work, or am I missing something? Given the observed behavior, this patch improves CoroSplit by placing extra llvm.dbg.declares in all basic blocks that need some "refresh" for the frame location to be found, so this:

await.ready:
  ...
  %arrayidx = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 0, !dbg !760
  ...
  %arrayidx19 = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 1, !dbg !763
  ...
  %arrayidx21 = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 2, !dbg !766

becomes:

await.ready:
  ...
  call void @llvm.dbg.declare(metadata [10 x i32]* %x.reload.addr, metadata !751, metadata !DIExpression()), !dbg !753
  ...
  %arrayidx = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 0, !dbg !760
  ...
  %arrayidx19 = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 1, !dbg !763
  ...
  %arrayidx21 = getelementptr inbounds [10 x i32], [10 x i32]* %x.reload.addr, i64 0, i64 2, !dbg !766

For additional context, this builds on the changes from D75338 back in February. I also plan to add an LLDB end-to-end test for coroutines in a follow-up patch once this is fixed.
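
The per-block placement described in the summary can be sketched without LLVM's API. This is only an illustration under stated assumptions: `Use` and `blocksNeedingDebugRecord` are hypothetical names, and a "block" is reduced to its name. It shows the idea of adding at most one extra debug record per basic block that uses the spilled alloca, while skipping blocks that are already covered.

```cpp
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for an instruction that uses a spilled alloca:
// only the name of its parent basic block matters for this sketch.
struct Use {
  std::string Block;
};

// Returns, in first-seen order, the blocks that should receive an extra
// debug record: every block containing a use, except blocks already covered.
// AlreadyCovered models the block that holds the original dbg.declare.
std::vector<std::string>
blocksNeedingDebugRecord(const std::vector<Use> &Uses,
                         const std::set<std::string> &AlreadyCovered) {
  std::set<std::string> Seen = AlreadyCovered;
  std::vector<std::string> Result;
  for (const Use &U : Uses) {
    // insert() reports whether the block was new, so each block is
    // recorded at most once no matter how many uses it contains.
    if (Seen.insert(U.Block).second)
      Result.push_back(U.Block);
  }
  return Result;
}
```

With one use in init.ready (already covered) and three uses in await.ready, the function returns a single entry for await.ready, which matches the one extra intrinsic shown in the "becomes" IR above.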

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

bruno created this revision. Nov 4 2020, 10:00 AM

Herald added a project: Restricted Project. Nov 4 2020, 10:00 AM

Herald added subscribers: llvm-commits, modimo, wenlei and 3 others.

bruno requested review of this revision. Nov 4 2020, 10:00 AM

Thank you for working on this!

llvm/lib/Transforms/Coroutines/CoroFrame.cpp
1226	Do we need to insert to every BB that uses it though? Though this may be the safest way to guarantee there is at least one, so I don't object doing this. Also want to point out this: https://llvm.org/docs/SourceLevelDebugging.html#llvm-dbg-declare it says there can only be one dbg.declare. However in practice I think as long as they all look the same it should be fine.

davide removed a subscriber: davide. Nov 4 2020, 11:38 AM

bruno added inline comments. Nov 4 2020, 12:00 PM

llvm/lib/Transforms/Coroutines/CoroFrame.cpp
1226	Do we need to insert to every BB that uses it though? Though this may be the safest way to guarantee there is at least one, so I don't object doing this.

That's a good question. I initially thought a `dbg.declare` dominating the BB in question would be enough; perhaps the debugging experts could help clarify what's going on.

Also want to point out this: https://llvm.org/docs/SourceLevelDebugging.html#llvm-dbg-declare it says there can only be one dbg.declare. However in practice I think as long as they all look the same it should be fine.

Yep, I also saw this but didn't get any verifier complaints (perhaps because they look the same). I guess we have another question for the experts :)

bruno wrote:

That's a good question, I initially thought a dbg.declare dominating the BB in question would be enough, perhaps the debugging experts could help clarify what's going on.

Hrrrmmmm. Just to confirm my understanding, in tests like the one modified in this patch, all those allocas (%i, %j, %x) are transformed into GEPs from FramePtr, which is malloc'd / new'd or otherwise not on the stack?

Normally dbg.declare is supposed to refer to a static alloca -- that way we can easily identify stack variables and not bother tracking them through optimisations. Sometimes this isn't true though (certain ABIs pass in memory for return values apparently), in which case the dbg.declare gets silently transformed into something like dbg.value(%0, [...], DW_OP_deref), which is what seems to happen here.

That should be fine: however as you've already spotted, the debug intrinsic needs to dominate the instructions that receive the variable location, and that doesn't seem to be the case in the given example. Here's a gist with the input test.cpp at the bottom, and IR for foo.resume on IR line 693 onwards. I've compiled this with a ~6 week old clang master and -emit-llvm -S -g -O0 -fcoroutines-ts -stdlib=libc++. I'm not familiar with the coroutine implementation details and assumed:

foo.resume is the function we care about as it's the first function gdb stops in when I set breakpoints in the "foo" function,
The await.ready block is the main part of the foo.ready function, and everything else is coroutine plumbing.

From the entry block, there's a path through the blocks thus:

resume.entry
resume.1
resume.1.landing
AfterCoroSuspend50
await.ready

Which reaches the await.ready block without going through any debug intrinsics. The variable locations won't be propagated into the await.ready block because of this path, as there's no information about variable locations on it. This matches what dwarfdump reports: there's a variable location for _some_ of the function, but not all of it, and not the part that the developer wants to step through.

I'm not familiar with the coroutine implementation details, glad to be corrected. (CC @Orlando @chrisjackson , this is an example of where redesigned intrinsics would definitely need to refer to variables in an arbitrary memory location, ouch).
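
The dominance argument above can be checked on a toy control-flow graph. This is a minimal sketch, not LLVM's DominatorTree: `CFG`, `reachableAvoiding`, `dominates`, `originalShape` and `resumeShape` are hypothetical names, and the graph shapes are simplified assumptions. Only the path resume.entry, resume.1, resume.1.landing, AfterCoroSuspend50, await.ready comes from the comment above; the `resume.0` block and the edges through init.ready are invented for illustration.

```cpp
#include <map>
#include <set>
#include <string>
#include <vector>

// Control-flow graph as an adjacency list keyed by basic-block name.
using CFG = std::map<std::string, std::vector<std::string>>;

// Returns true if Target can be reached from Entry without entering Avoid.
// An empty Avoid name means "avoid nothing" (no block is named "").
bool reachableAvoiding(const CFG &G, const std::string &Entry,
                       const std::string &Target, const std::string &Avoid) {
  if (Entry == Avoid)
    return false;
  std::set<std::string> Visited{Entry};
  std::vector<std::string> Stack{Entry};
  while (!Stack.empty()) {
    std::string BB = Stack.back();
    Stack.pop_back();
    if (BB == Target)
      return true;
    auto It = G.find(BB);
    if (It == G.end())
      continue; // block without successors
    for (const std::string &Succ : It->second)
      if (Succ != Avoid && Visited.insert(Succ).second)
        Stack.push_back(Succ);
  }
  return false;
}

// Def dominates Use when Use is reachable from Entry, but not once Def is
// removed from the graph. Unreachable blocks are treated as not dominated
// here, which is a simplification.
bool dominates(const CFG &G, const std::string &Entry, const std::string &Def,
               const std::string &Use) {
  if (Def == Use)
    return true;
  return reachableAvoiding(G, Entry, Use, "") &&
         !reachableAvoiding(G, Entry, Use, Def);
}

// Assumed, simplified shape of the original coroutine before splitting.
CFG originalShape() {
  return {{"entry", {"init.ready"}},
          {"init.ready", {"await.suspend", "await.ready"}},
          {"await.suspend", {"await.ready"}}};
}

// Assumed shape of the resume funclet. The resume.1 path is the one quoted
// above; resume.0 and its edges are hypothetical.
CFG resumeShape() {
  return {{"resume.entry", {"resume.0", "resume.1"}},
          {"resume.0", {"init.ready"}},
          {"init.ready", {"await.ready"}},
          {"resume.1", {"resume.1.landing"}},
          {"resume.1.landing", {"AfterCoroSuspend50"}},
          {"AfterCoroSuspend50", {"await.ready"}}};
}
```

Under these assumptions, `dominates(originalShape(), "entry", "init.ready", "await.ready")` is true, while `dominates(resumeShape(), "resume.entry", "init.ready", "await.ready")` is false because the resume.1 path bypasses init.ready. That is the situation in which a debug intrinsic placed only in init.ready gives no location inside await.ready.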

Hi Jeremy, thanks for taking a deep look here.

Hrrrmmmm. Just to confirm my understanding, in tests like the one modified in this patch, all those allocas (%i, %j, %x) are transformed into GEPs from FramePtr, which is malloc'd / new'd or otherwise not on the stack?

That's right.

Normally dbg.declare is supposed to refer to a static alloca -- that way we can easily identify stack variables and not bother tracking them through optimisations. Sometimes this isn't true though (certain ABIs pass in memory for return values apparently), in which case the dbg.declare gets silently transformed into something like dbg.value(%0, [...], DW_OP_deref), which is what seems to happen here.

Can you clarify what you mean by "silently transformed into something like dbg.value..."? In what stage do you see that happening? Those don't show up for me unless the local variables are already promoted to regs.

That should be fine: however as you've already spotted, the debug intrinsic needs to dominate the instructions that receive the variable location, and that doesn't seem to be the case in the given example. Here's a gist with the input test.cpp at the bottom, and IR for foo.resume on IR line 693 onwards. I've compiled this with a ~6 week old clang master and -emit-llvm -S -g -O0 -fcoroutines-ts -stdlib=libc++. I'm not familiar with the coroutine implementation details and assumed:

foo.resume is the function we care about as it's the first function gdb stops in when I set breakpoints in the "foo" function,

The await.ready block is the main part of the foo.ready function, and everything else is coroutine plumbing.

Yep.

From the entry block, there's a path through the blocks thus:
resume.entry
resume.1
resume.1.landing
AfterCoroSuspend50
await.ready
Which reaches the await.ready block without going through any debug intrinsics. The variable locations won't be propagated into the await.ready block because of this path, as there's no information about variable locations on it. This matches what dwarfdump reports: there's a variable location for _some_ of the function, but not all of it, and not the part that the developer wants to step through.

If you look at the final IR, this is true and it actually confirms why it wasn't working before, thanks! However, at the point where insertSpills is called and the logic in this patch runs, there's no resume function just yet, since it's operating on the original foo, and in that context init.ready (which already has a dbg.declare) still dominates await.ready.

Given that, another approach would be to redo this logic on top of the already formed resume function and only re-insert the dbg.declares in BBs that are not dominated. One possible drawback of this approach is that it would require applying the same logic to init, resume and destroy, and doing another CFG walk for each alloca spill. That doesn't sound super expensive, but the convenience of doing it in the original function instead sounds compelling. What's the impact of having redundant dbg.declares in already dominated paths? Is there a pass that cleans those up? Thoughts?

I'm not familiar with the coroutine implementation details, glad to be corrected. (CC @Orlando @chrisjackson , this is an example of where redesigned intrinsics would definitely need to refer to variables in an arbitrary memory location, ouch).

junparser added a subscriber: junparser. Nov 6 2020, 12:01 AM

This patch works for llvm.dbg.addr. So what is the status of llvm.dbg.addr? Does anyone know about it?

This patch works for llvm.dbg.addr. So what is the status of llvm.dbg.addr? Does anyone know about it?

Not sure I follow your comment; did you mean "works for llvm.dbg.declare, what about llvm.dbg.addr"? From a quick code search, it seems the only way to get llvm.dbg.addr intrinsics is via the llvm option -use-dbg-addr. This option doesn't seem to be used by clang (maybe some other frontends do), and using it means changing getDeclareIntrin() to give you .addrs instead of .declare, so I assume this change should work for both?

Hi Bruno,

Can you clarify what you mean by "silently transformed into something like dbg.value..."? In what stage do you see that happening? Those don't show up for me unless the local variables are already promoted to regs.

During instruction selection, so quite late: if you run "llc -stop-before=finalize-isel" on some IR that contains a dbg.declare, if it refers to a stack slot then the variable location is attached to the stack slot:

stack:
  - { id: 0, name: __x.addr, type: default, offset: 0, size: 8, alignment : 8,
      stack-id: default, callee-saved-register: '', callee-saved-restored : true,
      debug-info-variable: '!692', debug-info-expression: '!DIExpression()',
      debug-info-location: '!693' }

However if the dbg.declare can't be tracked back to a stack slot, it becomes a "DBG_VALUE" machine instruction (equivalent to a dbg.value intrinsic):

DBG_VALUE %26, 0, !704, !DIExpression(), debug-location !706

The former doesn't need to worry about control flow because the variable is always homed in a stack slot; the latter does need to worry about control flow. I imagine that everything to do with coroutines will take the latter path.

What's the impact of having redundant dbg.declares in already dominated paths? Is there a pass that cleans those up? Thoughts?

Zero functional change, and a tiny performance cost from having one extra metadata instruction hanging around. If you go down this route, I'd recommend using the dbg.value intrinsic with a DIExpression with a single DW_OP_deref -- this is effectively what dbg.declare becomes as shown above, and avoids any unexpected surprises if it turns out some code somewhere really does expect only one dbg.declare to exist.

junparser wrote:

This patch works for llvm.dbg.addr. So what is the status of llvm.dbg.addr? Does anyone know about it?

In theory this kind of behaviour is exactly what dbg.addr is for (variable lives in memory, maybe changes due to control flow). However moving everything to use dbg.addr stalled a long time ago, I believe @Orlando found that it's often unexpectedly dropped by optimisations, any opinion on whether it's usable @Orlando? We're hoping to use it for things someday soon, but we're definitely not there yet.
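
To make the DW_OP_deref recommendation above concrete, here is a toy model of how a debugger could interpret a location operand with and without a dereference. It is a sketch under stated assumptions, not DWARF or LLVM code: `Memory`, `Op` and `evaluate` are hypothetical names, and the only operation modelled is one named after DW_OP_deref.

```cpp
#include <cstdint>
#include <map>
#include <vector>

// Toy memory: address -> 32-bit value. Stands in for the heap-allocated
// coroutine frame that the spilled variables live in.
using Memory = std::map<std::uint64_t, std::int32_t>;

// The only operation modelled here, named after DWARF's DW_OP_deref.
enum class Op { Deref };

// Evaluates a location: start from the operand and apply each operation.
// With an empty expression the operand itself is reported as the value.
// With a single Deref the operand is treated as an address and the value
// is loaded from memory. Throws std::out_of_range for unmapped addresses.
std::int64_t evaluate(std::uint64_t Operand, const std::vector<Op> &Expr,
                      const Memory &Mem) {
  std::int64_t Value = static_cast<std::int64_t>(Operand);
  for (Op O : Expr) {
    if (O == Op::Deref)
      Value = Mem.at(static_cast<std::uint64_t>(Value));
  }
  return Value;
}
```

If the frame slot for `a` sits at a hypothetical address 0x1000 and holds 3, then `evaluate(0x1000, {Op::Deref}, Mem)` yields 3, whereas `evaluate(0x1000, {}, Mem)` yields the address itself. This is why a dbg.value whose operand is a pointer into the frame needs the dereference to describe the variable's contents rather than its address.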

Hi Jeremy,

However if the dbg.declare can't be tracked back to a stack slot, it becomes a "DBG_VALUE" machine instruction (equivalent to a dbg.value intrinsic):
DBG_VALUE %26, 0, !704, !DIExpression(), debug-location !706
The former doesn't need to worry about control flow because the variable is always homed in a stack slot; the latter does need to worry about control flow. I imagine that everything to do with coroutines will take the latter path.

I see now, thanks for sharing!

What's the impact of having redundant dbg.declares in already dominated paths? Is there a pass that cleans those up? Thoughts?

Zero functional change, and a tiny performance cost from having one extra metadata instruction hanging around. If you go down this route, I'd recommend using the dbg.value intrinsic with a DIExpression with a single DW_OP_deref -- this is effectively what dbg.declare becomes as shown above, and avoids any unexpected surprises if it turns out some code somewhere really does expect only one dbg.declare to exist.

Gotcha, I like your recommendation, let's prevent unexpected surprises. I've tested the approach and it works the same in the final debugging experience. Will update the patch.

Updated comments; now uses dbg.value after @jmorse's feedback.

Could you add a test (or update existing tests) to demonstrate that the issue is fixed?

Jeremy said:

In theory this kind of behaviour is exactly what dbg.addr is for (variable lives in memory, maybe changes due to control flow). However moving everything to use dbg.addr stalled a long time ago, I believe @Orlando found that it's often unexpectedly dropped by optimisations, any opinion on whether it's usable @Orlando? We're hoping to use it for things someday soon, but we're definitely not there yet.

Yes that's right. It wasn't a particularly deep dive but from recent experience I would advise avoiding dbg.addr for now.

Could you add a test (or update existing tests) to demonstrate that the issue is fixed?

The testcase was updated as part of the last diff update, anything specific you are looking for?

In D90772#2383597, @bruno wrote:

Could you add a test (or update existing tests) to demonstrate that the issue is fixed?

The testcase was updated as part of the last diff update, anything specific you are looking for?

Sorry never mind. I wasn't looking at it correctly.

LGTM. Thanks!

This revision is now accepted and ready to land. Nov 9 2020, 5:25 PM

Hi Bruno,
Another issue is that it is also necessary to track coroutine function parameters correctly at the O1/O2 levels, which use dbg.value. Thoughts?

Hi @junparser,

Another issue is that it is also necessary to track coroutine function parameters correctly at the O1/O2 levels, which use dbg.value. Thoughts?

I haven't looked up the coroutine function parameter debugging quality yet, does it even work fine for -O0? I'm trying to go over the issues I find for -O0 first and incrementally improve on top of that. Any specific testcase in mind?

In D90772#2384353, @bruno wrote:

Hi @junparser,

Another issue is that it is also necessary to track coroutine function parameters correctly at the O1/O2 levels, which use dbg.value. Thoughts?

I haven't looked up the coroutine function parameter debugging quality yet, does it even work fine for -O0? I'm trying to go over the issues I find for -O0 first and incrementally improve on top of that. Any specific testcase in mind?

I'm not sure about it, but it should not work fine for O1 and above. I do not have a testcase right now; you can just change your testcase to add some parameters and use them.

Closed by commit rGdc14542a71f8: [Coroutines] Add missing llvm.dbg.declare's to cover for more allocas (authored by bruno). Nov 10 2020, 12:36 PM

This revision was automatically updated to reflect the committed changes.

bruno added a commit: rGdc14542a71f8: [Coroutines] Add missing llvm.dbg.declare's to cover for more allocas.

junparser mentioned this in D91305: [Coroutine] Allocas used by StoreInst does not always escape. Nov 13 2020, 8:54 PM

junparser mentioned this in D92462: [Coroutines] Add DW_OP_deref for transformed dbg.value intrinsic.. Dec 1 2020, 10:56 PM

dongAxis1944 added a subscriber: dongAxis1944. Dec 2 2020, 3:04 AM

Herald added a subscriber: hoy. Dec 2 2020, 3:04 AM

bruno mentioned this in D93497: Salvage debug info for function arguments in coro-split funclets.. Jan 25 2021, 11:28 PM

aprantl added a reverting change: rG0554541b4454: Salvage debug info for function arguments in coro-split funclets.. Jan 26 2021, 3:01 PM

Revision Contents

Path	Size
llvm/lib/Transforms/Coroutines/CoroFrame.cpp	46 lines
llvm/test/Transforms/Coroutines/coro-debug-frame-variable.ll	34 lines

Diff 304305

llvm/lib/Transforms/Coroutines/CoroFrame.cpp

(1,185 preceding lines not shown)
     for (User *U : Alloca->users()) {
       auto *I = cast<Instruction>(U);
       if (DT.dominates(CB, I))
         UsersToUpdate.push_back(I);
     }
     if (UsersToUpdate.empty())
       continue;
     auto *G = GetFramePointer(Alloca);
     G->setName(Alloca->getName() + Twine(".reload.addr"));

+    SmallPtrSet<BasicBlock *, 4> SeenDbgBBs;
     TinyPtrVector<DbgDeclareInst *> DIs = FindDbgDeclareUses(Alloca);
-    if (!DIs.empty())
-      DIBuilder(*Alloca->getModule(),
-                /*AllowUnresolved*/ false)
-          .insertDeclare(G, DIs.front()->getVariable(),
-                         DIs.front()->getExpression(),
-                         DIs.front()->getDebugLoc(), DIs.front());
+    DIBuilder DIB(*Alloca->getModule(), /*AllowUnresolved*/ false);
+    Instruction *FirstDbgDecl = nullptr;
+
+    if (!DIs.empty()) {
+      FirstDbgDecl = DIB.insertDeclare(G, DIs.front()->getVariable(),
+                                       DIs.front()->getExpression(),
+                                       DIs.front()->getDebugLoc(), DIs.front());
+      SeenDbgBBs.insert(DIs.front()->getParent());
+    }
     for (auto *DI : FindDbgDeclareUses(Alloca))
       DI->eraseFromParent();
     replaceDbgUsesWithUndef(Alloca);

-    for (Instruction *I : UsersToUpdate)
+    for (Instruction *I : UsersToUpdate) {
       I->replaceUsesOfWith(Alloca, G);
+
+      // After cloning, transformations might not guarantee that all uses
+      // of this alloca are dominated by the already existing dbg.declare's,
+      // compromising the debug quality. Instead of writing another
+      // transformation to patch each clone, go ahead and early populate
+      // basic blocks that use such allocas with more debug info.
+      if (SeenDbgBBs.count(I->getParent()))
+        continue;
+
+      // If there isn't a prior dbg.declare for this alloca, it probably
+      // means the state hasn't changed prior to one of the relevant suspend
+      // point for this frame access.
+      if (!FirstDbgDecl)
+        continue;
+
+      // These instructions are all dominated by the alloca, insert the
+      // dbg.value in the beginning of the BB to enhance debugging
+      // experience and allow values to be inspected as early as possible.
+      // Prefer dbg.value over dbg.declare since it better sets expectations
+      // that control flow can be later changed by other passes.
+      auto *DI = cast<DbgDeclareInst>(FirstDbgDecl);
+      BasicBlock *CurrentBlock = I->getParent();
+      DIB.insertDbgValueIntrinsic(G, DI->getVariable(), DI->getExpression(),
+                                  DI->getDebugLoc(),
+                                  &*CurrentBlock->getFirstInsertionPt());
+      SeenDbgBBs.insert(CurrentBlock);
+    }
   }
   Builder.SetInsertPoint(FramePtr->getNextNode());
   for (const auto &A : FrameData.Allocas) {
     AllocaInst *Alloca = A.Alloca;
     if (A.MayWriteBeforeCoroBegin) {
       // isEscaped really means potentially modified before CoroBegin.
       if (Alloca->isArrayAllocation())
         report_fatal_error(
(981 following lines not shown)

llvm/test/Transforms/Coroutines/coro-debug-frame-variable.ll

 ; RUN: opt < %s -O0 -enable-coroutines -S | FileCheck %s
 ; RUN: opt < %s -passes='default<O0>' -enable-coroutines -S | FileCheck %s

 ; Define a function 'f' that resembles the Clang frontend's output for the
 ; following C++ coroutine:
 ;
 ; void foo() {
 ;   int i = 0;
 ;   ++i;
+;   int x = {};
 ;   print(i); // Prints '1'
 ;
 ;   co_await suspend_always();
 ;
 ;   int j = 0;
+;   x[0] = 1;
+;   x[1] = 2;
 ;   ++i;
 ;   print(i); // Prints '2'
 ;   ++j;
 ;   print(j); // Prints '1'
+;   print(x); // Print '1'
 ; }
 ;
 ; The CHECKs verify that dbg.declare intrinsics are created for the coroutine
 ; funclet 'f.resume', and that they reference the address of the variables on
 ; the coroutine frame. The debug locations for the original function 'f' are
 ; static (!11 and !13), whereas the coroutine funclet will have its own new
 ; ones with identical line and column numbers.
 ;
 ; CHECK-LABEL: define void @f() {
 ; CHECK: entry:
 ; CHECK: %j = alloca i32, align 4
 ; CHECK: [[IGEP:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 4
+; CHECK: [[XGEP:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 6
 ; CHECK: init.ready:
 ; CHECK: call void @llvm.dbg.declare(metadata i32* [[IGEP]], metadata ![[IVAR:[0-9]+]], metadata !DIExpression()), !dbg ![[IDBGLOC:[0-9]+]]
+; CHECK: call void @llvm.dbg.declare(metadata [10 x i32]* [[XGEP]], metadata ![[XVAR:[0-9]+]], metadata !DIExpression()), !dbg ![[IDBGLOC]]
 ; CHECK: await.ready:
+; CHECK: call void @llvm.dbg.value(metadata [10 x i32]* [[XGEP]], metadata ![[XVAR]], metadata !DIExpression()), !dbg ![[IDBGLOC]]
+; CHECK: call void @llvm.dbg.value(metadata i32* [[IGEP]], metadata ![[IVAR]], metadata !DIExpression()), !dbg ![[IDBGLOC]]
 ; CHECK: call void @llvm.dbg.declare(metadata i32* %j, metadata ![[JVAR:[0-9]+]], metadata !DIExpression()), !dbg ![[JDBGLOC:[0-9]+]]
 ;
 ; CHECK-LABEL: define internal fastcc void @f.resume({{.*}}) {
 ; CHECK: entry.resume:
 ; CHECK: %j = alloca i32, align 4
 ; CHECK: [[IGEP_RESUME:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 4
+; CHECK: [[XGEP_RESUME:%.+]] = getelementptr inbounds %f.Frame, %f.Frame* %FramePtr, i32 0, i32 6
 ; CHECK: init.ready:
 ; CHECK: call void @llvm.dbg.declare(metadata i32* [[IGEP_RESUME]], metadata ![[IVAR_RESUME:[0-9]+]], metadata !DIExpression()), !dbg ![[IDBGLOC_RESUME:[0-9]+]]
+; CHECK: call void @llvm.dbg.declare(metadata [10 x i32]* [[XGEP_RESUME]], metadata ![[XVAR_RESUME:[0-9]+]], metadata !DIExpression()), !dbg ![[IDBGLOC_RESUME]]
 ; CHECK: await.ready:
+; CHECK: call void @llvm.dbg.value(metadata [10 x i32]* [[XGEP_RESUME]], metadata ![[XVAR_RESUME]], metadata !DIExpression()), !dbg ![[IDBGLOC_RESUME]]
+; CHECK: call void @llvm.dbg.value(metadata i32* [[IGEP_RESUME]], metadata ![[IVAR_RESUME]], metadata !DIExpression()), !dbg ![[IDBGLOC_RESUME]]
 ; CHECK: call void @llvm.dbg.declare(metadata i32* %j, metadata ![[JVAR_RESUME:[0-9]+]], metadata !DIExpression()), !dbg ![[JDBGLOC_RESUME:[0-9]+]]
 ;
 ; CHECK: ![[IVAR]] = !DILocalVariable(name: "i"
 ; CHECK: ![[SCOPE:[0-9]+]] = distinct !DILexicalBlock(scope: !8, file: !1, line: 23, column: 12)
 ; CHECK: ![[IDBGLOC]] = !DILocation(line: 24, column: 7, scope: ![[SCOPE]])
+; CHECK: ![[XVAR]] = !DILocalVariable(name: "x"
 ; CHECK: ![[JVAR]] = !DILocalVariable(name: "j"
 ; CHECK: ![[JDBGLOC]] = !DILocation(line: 32, column: 7, scope: ![[SCOPE]])

 ; CHECK: ![[IVAR_RESUME]] = !DILocalVariable(name: "i"
 ; CHECK: ![[RESUME_SCOPE:[0-9]+]] = distinct !DILexicalBlock(scope: !8, file: !1, line: 23, column: 12)
 ; CHECK: ![[IDBGLOC_RESUME]] = !DILocation(line: 24, column: 7, scope: ![[RESUME_SCOPE]])
+; CHECK: ![[XVAR_RESUME]] = !DILocalVariable(name: "x"
 ; CHECK: ![[JVAR_RESUME]] = !DILocalVariable(name: "j"
 ; CHECK: ![[JDBGLOC_RESUME]] = !DILocation(line: 32, column: 7, scope: ![[RESUME_SCOPE]])
 define void @f() {
 entry:
   %__promise = alloca i8, align 8
   %i = alloca i32, align 4
   %j = alloca i32, align 4
+  %x = alloca [10 x i32], align 16
   %id = call token @llvm.coro.id(i32 16, i8* %__promise, i8* null, i8* null)
   %alloc = call i1 @llvm.coro.alloc(token %id)
   br i1 %alloc, label %coro.alloc, label %coro.init

 coro.alloc: ; preds = %entry
   %size = call i64 @llvm.coro.size.i64()
   %memory = call i8* @new(i64 %size)
   br label %coro.init
(18 lines not shown)

 init.ready: ; preds = %init.suspend, %coro.init
   call void @await_resume()
   call void @llvm.dbg.declare(metadata i32* %i, metadata !6, metadata !DIExpression()), !dbg !11
   store i32 0, i32* %i, align 4
   %i.init.ready.load = load i32, i32* %i, align 4
   %i.init.ready.inc = add nsw i32 %i.init.ready.load, 1
   store i32 %i.init.ready.inc, i32* %i, align 4
+  call void @llvm.dbg.declare(metadata [10 x i32]* %x, metadata !14, metadata !DIExpression()), !dbg !11
+  %memset = bitcast [10 x i32]* %x to i8*, !dbg !11
+  call void @llvm.memset.p0i8.i64(i8* align 16 %memset, i8 0, i64 40, i1 false), !dbg !11
   %i.init.ready.reload = load i32, i32* %i, align 4
   call void @print(i32 %i.init.ready.reload)
   %ready.again = call zeroext i1 @await_ready()
   br i1 %ready.again, label %await.ready, label %await.suspend

 await.suspend: ; preds = %init.ready
   %save.again = call token @llvm.coro.save(i8* null)
   %from.address = call i8* @from_address(i8* %begin)
   call void @await_suspend()
   %suspend.again = call i8 @llvm.coro.suspend(token %save.again, i1 false)
   switch i8 %suspend.again, label %coro.ret [
     i8 0, label %await.ready
     i8 1, label %await.cleanup
   ]

 await.cleanup: ; preds = %await.suspend
   br label %cleanup

 await.ready: ; preds = %await.suspend, %init.ready
   call void @await_resume()
   call void @llvm.dbg.declare(metadata i32* %j, metadata !12, metadata !DIExpression()), !dbg !13
   store i32 0, i32* %j, align 4
+  %arrayidx0 = getelementptr inbounds [10 x i32], [10 x i32]* %x, i64 0, i64 0, !dbg !18
+  store i32 1, i32* %arrayidx0, align 16, !dbg !19
+  %arrayidx1 = getelementptr inbounds [10 x i32], [10 x i32]* %x, i64 0, i64 1, !dbg !20
+  store i32 2, i32* %arrayidx1, align 4, !dbg !21
   %i.await.ready.load = load i32, i32* %i, align 4
   %i.await.ready.inc = add nsw i32 %i.await.ready.load, 1
   store i32 %i.await.ready.inc, i32* %i, align 4
   %j.await.ready.load = load i32, i32* %j, align 4
   %j.await.ready.inc = add nsw i32 %j.await.ready.load, 1
   store i32 %j.await.ready.inc, i32* %j, align 4
   %i.await.ready.reload = load i32, i32* %i, align 4
	call void @print(i32 %i.await.ready.reload)			call void @print(i32 %i.await.ready.reload)
	; ... 66 lines not shown ...
	declare i1 @await_ready()			declare i1 @await_ready()
	declare void @await_suspend()			declare void @await_suspend()
	declare void @await_resume()			declare void @await_resume()
	declare void @print(i32)			declare void @print(i32)
	declare i8* @from_address(i8*)			declare i8* @from_address(i8*)
	declare void @return_void()			declare void @return_void()
	declare void @final_suspend()			declare void @final_suspend()

				declare void @llvm.memset.p0i8.i64(i8* nocapture writeonly, i8, i64, i1 immarg)

	!llvm.dbg.cu = !{!0}			!llvm.dbg.cu = !{!0}
	!llvm.linker.options = !{}			!llvm.linker.options = !{}
	!llvm.module.flags = !{!3, !4}			!llvm.module.flags = !{!3, !4}
	!llvm.ident = !{!5}			!llvm.ident = !{!5}

	!0 = distinct !DICompileUnit(language: DW_LANG_C_plus_plus_14, file: !1, producer: "clang version 11.0.0", isOptimized: false, runtimeVersion: 0, emissionKind: FullDebug, enums: !2, retainedTypes: !2, splitDebugInlining: false, nameTableKind: None)			!0 = distinct !DICompileUnit(language: DW_LANG_C_plus_plus_14, file: !1, producer: "clang version 11.0.0", isOptimized: false, runtimeVersion: 0, emissionKind: FullDebug, enums: !2, retainedTypes: !2, splitDebugInlining: false, nameTableKind: None)
	!1 = !DIFile(filename: "repro.cpp", directory: ".")			!1 = !DIFile(filename: "repro.cpp", directory: ".")
	!2 = !{}			!2 = !{}
	!3 = !{i32 7, !"Dwarf Version", i32 4}			!3 = !{i32 7, !"Dwarf Version", i32 4}
	!4 = !{i32 2, !"Debug Info Version", i32 3}			!4 = !{i32 2, !"Debug Info Version", i32 3}
	!5 = !{!"clang version 11.0.0"}			!5 = !{!"clang version 11.0.0"}
	!6 = !DILocalVariable(name: "i", scope: !7, file: !1, line: 24, type: !10)			!6 = !DILocalVariable(name: "i", scope: !7, file: !1, line: 24, type: !10)
	!7 = distinct !DILexicalBlock(scope: !8, file: !1, line: 23, column: 12)			!7 = distinct !DILexicalBlock(scope: !8, file: !1, line: 23, column: 12)
	!8 = distinct !DISubprogram(name: "foo", linkageName: "_Z3foov", scope: !1, file: !1, line: 23, type: !9, scopeLine: 23, flags: DIFlagPrototyped, spFlags: DISPFlagDefinition, unit: !0, retainedNodes: !2)			!8 = distinct !DISubprogram(name: "foo", linkageName: "_Z3foov", scope: !1, file: !1, line: 23, type: !9, scopeLine: 23, flags: DIFlagPrototyped, spFlags: DISPFlagDefinition, unit: !0, retainedNodes: !2)
	!9 = !DISubroutineType(types: !2)			!9 = !DISubroutineType(types: !2)
	!10 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)			!10 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)
	!11 = !DILocation(line: 24, column: 7, scope: !7)			!11 = !DILocation(line: 24, column: 7, scope: !7)
	!12 = !DILocalVariable(name: "j", scope: !7, file: !1, line: 32, type: !10)			!12 = !DILocalVariable(name: "j", scope: !7, file: !1, line: 32, type: !10)
	!13 = !DILocation(line: 32, column: 7, scope: !7)			!13 = !DILocation(line: 32, column: 7, scope: !7)
				!14 = !DILocalVariable(name: "x", scope: !22, file: !1, line: 34, type: !15)
				!15 = !DICompositeType(tag: DW_TAG_array_type, baseType: !10, size: 320, elements: !16)
				!16 = !{!17}
				!17 = !DISubrange(count: 10)
				!18 = !DILocation(line: 42, column: 3, scope: !7)
				!19 = !DILocation(line: 42, column: 8, scope: !7)
				!20 = !DILocation(line: 43, column: 3, scope: !7)
				!21 = !DILocation(line: 43, column: 8, scope: !7)
				!22 = distinct !DILexicalBlock(scope: !8, file: !1, line: 23, column: 12)
				No newline at end of file

Revision Contents

Diff 304305

llvm/lib/Transforms/Coroutines/CoroFrame.cpp

llvm/test/Transforms/Coroutines/coro-debug-frame-variable.ll
