This is an archive of the discontinued LLVM Phabricator instance.

Differential D79324

[WebAssembly] Fix block marker placing after fixUnwindMismatches
Closed, Public

Authored by aheejin on May 4 2020, 5:15 AM.

Details

Reviewers

dschuff

Commits

rG834debfffd0b: [WebAssembly] Fix block marker placing after fixUnwindMismatches

Summary

This fixes a few things that are connected. It is very hard to provide
an independent test case for each of those fixes, because they are
interconnected and sometimes one masks another. The provided test case
triggers some of those bugs below but not all.

1. Background:

placeBlockMarker takes a BB, and if the BB is a destination of some
branch, it places end_block marker there, and computes the nearest
common dominator of all predecessors (what we call 'header') and places
a block marker there.
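To make the 'header' computation concrete, here is a minimal sketch of finding the nearest common dominator of a set of predecessors. It assumes a toy representation in which blocks are integers and `IDom[i]` stores the immediate dominator of block `i` (the entry block is its own immediate dominator). The pass itself uses LLVM's `MachineDominatorTree`; `depthOf`, `nearestCommonDominator`, and `findHeader` are hypothetical names used only for illustration.

```cpp
#include <cassert>
#include <vector>

// Number of idom steps from block B up to the entry block.
// Assumes IDom[entry] == entry (illustrative convention, not LLVM's).
inline int depthOf(const std::vector<int> &IDom, int B) {
  int Depth = 0;
  while (IDom[B] != B) {
    B = IDom[B];
    ++Depth;
  }
  return Depth;
}

// Walk both blocks up the dominator tree until they meet.
inline int nearestCommonDominator(const std::vector<int> &IDom, int A, int B) {
  int DepthA = depthOf(IDom, A);
  int DepthB = depthOf(IDom, B);
  while (DepthA > DepthB) {
    A = IDom[A];
    --DepthA;
  }
  while (DepthB > DepthA) {
    B = IDom[B];
    --DepthB;
  }
  while (A != B) {
    A = IDom[A];
    B = IDom[B];
  }
  return A;
}

// The 'header' is the nearest common dominator of all given predecessors.
inline int findHeader(const std::vector<int> &IDom,
                      const std::vector<int> &Preds) {
  assert(!Preds.empty() && "a branch destination needs a predecessor");
  int Header = Preds.front();
  for (int Pred : Preds)
    Header = nearestCommonDominator(IDom, Header, Pred);
  return Header;
}
```

With `IDom = {0, 0, 1, 1, 0}`, blocks 2 and 3 share block 1 as their header, while blocks 2 and 4 only meet at the entry block 0.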

When we first place markers, we traverse BBs from top to bottom. For
example, when there are 5 BBs A, B, C, D, and E and B, D, and E are
branch destinations, if we mark the BB given to placeBlockMarker with *
and draw a rectangle representing the border of block and end_block
markers, the process is going to look like

                      -------
          -----       |-----|
---       |---|       ||---||
|A|       ||A||       |||A|||
---  -->  |---|  -->  ||---||
*B        | B |       || B ||
 C        | C |       || C ||
 D        -----       |-----|
 E         *D         |  D  |
            E         -------
                        *E

which means when we first place markers, we go from inner to outer
scopes. So when we place a block marker, if the header already
contains other block or try markers, they have to belong to inner
scopes, so the existing block/try markers should go _after_ the new
marker. This was the assumption we had.

But after placing all markers we run fixUnwindMismatches function.
There we do some control flow transformation and create some branches,
and we call placeBlockMarker again to place block/end_block
markers for those newly created branches. We can't assume that we are
traversing branch destination BBs from top to bottom now because we are
basically inserting some new markers in the middle of existing markers.

Fix:
In placeBlockMarker, we no longer assume that the BBs given are
visited in top-to-bottom order. Instead, when placing a block marker,
we compute whether each existing block or try marker belongs to an
inner or an outer scope with respect to the current scope.
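As a rough illustration of this check, the sketch below classifies one existing marker relative to a new block scope. The types are hypothetical simplifications, not the real `MachineInstr`/`MachineBasicBlock` classes; `Marker`, `Placement`, and `classifyExistingMarker` are made-up names. The only part taken from the patch is the comparison between the layout number of the BB holding the existing marker's END and the number of the BB given to placeBlockMarker.

```cpp
#include <cassert>

// Hypothetical stand-in for an existing BLOCK/TRY marker found in the header
// BB. EndBBNumber is the layout number of the BB holding its matching END
// marker (what BeginToEnd[&MI]->getParent()->getNumber() yields in the patch).
struct Marker {
  int EndBBNumber;
};

enum class Placement { AfterNewBlock, BeforeNewBlock };

// DestBBNumber is the number of the BB that will receive the new end_block.
// If the existing scope ends at or before that BB, it is nested inside the
// new scope, so the existing marker must come after the new BLOCK marker.
// Otherwise the existing scope encloses the new one and stays before it.
inline Placement classifyExistingMarker(const Marker &Existing,
                                        int DestBBNumber) {
  assert(Existing.EndBBNumber >= 0 && DestBBNumber >= 0);
  return Existing.EndBBNumber <= DestBBNumber ? Placement::AfterNewBlock
                                              : Placement::BeforeNewBlock;
}
```

Under the old top-to-bottom assumption every existing marker would have been classified as `AfterNewBlock`; the second outcome only becomes possible once markers are inserted in the middle of existing ones.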

2. Background:

In fixUnwindMismatches, when there is a call whose correct unwind
destination mismatches the current destination after initially placing
try markers, we wrap that with a new nested try/catch/end and
jump to the correct handler within the new catch. The correct handler
code is split as a separate BB from its original EH pad so it can be
branched to. Here's an example:

Before

mbb:
  call @foo       <- Unwind destination mismatch!
wrong-ehpad:
  catch
  ...
cont:
  end_try
  ...
correct-ehpad:
  catch
  [handler code]

After

mbb:
  try                (new)
  call @foo
nested-ehpad:        (new)
  catch              (new)
  local.set n / drop (new)
  br %handler        (new)
nested-end:          (new)
  end_try            (new)
wrong-ehpad:
  catch
  ...
cont:
  end_try
  ...
correct-ehpad:
  catch
  local.set n / drop (new)
handler:             (new)
  end_try
  [handler code]

Note that after this transformation, it is possible there are no calls
to actually unwind to correct-ehpad here. call @foo now
branches to handler, and there can be no other calls to unwind to
correct-ehpad. In this case correct-ehpad does not have any
predecessors anymore.

This can cause a bug in placeBlockMarker, because we may need to place
an end_block marker in handler, and placeBlockMarker computes the
nearest common dominator of all predecessors. If one of handler's
predecessors (here correct-ehpad) does not have any predecessors, i.e.,
there is no way of reaching it, we cannot correctly compute the common
dominator of the predecessors of handler, and we end up placing no
block/end markers. This bug sometimes masks bug 1.

Fix:
When we have an EH pad that does not have any predecessors after this
transformation, we delete all its successors, so that its successors
don't have any dangling predecessors.
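The effect of this fix can be sketched on a toy CFG. `Block`, `addEdge`, `removeEdge`, and `pruneUnreachableEHPad` below are hypothetical helpers written for illustration; the patch itself works on `MachineBasicBlock` and calls `EHPad->removeSuccessor(BrDest)` when `EHPad->pred_empty()` holds, whereas this sketch follows the wording of the summary and drops every outgoing edge.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical toy CFG node; the real pass works on MachineBasicBlock.
struct Block {
  std::vector<Block *> Preds;
  std::vector<Block *> Succs;
};

inline void addEdge(Block &From, Block &To) {
  From.Succs.push_back(&To);
  To.Preds.push_back(&From);
}

inline void removeEdge(Block &From, Block &To) {
  auto &S = From.Succs;
  S.erase(std::remove(S.begin(), S.end(), &To), S.end());
  auto &P = To.Preds;
  P.erase(std::remove(P.begin(), P.end(), &From), P.end());
}

// If EHPad has become unreachable (no predecessors), drop its outgoing edges
// so that its successors no longer list an unreachable predecessor.
// Returns true if any edge was removed.
inline bool pruneUnreachableEHPad(Block &EHPad) {
  if (!EHPad.Preds.empty())
    return false;
  bool Changed = !EHPad.Succs.empty();
  while (!EHPad.Succs.empty())
    removeEdge(EHPad, *EHPad.Succs.back());
  return Changed;
}
```

In the example above, handler initially has two predecessors, nested-ehpad and correct-ehpad. Once correct-ehpad is pruned, handler's only predecessor is reachable, so a common dominator can be computed again.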

3. Background:

Actually the handler BB in the example shown in bug 2 doesn't need an
end_block marker, despite being a new branch destination, because
it already has an end_try marker, which can serve the same purpose. I
put that example there only for illustration. There is a case where we
actually need to place an end_block marker: when the branch destination
is the appendix BB. The appendix BB is created when a call that is
supposed to unwind to the caller ends up unwinding to a wrong EH pad. In
this case we also wrap the call with a nested try/catch/end,
create an 'appendix' BB at the very end of the function, and branch to
that BB, where we rethrow the exception to the caller.

Fix:
When we don't actually need to place block markers, we don't.

4. In case we fall through to the continuation BB after the catch block,
after extracting handler code in fixUnwindMismatches (refer to bug 2
for an example), we now have to add a branch to it to bypass the
handler.

Before

try
  ...
  (falls through to 'cont')
catch
  handler body
end
              <-- cont

After

try
  ...
  br %cont    (new)
catch
end
handler body
              <-- cont

The problem is, we haven't been placing a new end_block marker in the
cont BB in this case. We should, and this fixes it. But it is hard to
provide a test case that triggers this bug, because the current
compilation pipeline from .ll to .s does not generate this kind of code;
we always have a br after invoke. But code without br is still
valid, and we can have that kind of code if we have some pipeline
changes or optimizations later. Even MIR test cases cannot trigger this
part for now, because we don't currently encode auxiliary EH-related data
structures (such as WasmEHFuncInfo, https://github.com/llvm/llvm-project/blob/19f5da9c1d698653f942b504544a73b85b1e703c/llvm/include/llvm/CodeGen/WasmEHFuncInfo.h#L29-L54) in MIR. Those functionalities
can be added later, but I don't think we should block this fix on that.
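Taken together, fixes 3 and 4 determine which BBs are handed to placeBlockMarker after fixUnwindMismatches. The following is a minimal sketch of that selection under simplifying assumptions: `Block` is a hypothetical stand-in that only carries a layout number, and `collectBlockMarkerDests` is an illustrative name, not a function in the pass.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical stand-in for MachineBasicBlock; only the layout number matters.
struct Block {
  int Number;
};

// Dests: continuation BBs that received a new `br` because the layout
// predecessor of an EH pad used to fall through (fix 4).
// Appendix: the appendix BB if one was created, otherwise nullptr (fix 3).
// Branch destinations that already start with a hoisted end_try are
// deliberately not added. The result is sorted by layout number.
inline std::vector<const Block *>
collectBlockMarkerDests(std::vector<const Block *> Dests,
                        const Block *Appendix) {
  if (Appendix)
    Dests.push_back(Appendix);
  std::sort(Dests.begin(), Dests.end(),
            [](const Block *A, const Block *B) {
              return A->Number < B->Number;
            });
  return Dests;
}
```

Each returned block would then be passed to placeBlockMarker in order, mirroring the sorted loop over `BrDests` at the end of the diff.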

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

aheejin created this revision. May 4 2020, 5:15 AM

Herald added a project: Restricted Project. May 4 2020, 5:15 AM

Herald added subscribers: llvm-commits, sunfish, hiraditya and 2 others.

aheejin edited the summary of this revision. May 4 2020, 5:23 AM

aheejin edited the summary of this revision.

aheejin marked 4 inline comments as done. May 4 2020, 5:27 AM

aheejin added inline comments.

llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp
292	Fix for bug 1
1090	Fix for bug 4
1203	Fix for bug 2
1241	Fix for bug 3

aheejin marked an inline comment as done. May 4 2020, 5:28 AM

aheejin added inline comments.

llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp
879	This was preexisting but just hoisted, because in bugfix 4 we use it earlier.

aheejin marked an inline comment as done. May 4 2020, 5:30 AM

aheejin added inline comments.

llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp
1241	Previously we added all branch dests in `BrDestToTryRanges` to `BrDests` so that `block`/`end` markers were placed for them, but we don't actually need the marker for a non-appendix BB, because there is already an existing `try`/`end` pair that can serve the same purpose. Please refer to the CL description for bugfix 3.

aheejin edited the summary of this revision. May 4 2020, 5:34 AM

Harbormaster completed remote builds in B55624: Diff 261786. May 4 2020, 6:22 AM

Wow, did our one external partner test case trigger all of these?

This revision is now accepted and ready to land. May 4 2020, 4:24 PM

No, it's complicated... Adobe's case triggered bug 1 (the placeBlockMarker bug). That can be solved either by applying bugfix 1 or by applying 2 and 3 together. Bugfix 3 means not placing block/end_block in that case because it is not strictly necessary, and bugfix 1 means we place the markers (even if they are unnecessary), but correctly. But bugfix 3 requires bugfix 2 to work, because due to bug 2 placeBlockMarker does not find the 'header' and just returns. The reason I put all of 1~3 here is that, even though the Adobe case can be fixed by 2 and 3, 1 is a good and necessary precaution too (placeBlockMarker already does the same check when it places the end_block marker; it did not do that check when it placed block).

Bugfix 4 is separate and is not triggered by the Adobe case, but I couldn't come up with a test case that triggers it, for the reason I stated in the CL description (it is valid code but is not generated by the current wasm backend pipeline). The reasons I included bugfix 4 here are: 1. when I was fixing bug 3, which is that we sometimes unnecessarily place block/end_block markers, I accidentally found bug 4, which is the opposite (we do not place block/end_block when necessary), and 2. I can't come up with a separate test case for that anyway.

Our small test case here is not the same as the Adobe case, because it accidentally succeeds in the current code base, because bug 2 masks bug ... In the current codebase it tries to unnecessarily place block markers, but due to bug 2 it cannot find the header and doesn't end up triggering bug 1. But I couldn't come up with a test that has exactly the same properties as the Adobe one.

Closed by commit rG834debfffd0b: [WebAssembly] Fix block marker placing after fixUnwindMismatches (authored by aheejin). May 5 2020, 2:39 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp	43 lines
llvm/test/CodeGen/WebAssembly/cfg-stackify-eh.ll	45 lines

Diff 262040

llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp

@@ 271 lines not shown @@ if (MI.getOpcode() == WebAssembly::LOOP) {
       if (MBB.getNumber() > LoopBottom->getNumber())
         AfterSet.insert(&MI);
 #ifndef NDEBUG
       else
         BeforeSet.insert(&MI);
 #endif
     }

-    // All previously inserted BLOCK/TRY markers should be after the BLOCK
-    // because they are all nested blocks.
+    // If there is a previously placed BLOCK/TRY marker and its corresponding
+    // END marker is before the current BLOCK's END marker, that should be
+    // placed after this BLOCK. Otherwise it should be placed before this BLOCK
+    // marker.
     if (MI.getOpcode() == WebAssembly::BLOCK ||
-        MI.getOpcode() == WebAssembly::TRY)
-      AfterSet.insert(&MI);
+        MI.getOpcode() == WebAssembly::TRY) {
+      if (BeginToEnd[&MI]->getParent()->getNumber() <= MBB.getNumber())
+        AfterSet.insert(&MI);
+#ifndef NDEBUG
+      else
+        BeforeSet.insert(&MI);
+#endif
+    }
[inline comment, aheejin] Fix for bug 1

 #ifndef NDEBUG
     // All END_(BLOCK|LOOP|TRY) markers should be before the BLOCK.
     if (MI.getOpcode() == WebAssembly::END_BLOCK ||
         MI.getOpcode() == WebAssembly::END_LOOP ||
         MI.getOpcode() == WebAssembly::END_TRY)
       BeforeSet.insert(&MI);
 #endif
@@ 568 lines not shown @@ bool WebAssemblyCFGStackify::fixUnwindMismatches(MachineFunction &MF) {
   using TryRange = std::pair<MachineInstr *, MachineInstr *>;
   // In original CFG, <unwind destination BB, a vector of try ranges>
   DenseMap<MachineBasicBlock *, SmallVector<TryRange, 4>> UnwindDestToTryRanges;
   // In new CFG, <destination to branch to, a vector of try ranges>
   DenseMap<MachineBasicBlock *, SmallVector<TryRange, 4>> BrDestToTryRanges;
   // In new CFG, <destination to branch to, register containing exnref>
   DenseMap<MachineBasicBlock *, unsigned> BrDestToExnReg;

+  // Destinations for branches that will be newly added, for which a new
+  // BLOCK/END_BLOCK markers are necessary.
+  SmallVector<MachineBasicBlock *, 8> BrDests;
[inline comment, aheejin] This was preexisting but just hoisted, because in bugfix 4 we use it earlier.
+
   // Gather possibly throwing calls (i.e., previously invokes) whose current
   // unwind destination is not the same as the original CFG.
   for (auto &MBB : reverse(MF)) {
     bool SeenThrowableInstInBB = false;
     for (auto &MI : reverse(MBB)) {
       if (MI.getOpcode() == WebAssembly::TRY)
         EHPadStack.pop_back();
       else if (MI.getOpcode() == WebAssembly::CATCH)
@@ 193 lines not shown @@ for (auto &P : UnwindDestToTryRanges) {
     MachineBasicBlock *TBB = nullptr, *FBB = nullptr;
     SmallVector<MachineOperand, 4> Cond;
     bool Analyzable = !TII.analyzeBranch(*EHPadLayoutPred, TBB, FBB, Cond);
     if (Analyzable && !TBB && !FBB) {
       DebugLoc DL = EHPadLayoutPred->empty()
                         ? DebugLoc()
                         : EHPadLayoutPred->rbegin()->getDebugLoc();
       BuildMI(EHPadLayoutPred, DL, TII.get(WebAssembly::BR)).addMBB(Cont);
+      BrDests.push_back(Cont);
[inline comment, aheejin] Fix for bug 4
     }
   }

   // For possibly throwing calls whose unwind destinations are currently
   // incorrect because of CFG linearization, we wrap them with a nested
   // try/catch/end_try, and within the new catch block, we branch to the correct
   // handler.
   // - Before
@@ 87 lines not shown @@ for (auto Range : TryRanges) {
       // new nested continuation BB.
       NestedCont->splice(NestedCont->end(), MBB,
                          std::next(RangeEnd->getIterator()), MBB->end());
       unstackifyVRegsUsedInSplitBB(MBB, NestedCont, MFI, MRI);
       registerTryScope(NestedTry, NestedEndTry, NestedEHPad);

       // Fix predecessor-successor relationship.
       NestedCont->transferSuccessors(MBB);
-      if (EHPad)
+      if (EHPad) {
         NestedCont->removeSuccessor(EHPad);
+        // If EHPad does not have any predecessors left after removing
+        // NextedCont predecessor, remove its successor too, because this EHPad
+        // is not reachable from the entry BB anyway. We can't remove EHPad BB
+        // itself because it can contain 'catch' or 'end', which are necessary
+        // for keeping try-catch-end structure.
+        if (EHPad->pred_empty())
+          EHPad->removeSuccessor(BrDest);
+      }
[inline comment, aheejin] Fix for bug 2
       MBB->addSuccessor(NestedEHPad);
       MBB->addSuccessor(NestedCont);
       NestedEHPad->addSuccessor(BrDest);
     }
   }

   // Renumber BBs and recalculate ScopeTop info because new BBs might have been
   // created and inserted above.
@@ 15 lines not shown @@ for (auto &MI : reverse(MBB)) {
         break;
       }
     }
   }

   // Recompute the dominator tree.
   getAnalysis<MachineDominatorTree>().runOnMachineFunction(MF);

-  // Place block markers for newly added branches.
-  SmallVector<MachineBasicBlock *, 8> BrDests;
-  for (auto &P : BrDestToTryRanges)
-    BrDests.push_back(P.first);
+  // Place block markers for newly added branches, if necessary.
+
+  // If we've created an appendix BB and a branch to it, place a block/end_block
+  // marker for that. For some new branches, those branch destination BBs start
+  // with a hoisted end_try marker, so we don't need a new marker there.
+  if (AppendixBB)
+    BrDests.push_back(AppendixBB);
[inline comment, aheejin] Fix for bug 3
[inline comment, aheejin] Before we add all branch dests in `BrDestToTryRanges` to `BrDests` so that `block`/`end` markers are placed for them, but actually we don't need the marker for non-appendix BB, because there is already an existing `try`/`end` pair that can serve the same purpose. Please refer to the CL description for bugfix 3.

   llvm::sort(BrDests,
              [&](const MachineBasicBlock *A, const MachineBasicBlock *B) {
                auto ANum = A->getNumber();
                auto BNum = B->getNumber();
                return ANum < BNum;
              });
   for (auto *Dest : BrDests)
     placeBlockMarker(*Dest);
@@ 185 lines not shown @@

llvm/test/CodeGen/WebAssembly/cfg-stackify-eh.ll

@@ 844 lines not shown @@

 terminate7: ; preds = %ehcleanup
   %10 = cleanuppad within %9 []
   %11 = call i8* @llvm.wasm.get.exception(token %10)
   call void @__clang_call_terminate(i8* %11) #7 [ "funclet"(token %10) ]
   unreachable
 }

+; We don't need to call placeBlockMarker after fixUnwindMismatches unless the
+; destination is the appendix BB at the very end. This should not crash.
+define void @test16(i32* %p, i32 %a, i32 %b) personality i8* bitcast (i32 (...)* @__gxx_wasm_personality_v0 to i8*) {
+entry:
+  br label %loop
+
+loop:
+  invoke void @foo()
+          to label %bb0 unwind label %catch.dispatch0
+
+bb0:
+  %cmp = icmp ne i32 %a, %b
+  br i1 %cmp, label %bb1, label %last
+
+bb1: ; preds = %bb0
+  invoke void @bar()
+          to label %try.cont unwind label %catch.dispatch1
+
+catch.dispatch0: ; preds = %loop
+  %0 = catchswitch within none [label %catch.start0] unwind to caller
+
+catch.start0: ; preds = %catch.dispatch0
+  %1 = catchpad within %0 [i8* null]
+  %2 = call i8* @llvm.wasm.get.exception(token %1)
+  %3 = call i32 @llvm.wasm.get.ehselector(token %1)
+  catchret from %1 to label %try.cont
+
+catch.dispatch1: ; preds = %bb1
+  %4 = catchswitch within none [label %catch.start1] unwind to caller
+
+catch.start1: ; preds = %catch.dispatch1
+  %5 = catchpad within %4 [i8* null]
+  %6 = call i8* @llvm.wasm.get.exception(token %5)
+  %7 = call i32 @llvm.wasm.get.ehselector(token %5)
+  catchret from %5 to label %try.cont
+
+try.cont: ; preds = %catch.start, %loop
+  br label %loop
+
+last:
+  ret void
+}
+
 ; Check if the unwind destination mismatch stats are correct
-; NOSORT-STAT: 15 wasm-cfg-stackify - Number of EH pad unwind mismatches found
+; NOSORT-STAT: 16 wasm-cfg-stackify - Number of EH pad unwind mismatches found

 declare void @foo()
 declare void @bar()
 declare i32 @baz()
 declare i32 @qux(i32)
 declare void @quux(i32)
 declare void @fun(i32)
 ; Function Attrs: nounwind
@@ 29 lines not shown @@