This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/ExecutionEngine/JITLink/
-
ExecutionEngine/
-
JITLink/
1/2
JITLink.cpp
-
unittests/ExecutionEngine/JITLink/
-
ExecutionEngine/
-
JITLink/
-
LinkGraphTests.cpp

Differential D113912

[JITLink] Fix splitBlock if there are symbols span across the boundary
ClosedPublic

Authored by steven_wu on Nov 15 2021, 9:38 AM.

Download Raw Diff

Details

Reviewers

lhames
dexonsmith

Commits

rGfcd07f810781: [JITLink] Fix splitBlock if there are symbols span across the boundary

Summary

Fix splitBlock so that it can handle the case when the block being
split has symbols span across the split boundary. This is an error
case in general but for EHFrame splitting on macho platforms, there is an
anonymous symbol that marks the entire block. Current implementation
will leave a symbol that is out of bound of the underlying block. Fix
the problem by dropping such symbols when the block is split.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

steven_wu created this revision.Nov 15 2021, 9:38 AM

Herald added subscribers: ributzka, hiraditya. · View Herald TranscriptNov 15 2021, 9:38 AM

steven_wu requested review of this revision.Nov 15 2021, 9:38 AM

Herald added a project: Restricted Project. · View Herald TranscriptNov 15 2021, 9:38 AM

Harbormaster completed remote builds in B134284: Diff 387300.Nov 15 2021, 10:17 AM

dexonsmith added inline comments.Nov 15 2021, 10:35 AM

llvm/lib/ExecutionEngine/JITLink/JITLink.cpp
216–224	This doesn't seem safe in general, since anonymous symbols aren't necessarily safe to split just because they're anonymous. I wonder if, rather than return `Error`s, it'd be better to trust the caller and split blocks anyway (as the code was), but truncate `Symbol::size()` as necessary to avoid it going past the end of the block it points at. Two other ideas: Change the LinkGraph parser to set the symbol size to 0 for symbols that are generated just to allow edges to a block. Add a bit to Symbol that says "safe to split", which LinkGraph would set for symbols it generates just to allow edges to a block. @lhames, WDYT?

steven_wu added inline comments.Nov 15 2021, 10:46 AM

llvm/lib/ExecutionEngine/JITLink/JITLink.cpp
216–224	So the symbol in question is the one created here: MachOLinkGraphBuilder::addSectionStartSymAndBlock I don't know how this one is being used but the one marked `eh_frame` will become redundant after eh_frame splitting. I guess we can shrink the size of section start sym to have size 0 then we can just always split the block and not worry about that. Then any symbols span across the split boundary will be Error.

I wonder if, rather than return Errors, it'd be better to trust the caller and split blocks anyway (as the code was), but truncate Symbol::size() as necessary to avoid it going past the end of the block it points at.

I like this as a short-term solution.

I think this issue exposes limitations in LinkGraph's current design. I'm not sure that we need/want LLVM-style users-lists for Symbols, but if we just added a ref-count to Symbols we could have a .isUnreferenced method that would be really helpful here: Clients could choose to remove unreferenced anonymous symbols (which should always be safe, since they can be recreated easily), and/or truncate named symbols, and then splitBlock could return an Error for any remaining symbols that extend past a split point.

Address review feedback after talking with Lang and Duncan offline.

Harbormaster completed remote builds in B134323: Diff 387349.Nov 15 2021, 12:41 PM

Update the patch to trust user's action and update symbols to fit in the new block

Harbormaster completed remote builds in B134335: Diff 387369.Nov 15 2021, 1:31 PM

LGTM. Thanks Steven!

This revision is now accepted and ready to land.Nov 15 2021, 1:49 PM

Closed by commit rGfcd07f810781: [JITLink] Fix splitBlock if there are symbols span across the boundary (authored by steven_wu). · Explain WhyNov 15 2021, 1:55 PM

This revision was automatically updated to reflect the committed changes.

steven_wu added a commit: rGfcd07f810781: [JITLink] Fix splitBlock if there are symbols span across the boundary.

Revision Contents

Path

Size

llvm/

lib/

ExecutionEngine/

JITLink/

JITLink.cpp

7 lines

unittests/

ExecutionEngine/

JITLink/

LinkGraphTests.cpp

8 lines

Diff 387398

llvm/lib/ExecutionEngine/JITLink/JITLink.cpp

Show First 20 Lines • Show All 207 Lines • ▼ Show 20 Lines	if (*Cache == None) {
return LHS->getOffset() > RHS->getOffset();		return LHS->getOffset() > RHS->getOffset();
});		});
}		}
auto &BlockSymbols = **Cache;		auto &BlockSymbols = **Cache;

// Transfer all symbols with offset less than SplitIndex to NewBlock.		// Transfer all symbols with offset less than SplitIndex to NewBlock.
while (!BlockSymbols.empty() &&		while (!BlockSymbols.empty() &&
BlockSymbols.back()->getOffset() < SplitIndex) {		BlockSymbols.back()->getOffset() < SplitIndex) {
BlockSymbols.back()->setBlock(NewBlock);		auto *Sym = BlockSymbols.back();
		// If the symbol extends beyond the split, update the size to be within
		// the new block.
		if (Sym->getOffset() + Sym->getSize() > SplitIndex)
		Sym->setSize(SplitIndex - Sym->getOffset());
		Sym->setBlock(NewBlock);
BlockSymbols.pop_back();		BlockSymbols.pop_back();
}		}

		dexonsmithUnsubmitted Not Done Reply Inline Actions This doesn't seem safe in general, since anonymous symbols aren't necessarily safe to split just because they're anonymous. I wonder if, rather than return `Error`s, it'd be better to trust the caller and split blocks anyway (as the code was), but truncate `Symbol::size()` as necessary to avoid it going past the end of the block it points at. Two other ideas: Change the LinkGraph parser to set the symbol size to 0 for symbols that are generated just to allow edges to a block. Add a bit to Symbol that says "safe to split", which LinkGraph would set for symbols it generates just to allow edges to a block. @lhames, WDYT? dexonsmith: This doesn't seem safe in general, since anonymous symbols aren't necessarily safe to split…
		steven_wuAuthorUnsubmitted Done Reply Inline Actions So the symbol in question is the one created here: MachOLinkGraphBuilder::addSectionStartSymAndBlock I don't know how this one is being used but the one marked `eh_frame` will become redundant after eh_frame splitting. I guess we can shrink the size of section start sym to have size 0 then we can just always split the block and not worry about that. Then any symbols span across the split boundary will be Error. steven_wu: So the symbol in question is the one created here: ``` MachOLinkGraphBuilder…
// Update offsets for all remaining symbols in B.		// Update offsets for all remaining symbols in B.
for (auto *Sym : BlockSymbols)		for (auto *Sym : BlockSymbols)
Sym->setOffset(Sym->getOffset() - SplitIndex);		Sym->setOffset(Sym->getOffset() - SplitIndex);
}		}

return NewBlock;		return NewBlock;
}		}

▲ Show 20 Lines • Show All 191 Lines • Show Last 20 Lines

llvm/unittests/ExecutionEngine/JITLink/LinkGraphTests.cpp

Show First 20 Lines • Show All 487 Lines • ▼ Show 20 Lines	TEST(LinkGraphTest, SplitBlock) {
auto &S1 = G.addDefinedSymbol(B1, 0, "S1", 4, Linkage::Strong, Scope::Default,		auto &S1 = G.addDefinedSymbol(B1, 0, "S1", 4, Linkage::Strong, Scope::Default,
false, false);		false, false);
auto &S2 = G.addDefinedSymbol(B1, 4, "S2", 4, Linkage::Strong, Scope::Default,		auto &S2 = G.addDefinedSymbol(B1, 4, "S2", 4, Linkage::Strong, Scope::Default,
false, false);		false, false);
auto &S3 = G.addDefinedSymbol(B1, 8, "S3", 4, Linkage::Strong, Scope::Default,		auto &S3 = G.addDefinedSymbol(B1, 8, "S3", 4, Linkage::Strong, Scope::Default,
false, false);		false, false);
auto &S4 = G.addDefinedSymbol(B1, 12, "S4", 4, Linkage::Strong,		auto &S4 = G.addDefinedSymbol(B1, 12, "S4", 4, Linkage::Strong,
Scope::Default, false, false);		Scope::Default, false, false);
		// Add a symbol that extends beyond the split.
		auto &S5 = G.addDefinedSymbol(B1, 0, "S5", 16, Linkage::Strong,
		Scope::Default, false, false);

// Add an extra block, EB, and target symbols, and use these to add edges		// Add an extra block, EB, and target symbols, and use these to add edges
// from B1 to EB.		// from B1 to EB.
auto &EB = G.createContentBlock(Sec, BlockContent, 0x2000, 8, 0);		auto &EB = G.createContentBlock(Sec, BlockContent, 0x2000, 8, 0);
auto &ES1 = G.addDefinedSymbol(EB, 0, "TS1", 4, Linkage::Strong,		auto &ES1 = G.addDefinedSymbol(EB, 0, "TS1", 4, Linkage::Strong,
Scope::Default, false, false);		Scope::Default, false, false);
auto &ES2 = G.addDefinedSymbol(EB, 4, "TS2", 4, Linkage::Strong,		auto &ES2 = G.addDefinedSymbol(EB, 4, "TS2", 4, Linkage::Strong,
Scope::Default, false, false);		Scope::Default, false, false);
Show All 29 Lines	TEST(LinkGraphTest, SplitBlock) {
EXPECT_EQ(S2.getOffset(), 4U);		EXPECT_EQ(S2.getOffset(), 4U);

EXPECT_EQ(&S3.getBlock(), &B1);		EXPECT_EQ(&S3.getBlock(), &B1);
EXPECT_EQ(S3.getOffset(), 0U);		EXPECT_EQ(S3.getOffset(), 0U);

EXPECT_EQ(&S4.getBlock(), &B1);		EXPECT_EQ(&S4.getBlock(), &B1);
EXPECT_EQ(S4.getOffset(), 4U);		EXPECT_EQ(S4.getOffset(), 4U);

		EXPECT_EQ(&S5.getBlock(), &B2);
		EXPECT_EQ(S5.getOffset(), 0U);
		// Size shrinks to fit.
		EXPECT_EQ(S5.getSize(), 8U);

// Check that edges in B1 have been transferred as expected:		// Check that edges in B1 have been transferred as expected:
// Both blocks should now have two edges each at offsets 0 and 4.		// Both blocks should now have two edges each at offsets 0 and 4.
EXPECT_EQ(llvm::size(B1.edges()), 2);		EXPECT_EQ(llvm::size(B1.edges()), 2);
if (size(B1.edges()) == 2) {		if (size(B1.edges()) == 2) {
auto E1 = &B1.edges().begin();		auto E1 = &B1.edges().begin();
auto E2 = &(B1.edges().begin() + 1);		auto E2 = &(B1.edges().begin() + 1);
if (E2->getOffset() < E1->getOffset())		if (E2->getOffset() < E1->getOffset())
std::swap(E1, E2);		std::swap(E1, E2);
Show All 14 Lines