This is an archive of the discontinued LLVM Phabricator instance.

[BOLT] Update labels for split landing pad
Needs RevisionPublic

Authored by nhuhuan on Jul 19 2022, 3:25 AM.

Download Raw Diff

Details

Reviewers

rafauler
Amir
maksfb

Summary

Landing pad is the catch block in C++ exception handling and normally
resides in same function with callsite. As a result, it makes sense
to reuse or create a local label during parsing LSDA section.

However, for function-splitting enabled binaries, callsite and landing
pad could be in different sibling fragments. The split landing pad's
label needs to be registered as a secondary entry point to the target
fragment.

Also, unless LSDA section is updated, it is necessary to link callsite
fragment to landing pad fragment to guarantee that landing pad fragment
can be moved.

Test Plan:

ninja check-bolt

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

nhuhuan created this revision.Jul 19 2022, 3:25 AM

Herald added a reviewer: rafauler. · View Herald TranscriptJul 19 2022, 3:25 AM

Herald added a reviewer: Amir. · View Herald Transcript

Herald added a reviewer: maksfb. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: ayermolo, JDevlieghere. · View Herald Transcript

nhuhuan requested review of this revision.Jul 19 2022, 3:25 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 19 2022, 3:25 AM

Herald added subscribers: llvm-commits, yota9. · View Herald Transcript

nhuhuan added a parent revision: D130071: [BOLT] Adapted policy checks for stripped binaries.Jul 19 2022, 3:25 AM

nhuhuan edited the summary of this revision. (Show Details)

nhuhuan edited the summary of this revision. (Show Details)Jul 19 2022, 3:27 AM

nhuhuan edited the summary of this revision. (Show Details)

nhuhuan edited the summary of this revision. (Show Details)Jul 19 2022, 3:30 AM

Harbormaster completed remote builds in B176214: Diff 445762.Jul 19 2022, 3:36 AM

nhuhuan edited the summary of this revision. (Show Details)Jul 19 2022, 3:37 AM

Thanks for working on it and adding a test with a stripped binary.

The approach looks good to me. I would appreciate if @maksfb or @rafaelauler take a look.
@nhuhuan: let's wait a day before proceeding.

This revision is now accepted and ready to land.Jul 20 2022, 7:24 PM

Excellent work Huan! I have a few design suggestions below. Let me know what do you think.

bolt/lib/Core/Exceptions.cpp
102	I'm not a fan of using boolean input variables (there are several texts written about this anti-pattern, this is just one of them: https://understandlegacycode.com/blog/what-is-wrong-with-boolean-parameters/ ) We have quite a few of these in our codebase already and it makes me uneasy. Here, it just makes code harder to read and understand. This is already a reasonably complex parser, and we should at least try to make it simpler to read. I would prefer if this function was refactored to reflect the fact that it now has two modes of operation. If we look at parseLSDA as it is in this patch, it can be refactored using a pattern of parser logic separated from callback action. this is similar to semantic actions in a compiler parser, in which we completely separate parser from the component responsible for taking actions depending on what the parser reports as the rule being currently consumed. put all of the current parser logic into parseLSDA, make parseLSDA receive a callback as an argument that is what is going to happen to each parsed element, and then implement checkonly=true and checkonly=false logic in the callback. something along these lines (let me know if it is not feasible to implement this pattern or if it is too cumbersome): BinaryFunction { private: parseAndWalkOverLSDA(data, Callback) public: consumeLSDAForFragmentRelationships(); consumeLSDAForEHInfo() } parseAndWalkOverLSDA{ parsing logic... for each iteration of loop { parsing logic callback() } } consumeLSDAForFragmentRelationships{ auto Callback = [&] { establish fragment relationships logic... } parseAndWalkOverLSDA(Callback); } consumeLSDAForEHInfo { auto Callback = [&] { modify instructions to add all EH info stuff.. } parseAndWalkOverLSDA(Callback); }
522	fillLSDAAddressFor

I'm also curious if this adds runtime overhead by parsing LSDA two times per function. Can we measure that using a large binary as input?

In D130072#3673246, @rafauler wrote:

I'm also curious if this adds runtime overhead by parsing LSDA two times per function. Can we measure that using a large binary as input?

Regarding overhead, running twice should have (almost) no effect because size
of LSDA is proportional to only C++ exception handling. Very small amount of
code actually uses this feature.

I measured performance on a binary of size 741MB.

Run parseLSDA once - without checking for valid LSDA:
Testing Time: 431.06s
real 7m11.206s
user 8m43.858s
sys 13m35.488s

Run parseLSDA twice - with checking for valid LSDA:
Testing Time: 442.66s
real 7m22.803s
user 8m43.770s
sys 13m54.131s

Note that to quickly get results, I skipped the checks in "Run Once" version,
resulted in faster performance, but that is not significant.

If we keep the checks in place, running once or twice probably does not matter
much. I don't think performance is a problem.

Please update the summary to narrative form.

nhuhuan added inline comments.Jul 26 2022, 12:44 PM

bolt/lib/Core/Exceptions.cpp
102	Thanks for your suggestions, @rafaelauler. I generally agree that using callbacks improves maintainability. However, a few things are holding me back. (a) Callbacks may need too many arguments I assume that we don't want to add parsing logic remained in parseAndWalkOverLSDA If so, there are simply too many arguments for one general callback. (b) Avoid printing duplicate debugging messages. If printing message in callbacks, then many more arguments. So parseAndWalkOverLSDA need to take another boolean AllowPrintingMessage.

Split this diff into a few diffs.

nhuhuan retitled this revision from [BOLT] Support split landing pad for stripped binaries to [BOLT] Update labels for split landing pad.Jul 27 2022, 1:20 AM

nhuhuan edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B177803: Diff 447947.Jul 27 2022, 1:26 AM

nhuhuan added a child revision: D130617: [BOLT] Update split landing pad check for stripped binaries.Jul 27 2022, 2:30 AM

rafauler added inline comments.Jul 27 2022, 1:16 PM

bolt/lib/Core/BinaryFunction.cpp
1887–1892	.. if LPFunc has no CFG yet
1888	Solution: rerun recomputeLandingPads after buildCFG for all functions
bolt/lib/Core/Exceptions.cpp
202–204	I would suggest moving this before MCSymbol *LPLabel = nullptr; so the code that calculates LPLabel is closer together
208	I was thinking in maybe adding a comment "Skip creating a landing pad if LPOffset is zero" so it is clear that this is from the LSDA format
216	Treat split landing pad as the fragment's secondary entry point? But maybe expand this comment, as it looks like this code section is handling _only_ that. Maybe: Create/fetch landing pad label. If necessary, treat split landing pad as...
225	Is this correct for the case where LPOffset is an offset outside this function (LPLabel is set by Fragment->addEntryPointAtOffset(FragmentOffset)) ?
bolt/lib/Rewrite/RewriteInstance.cpp
2925–2929	I would reverse for clarity: Process non-simple cases due to split jump table or split landing pad Ignore all other non-simple cases

Address feedbacks

bolt/lib/Core/Exceptions.cpp
208	I wrote an equivalent comment: "Assumption: landing pad cannot target current fragment entry"
225	You're correct. I explain this in the last paragraph in description. Let me clarify a bit more here: The link between callsite and landing pad fragment is Labels[LPOffset]. If LSDA is not updated, the LPOffset will be the same, while target function may be moved. This link is to keep connection between two fragments. I also see its benefit on merging split function task in future.
522	Addressed in another diff: D130663

nhuhuan marked an inline comment as done.Jul 27 2022, 3:01 PM

Harbormaster completed remote builds in B177962: Diff 448177.Jul 27 2022, 3:11 PM

We need to make sure that the support for split LP is sufficient/correct for also running the processed binary.
Can we turn the test into a runtime test, making sure that the EH works after BOLT?

This revision now requires changes to proceed.Jul 27 2022, 11:16 PM

Revision Contents

Path

Size

bolt/

lib/

Core/

BinaryFunction.cpp

8 lines

Exceptions.cpp

59 lines

Passes/

SplitFunctions.cpp

8 lines

Rewrite/

RewriteInstance.cpp

19 lines

test/

X86/

split-landing-pad.s

9 lines

Diff 448177

bolt/lib/Core/BinaryFunction.cpp

Show First 20 Lines • Show All 1,878 Lines • ▼ Show 20 Lines	for (BinaryBasicBlock *BB : BasicBlocks) {
for (MCInst &Instr : *BB) {		for (MCInst &Instr : *BB) {
if (!BC.MIB->isInvoke(Instr))		if (!BC.MIB->isInvoke(Instr))
continue;		continue;

const Optional<MCPlus::MCLandingPad> EHInfo = BC.MIB->getEHInfo(Instr);		const Optional<MCPlus::MCLandingPad> EHInfo = BC.MIB->getEHInfo(Instr);
if (!EHInfo \|\| !EHInfo->first)		if (!EHInfo \|\| !EHInfo->first)
continue;		continue;

BinaryBasicBlock *LPBlock = getBasicBlockForLabel(EHInfo->first);		// If LPFunc has no CFG, LPBlock does not exist
if (!BBLandingPads.count(LPBlock)) {		// Solution: rerun recomputeLandingPads after buildCFG for all functions
		rafaulerUnsubmitted Done Reply Inline Actions Solution: rerun recomputeLandingPads after buildCFG for all functions rafauler: Solution: rerun recomputeLandingPads after buildCFG for all functions
		const MCSymbol *LPLabel = EHInfo->first;
		BinaryFunction *LPFunc = BC.getFunctionForSymbol(LPLabel);
		BinaryBasicBlock *LPBlock = LPFunc->getBasicBlockForLabel(LPLabel);
		if (LPBlock != nullptr && !BBLandingPads.count(LPBlock)) {
		rafaulerUnsubmitted Done Reply Inline Actions .. if LPFunc has no CFG yet rafauler: .. if LPFunc has no CFG yet
BBLandingPads.insert(LPBlock);		BBLandingPads.insert(LPBlock);
BB->LandingPads.emplace_back(LPBlock);		BB->LandingPads.emplace_back(LPBlock);
LPBlock->Throwers.emplace_back(BB);		LPBlock->Throwers.emplace_back(BB);
}		}
}		}
}		}
}		}

▲ Show 20 Lines • Show All 2,569 Lines • Show Last 20 Lines

bolt/lib/Core/Exceptions.cpp

Show First 20 Lines • Show All 93 Lines • ▼ Show 20 Lines
// may require relocation.		// may require relocation.
//		//
// Ideally we should be able to re-write LSDA in-place, without the need to		// Ideally we should be able to re-write LSDA in-place, without the need to
// allocate a new space for it. Sadly there's no guarantee that the new call		// allocate a new space for it. Sadly there's no guarantee that the new call
// site table will be the same size as GCC uses uleb encodings for PC offsets.		// site table will be the same size as GCC uses uleb encodings for PC offsets.
//		//
// Note: some functions have LSDA entries with 0 call site entries.		// Note: some functions have LSDA entries with 0 call site entries.
void BinaryFunction::parseLSDA(ArrayRef<uint8_t> LSDASectionData,		void BinaryFunction::parseLSDA(ArrayRef<uint8_t> LSDASectionData,
uint64_t LSDASectionAddress) {		uint64_t LSDASectionAddress) {
		rafaulerUnsubmitted Not Done Reply Inline Actions I'm not a fan of using boolean input variables (there are several texts written about this anti-pattern, this is just one of them: https://understandlegacycode.com/blog/what-is-wrong-with-boolean-parameters/ ) We have quite a few of these in our codebase already and it makes me uneasy. Here, it just makes code harder to read and understand. This is already a reasonably complex parser, and we should at least try to make it simpler to read. I would prefer if this function was refactored to reflect the fact that it now has two modes of operation. If we look at parseLSDA as it is in this patch, it can be refactored using a pattern of parser logic separated from callback action. this is similar to semantic actions in a compiler parser, in which we completely separate parser from the component responsible for taking actions depending on what the parser reports as the rule being currently consumed. put all of the current parser logic into parseLSDA, make parseLSDA receive a callback as an argument that is what is going to happen to each parsed element, and then implement checkonly=true and checkonly=false logic in the callback. something along these lines (let me know if it is not feasible to implement this pattern or if it is too cumbersome): BinaryFunction { private: parseAndWalkOverLSDA(data, Callback) public: consumeLSDAForFragmentRelationships(); consumeLSDAForEHInfo() } parseAndWalkOverLSDA{ parsing logic... for each iteration of loop { parsing logic callback() } } consumeLSDAForFragmentRelationships{ auto Callback = [&] { establish fragment relationships logic... } parseAndWalkOverLSDA(Callback); } consumeLSDAForEHInfo { auto Callback = [&] { modify instructions to add all EH info stuff.. } parseAndWalkOverLSDA(Callback); } rafauler: I'm not a fan of using boolean input variables (there are several texts written about this…
		nhuhuanAuthorUnsubmitted Done Reply Inline Actions Thanks for your suggestions, @rafaelauler. I generally agree that using callbacks improves maintainability. However, a few things are holding me back. (a) Callbacks may need too many arguments I assume that we don't want to add parsing logic remained in parseAndWalkOverLSDA If so, there are simply too many arguments for one general callback. (b) Avoid printing duplicate debugging messages. If printing message in callbacks, then many more arguments. So parseAndWalkOverLSDA need to take another boolean AllowPrintingMessage. nhuhuan: Thanks for your suggestions, @rafaelauler. I generally agree that using callbacks improves…
assert(CurrentState == State::Disassembled && "unexpected function state");		assert(CurrentState == State::Disassembled && "unexpected function state");

if (!getLSDAAddress())		if (!getLSDAAddress())
return;		return;

DWARFDataExtractor Data(		DWARFDataExtractor Data(
StringRef(reinterpret_cast<const char *>(LSDASectionData.data()),		StringRef(reinterpret_cast<const char *>(LSDASectionData.data()),
LSDASectionData.size()),		LSDASectionData.size()),
▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	while (CallSitePtr < CallSiteTableEnd) {
uint64_t Length = *Data.getEncodedPointer(&CallSitePtr, CallSiteEncoding,		uint64_t Length = *Data.getEncodedPointer(&CallSitePtr, CallSiteEncoding,
CallSitePtr + LSDASectionAddress);		CallSitePtr + LSDASectionAddress);
uint64_t LandingPad = *Data.getEncodedPointer(		uint64_t LandingPad = *Data.getEncodedPointer(
&CallSitePtr, CallSiteEncoding, CallSitePtr + LSDASectionAddress);		&CallSitePtr, CallSiteEncoding, CallSitePtr + LSDASectionAddress);
uint64_t ActionEntry = Data.getULEB128(&CallSitePtr);		uint64_t ActionEntry = Data.getULEB128(&CallSitePtr);

uint64_t LPOffset = LPStart + LandingPad;		uint64_t LPOffset = LPStart + LandingPad;
uint64_t LPAddress = Address + LPOffset;		uint64_t LPAddress = Address + LPOffset;
		BinaryFunction *Fragment = BC.getBinaryFunctionContainingAddress(LPAddress);

// Verify if landing pad code is located outside current function		// Verify if landing pad code is located outside current function
// Support landing pad to builtin_unreachable		// Support landing pad to builtin_unreachable
if (LPAddress < Address \|\| LPAddress > Address + getSize()) {		if (LPAddress < Address \|\| LPAddress > Address + getSize()) {
BinaryFunction *Fragment =
BC.getBinaryFunctionContainingAddress(LPAddress);
assert(Fragment != nullptr &&		assert(Fragment != nullptr &&
"BOLT-ERROR: cannot find landing pad fragment");		"BOLT-ERROR: cannot find landing pad fragment");
BC.addInterproceduralReference(this, Fragment->getAddress());		BC.addInterproceduralReference(this, Fragment->getAddress());
BC.processInterproceduralReferences();		BC.processInterproceduralReferences();
auto isFragmentOf = [](BinaryFunction *Fragment,		auto isFragmentOf = [](BinaryFunction *Fragment,
BinaryFunction *Parent) -> bool {		BinaryFunction *Parent) -> bool {
return (Fragment->isFragment() && Fragment->isParentFragment(Parent));		return (Fragment->isFragment() && Fragment->isParentFragment(Parent));
};		};
assert((isFragmentOf(this, Fragment) \|\| isFragmentOf(Fragment, this)) &&		assert((isFragmentOf(this, Fragment) \|\| isFragmentOf(Fragment, this)) &&
"BOLT-ERROR: cannot have landing pads in different "		"BOLT-ERROR: cannot have landing pads in different "
"functions");		"functions");
setHasIndirectTargetToSplitFragment(true);		setHasIndirectTargetToSplitFragment(true);
BC.addFragmentsToSkip(this);		BC.addFragmentsToSkip(this);
return;
}		}

if (opts::PrintExceptions) {		// Special case, consider builtin_unreachable as part of this function
outs() << "Call Site: [0x" << Twine::utohexstr(RangeBase + Start)		if (LPAddress == Address + getSize())
<< ", 0x" << Twine::utohexstr(RangeBase + Start + Length)		Fragment = this;
<< "); landing pad: 0x" << Twine::utohexstr(LPOffset)
<< "; action entry: 0x" << Twine::utohexstr(ActionEntry) << "\n";
outs() << " current offset is " << (CallSitePtr - CallSiteTableStart)
<< '\n';
}

// Create a handler entry if necessary.		// Create a handler entry if necessary.
		rafaulerUnsubmitted Done Reply Inline Actions I would suggest moving this before MCSymbol LPLabel = nullptr; so the code that calculates LPLabel is closer together rafauler:* I would suggest moving this before MCSymbol *LPLabel = nullptr; so the code that calculates…
MCSymbol *LPSymbol = nullptr;		MCSymbol *LPLabel = nullptr;
if (LPOffset) {
if (!getInstructionAtOffset(LPOffset)) {		// Assumption: landing pad cannot target current fragment entry
		// Note: split landing pad can target other fragment entry
		rafaulerUnsubmitted Done Reply Inline Actions I was thinking in maybe adding a comment "Skip creating a landing pad if LPOffset is zero" so it is clear that this is from the LSDA format rafauler: I was thinking in maybe adding a comment "Skip creating a landing pad if LPOffset is zero" so…
		nhuhuanAuthorUnsubmitted Done Reply Inline Actions I wrote an equivalent comment: "Assumption: landing pad cannot target current fragment entry" nhuhuan: I wrote an equivalent comment: "Assumption: landing pad cannot target current fragment entry"
		if (LPAddress != Address) {
		uint64_t FragmentOffset = LPAddress - Fragment->getAddress();
		if (!Fragment->getInstructionAtOffset(FragmentOffset)) {
if (opts::Verbosity >= 1)		if (opts::Verbosity >= 1)
errs() << "BOLT-WARNING: landing pad " << Twine::utohexstr(LPOffset)		errs() << "BOLT-WARNING: landing pad " << Twine::utohexstr(LPOffset)
<< " not pointing to an instruction in function " << *this		<< " not pointing to an instruction in function " << *Fragment
<< " - ignoring.\n";		<< " - ignoring.\n";
} else {		} else {
		rafaulerUnsubmitted Done Reply Inline Actions Treat split landing pad as the fragment's secondary entry point? But maybe expand this comment, as it looks like this code section is handling _only_ that. Maybe: Create/fetch landing pad label. If necessary, treat split landing pad as... rafauler: Treat split landing pad as the fragment's secondary entry point? But maybe expand this comment…
auto Label = Labels.find(LPOffset);		// Create or fetch landing pad label
if (Label != Labels.end()) {		// For landing pad in same function, create a local label
LPSymbol = Label->second;		// For landing pad in a sibling fragment, register as a secondary entry
} else {		auto Label = Fragment->Labels.find(FragmentOffset);
LPSymbol = BC.Ctx->createNamedTempSymbol("LP");		LPLabel = (Label != Fragment->Labels.end())
Labels[LPOffset] = LPSymbol;		? Label->second
		: ((Fragment != this)
		? Fragment->addEntryPointAtOffset(FragmentOffset)
		: BC.Ctx->createNamedTempSymbol("LP"));
		rafaulerUnsubmitted Done Reply Inline Actions Is this correct for the case where LPOffset is an offset outside this function (LPLabel is set by Fragment->addEntryPointAtOffset(FragmentOffset)) ? rafauler: Is this correct for the case where LPOffset is an offset outside this function (LPLabel is set…
		nhuhuanAuthorUnsubmitted Done Reply Inline Actions You're correct. I explain this in the last paragraph in description. Let me clarify a bit more here: The link between callsite and landing pad fragment is Labels[LPOffset]. If LSDA is not updated, the LPOffset will be the same, while target function may be moved. This link is to keep connection between two fragments. I also see its benefit on merging split function task in future. nhuhuan: You're correct. I explain this in the last paragraph in description. Let me clarify a bit more…
		// Support recomputeLandingPad to identify split landing pad
		BC.setSymbolToFunctionMap(LPLabel, Fragment);
		Labels[LPOffset] = LPLabel;
}		}
}		}

		if (opts::PrintExceptions) {
		outs() << "Call Site: [0x" << Twine::utohexstr(RangeBase + Start)
		<< ", 0x" << Twine::utohexstr(RangeBase + Start + Length)
		<< "); landing pad: 0x" << Twine::utohexstr(LPOffset)
		<< "; action entry: 0x" << Twine::utohexstr(ActionEntry) << "\n";
		outs() << " current offset is " << (CallSitePtr - CallSiteTableStart)
		<< '\n';
		if (LPLabel != nullptr)
		outs() << " landing pad label: " << LPLabel->getName() << "\n";
}		}

// Mark all call instructions in the range.		// Mark all call instructions in the range.
auto II = Instructions.find(Start);		auto II = Instructions.find(Start);
auto IE = Instructions.end();		auto IE = Instructions.end();
assert(II != IE && "exception range not pointing to an instruction");		assert(II != IE && "exception range not pointing to an instruction");
do {		do {
MCInst &Instruction = II->second;		MCInst &Instruction = II->second;
if (BC.MIB->isCall(Instruction) &&		if (BC.MIB->isCall(Instruction) &&
!BC.MIB->getConditionalTailCall(Instruction)) {		!BC.MIB->getConditionalTailCall(Instruction)) {
assert(!BC.MIB->isInvoke(Instruction) &&		assert(!BC.MIB->isInvoke(Instruction) &&
"overlapping exception ranges detected");		"overlapping exception ranges detected");
// Add extra operands to a call instruction making it an invoke from		// Add extra operands to a call instruction making it an invoke from
// now on.		// now on.
BC.MIB->addEHInfo(Instruction,		BC.MIB->addEHInfo(Instruction,
MCPlus::MCLandingPad(LPSymbol, ActionEntry));		MCPlus::MCLandingPad(LPLabel, ActionEntry));
}		}
++II;		++II;
} while (II != IE && II->first < Start + Length);		} while (II != IE && II->first < Start + Length);

if (ActionEntry != 0) {		if (ActionEntry != 0) {
auto printType = [&](int Index, raw_ostream &OS) {		auto printType = [&](int Index, raw_ostream &OS) {
assert(Index > 0 && "only positive indices are valid");		assert(Index > 0 && "only positive indices are valid");
uint64_t TTEntry = TypeTableStart - Index * TTypeEncodingSize;		uint64_t TTEntry = TypeTableStart - Index * TTypeEncodingSize;
▲ Show 20 Lines • Show All 249 Lines • ▼ Show 20 Lines	if (FDEI != FDEs.end() && FDEI->first == CurFDE->getInitialLocation()) {
}		}
}		}
} else {		} else {
FDEs.emplace_hint(FDEI, CurFDE->getInitialLocation(), CurFDE);		FDEs.emplace_hint(FDEI, CurFDE->getInitialLocation(), CurFDE);
}		}
}		}
}		}

bool CFIReaderWriter::fillCFIInfoFor(BinaryFunction &Function) const {		bool CFIReaderWriter::fillCFIInfoFor(BinaryFunction &Function) const {
		rafaulerUnsubmitted Done Reply Inline Actions fillLSDAAddressFor rafauler: fillLSDAAddressFor
		nhuhuanAuthorUnsubmitted Done Reply Inline Actions Addressed in another diff: D130663 nhuhuan: Addressed in another diff: D130663
uint64_t Address = Function.getAddress();		uint64_t Address = Function.getAddress();
auto I = FDEs.find(Address);		auto I = FDEs.find(Address);
// Ignore zero-length FDE ranges.		// Ignore zero-length FDE ranges.
if (I == FDEs.end() \|\| !I->second->getAddressRange())		if (I == FDEs.end() \|\| !I->second->getAddressRange())
return true;		return true;

const FDE &CurFDE = *I->second;		const FDE &CurFDE = *I->second;
Optional<uint64_t> LSDA = CurFDE.getLSDAAddress();		Optional<uint64_t> LSDA = CurFDE.getLSDAAddress();
▲ Show 20 Lines • Show All 393 Lines • Show Last 20 Lines

bolt/lib/Passes/SplitFunctions.cpp

Show First 20 Lines • Show All 327 Lines • ▼ Show 20 Lines	SplitFunctions::createEHTrampolines(BinaryFunction &BF) const {
// function which will invalidate its iterators.		// function which will invalidate its iterators.
std::vector<BinaryBasicBlock *> Blocks(BF.pbegin(), BF.pend());		std::vector<BinaryBasicBlock *> Blocks(BF.pbegin(), BF.pend());
for (BinaryBasicBlock *BB : Blocks) {		for (BinaryBasicBlock *BB : Blocks) {
for (MCInst &Instr : *BB) {		for (MCInst &Instr : *BB) {
const Optional<MCPlus::MCLandingPad> EHInfo = MIB->getEHInfo(Instr);		const Optional<MCPlus::MCLandingPad> EHInfo = MIB->getEHInfo(Instr);
if (!EHInfo \|\| !EHInfo->first)		if (!EHInfo \|\| !EHInfo->first)
continue;		continue;

		// In case of split landing pad, LPFunc != BF
const MCSymbol *LPLabel = EHInfo->first;		const MCSymbol *LPLabel = EHInfo->first;
BinaryBasicBlock *LPBlock = BF.getBasicBlockForLabel(LPLabel);		BinaryFunction *LPFunc =
		BF.getBinaryContext().getFunctionForSymbol(LPLabel);
		BinaryBasicBlock *LPBlock = LPFunc->getBasicBlockForLabel(LPLabel);
if (BB->isCold() == LPBlock->isCold())		if (BB->isCold() == LPBlock->isCold())
continue;		continue;

const MCSymbol *TrampolineLabel = nullptr;		const MCSymbol *TrampolineLabel = nullptr;
auto Iter = LPTrampolines.find(LPLabel);		auto Iter = LPTrampolines.find(LPLabel);
if (Iter != LPTrampolines.end()) {		if (Iter != LPTrampolines.end()) {
TrampolineLabel = Iter->second;		TrampolineLabel = Iter->second;
} else {		} else {
// Create a trampoline basic block in the same fragment as the thrower.		// Create a trampoline basic block in the same fragment as the thrower.
// Note: there's no need to insert the jump instruction, it will be		// Note: there's no need to insert the jump instruction, it will be
// added by fixBranches().		// added by fixBranches().
BinaryBasicBlock *TrampolineBB = BF.addBasicBlock();		BinaryBasicBlock *TrampolineBB = LPFunc->addBasicBlock();
TrampolineBB->setIsCold(BB->isCold());		TrampolineBB->setIsCold(BB->isCold());
TrampolineBB->setExecutionCount(LPBlock->getExecutionCount());		TrampolineBB->setExecutionCount(LPBlock->getExecutionCount());
TrampolineBB->addSuccessor(LPBlock, TrampolineBB->getExecutionCount());		TrampolineBB->addSuccessor(LPBlock, TrampolineBB->getExecutionCount());
TrampolineBB->setCFIState(LPBlock->getCFIState());		TrampolineBB->setCFIState(LPBlock->getCFIState());
TrampolineLabel = TrampolineBB->getLabel();		TrampolineLabel = TrampolineBB->getLabel();
LPTrampolines.insert(std::make_pair(LPLabel, TrampolineLabel));		LPTrampolines.insert(std::make_pair(LPLabel, TrampolineLabel));
		BF.getBinaryContext().setSymbolToFunctionMap(TrampolineLabel, LPFunc);
}		}

// Substitute the landing pad with the trampoline.		// Substitute the landing pad with the trampoline.
MIB->updateEHInfo(Instr,		MIB->updateEHInfo(Instr,
MCPlus::MCLandingPad(TrampolineLabel, EHInfo->second));		MCPlus::MCLandingPad(TrampolineLabel, EHInfo->second));
}		}
}		}

Show All 38 Lines

bolt/lib/Rewrite/RewriteInstance.cpp

Show First 20 Lines • Show All 2,916 Lines • ▼ Show 20 Lines	void RewriteInstance::disassembleFunctions() {
BC->adjustCodePadding();		BC->adjustCodePadding();

for (auto &BFI : BC->getBinaryFunctions()) {		for (auto &BFI : BC->getBinaryFunctions()) {
BinaryFunction &Function = BFI.second;		BinaryFunction &Function = BFI.second;

if (!shouldDisassemble(Function))		if (!shouldDisassemble(Function))
continue;		continue;

if (!Function.isSimple()) {		// Process non-simple cases due to split jump table or split landing pad
assert((!BC->HasRelocations \|\| Function.getSize() == 0 \|\|		// Ignore all other non-simple cases
Function.hasIndirectTargetToSplitFragment()) &&		if (!Function.isSimple() && !Function.hasIndirectTargetToSplitFragment()) {
		assert((!BC->HasRelocations \|\| Function.getSize() == 0) &&
"unexpected non-simple function in relocation mode");		"unexpected non-simple function in relocation mode");
		rafaulerUnsubmitted Done Reply Inline Actions I would reverse for clarity: Process non-simple cases due to split jump table or split landing pad Ignore all other non-simple cases rafauler: I would reverse for clarity: Process non-simple cases due to split jump table or split…
continue;		continue;
}		}

// Fill in CFI information for this function		// Fill in CFI information for this function
if (!Function.trapsOnEntry() && !CFIRdWrt->fillCFIInfoFor(Function)) {		if (!Function.trapsOnEntry() && !CFIRdWrt->fillCFIInfoFor(Function)) {
if (BC->HasRelocations) {		if (BC->HasRelocations) {
BC->exitWithBugReport("unable to fill CFI.", Function);		BC->exitWithBugReport("unable to fill CFI.", Function);
} else {		} else {
Show All 36 Lines	void RewriteInstance::buildFunctionsCFG() {
};		};

ParallelUtilities::runOnEachFunctionWithUniqueAllocId(		ParallelUtilities::runOnEachFunctionWithUniqueAllocId(
*BC, ParallelUtilities::SchedulingPolicy::SP_INST_LINEAR, WorkFun,		*BC, ParallelUtilities::SchedulingPolicy::SP_INST_LINEAR, WorkFun,
SkipPredicate, "disassembleFunctions-buildCFG",		SkipPredicate, "disassembleFunctions-buildCFG",
/ForceSequential/ opts::SequentialDisassembly \|\| opts::PrintAll);		/ForceSequential/ opts::SequentialDisassembly \|\| opts::PrintAll);

BC->postProcessSymbolTable();		BC->postProcessSymbolTable();

		// recomputeLandingPads is invoked during buildCFG for every function.
		// For split landing pad, it is possible that the target function does
		// not yet have CFG, so recomputeLandingPads would fail to capture
		// cross-function targets.
		//
		// Solution:
		// (a) Safely ignore split landing pad during buildCFG
		// (b) Rerun recomputeLandingPads after all functions have CFG
		for (BinaryFunction *Function : BC->getAllBinaryFunctions())
		if (Function->getState() == BinaryFunction::State::CFG)
		Function->recomputeLandingPads();
}		}

void RewriteInstance::postProcessFunctions() {		void RewriteInstance::postProcessFunctions() {
// We mark fragments as non-simple here, not during disassembly,		// We mark fragments as non-simple here, not during disassembly,
// So we can build their CFGs.		// So we can build their CFGs.
BC->skipMarkedFragments();		BC->skipMarkedFragments();
BC->clearFragmentsToSkip();		BC->clearFragmentsToSkip();

▲ Show 20 Lines • Show All 2,550 Lines • Show Last 20 Lines

bolt/test/X86/split-landing-pad.s

	Show All 19 Lines
	# .gcc_except_table specify the landing pads for try blocks			# .gcc_except_table specify the landing pads for try blocks
	# LPStart = 255 (omit), which means LPStart = foo start			# LPStart = 255 (omit), which means LPStart = foo start
	# Landing pads .Ltmp2 and .Ltmp5 in call site record are offset to foo start.			# Landing pads .Ltmp2 and .Ltmp5 in call site record are offset to foo start.


	# REQUIRES: system-linux			# REQUIRES: system-linux
	# RUN: llvm-mc -filetype=obj -triple x86_64-unknown-unknown %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple x86_64-unknown-unknown %s -o %t.o
	# RUN: %clang++ %cxxflags %t.o -o %t.exe -Wl,-q			# RUN: %clang++ %cxxflags %t.o -o %t.exe -Wl,-q
	# RUN: llvm-bolt -v=3 %t.exe -o %t.out 2>&1 \| FileCheck %s			# RUN: llvm-bolt -v=3 %t.exe -o %t.out -print-exceptions 2>&1 \| FileCheck %s

	# CHECK: BOLT-WARNING: Ignoring foo			# CHECK: BOLT-INFO: marking [[FOO_COLD:.+]] as a fragment of [[FOO:.+]]
	# CHECK: BOLT-WARNING: Ignoring foo.cold.1			# CHECK: landing pad label: [[TMP2:.+]]
				# CHECK: landing pad label: [[TMP5:.+]]
				# CHECK: BOLT-WARNING: Ignoring [[FOO]]
				# CHECK: BOLT-WARNING: Ignoring [[FOO_COLD]]
	# CHECK: BOLT-WARNING: skipped 2 functions due to cold fragments			# CHECK: BOLT-WARNING: skipped 2 functions due to cold fragments

	.text			.text
	.globl bar # -- Begin function bar			.globl bar # -- Begin function bar
	.p2align 4, 0x90			.p2align 4, 0x90
	.type bar,@function			.type bar,@function
	bar: # @bar			bar: # @bar
	.cfi_startproc			.cfi_startproc
	▲ Show 20 Lines • Show All 222 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[BOLT] Update labels for split landing padNeeds RevisionPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 448177

bolt/lib/Core/BinaryFunction.cpp

bolt/lib/Core/Exceptions.cpp

bolt/lib/Passes/SplitFunctions.cpp

bolt/lib/Rewrite/RewriteInstance.cpp

bolt/test/X86/split-landing-pad.s

[BOLT] Update labels for split landing pad
Needs RevisionPublic