This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/include/llvm/
-
include/
-
llvm/
-
CodeGen/
1/1
CommandFlags.inc
-
Target/
3/3
TargetMachine.h
6/6
TargetOptions.h

Differential D68063

Propeller: LLVM support for basic block sections
ClosedPublic

Authored by tmsriram on Sep 25 2019, 5:02 PM.

Download Raw Diff

Details

Reviewers

mehdi_amini
MaskRay
davidxl
efriedma
echristo

Commits

rG4dfe92e46542: Basic Block Sections Support.

Summary

This is the parent patch of a series of patches to enable Basic Block Sections support in LLVM which is the building block for the Propeller post link optimization framework. Please see the RFC here: https://groups.google.com/forum/#!msg/llvm-dev/ef3mKzAdJ7U/1shV64BYBAAJ and the detailed RFC doc here: https://github.com/google/llvm-propeller/blob/plo-dev/Propeller_RFC.pdf.

We introduce a new compiler option, -fbasicblock-sections=, which places every basic block in a unique ELF text section in the object file along with a symbol labeling the basic block. The linker can then order the basic block sections in any arbitrary sequence which when done correctly can encapsulate block layout, function layout and function splitting optimizations. However, there are a couple of challenges to be addressed for this to be feasible:

The compiler must not allow any implicit fall-through between any two adjacent basic blocks as they could be reordered at link time to be non-adjacent. In other words, the compiler must make a fall-through between adjacent basic blocks explicit by retaining the direct jump instruction that jumps to the next basic block. These branches can only be removed later by the linker after the blocks have been reordered.
All inter-basic block branch targets would now need to be resolved by the linker as they cannot be calculated during compile time. This is done using static relocations which bloats the size of the object files. Further, the compiler tries to use short branch instructions on some ISAs for branch offsets that can be accommodated in one byte. This is not possible with basic block sections as the offset is not determined at compile time, and long branch instructions have to be used everywhere.
Each additional section bloats object file sizes by tens of bytes. The number of basic blocks can be potentially very large compared to the size of functions and can bloat object sizes significantly. Option fbasicblock-sections= also takes a file path which can be used to specify a subset of basic blocks that needs unique sections to keep the bloats small.
Debug Info and CFI need special handling and will be presented as separate patches.

Basic Block Labels

With -fbasicblock-sections=labels, or when a basic block is placed in a unique section, it is labelled with a symbol. This allows easy mapping of virtual addresses from
PMU profiles back to the corresponding basic blocks. Since the number of basic blocks is large, the labeling bloats the symbol table sizes and the string table sizes significantly. While the binary size does increase, it does not affect performance as the symbol table is not loaded in memory during run-time. The string table size bloat is kept very minimal using a unary naming scheme that uses string suffix compression. The basic blocks for function foo are named "a.BB.foo", "aa.BB.foo", ... This turns out to be very good for string table sizes and the bloat in the string table size for a very large binary is ~8 %. The naming also allows using the --symbol-ordering-file option in LLD to arbitrarily reorder the sections.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

tmsriram created this revision.Sep 25 2019, 5:02 PM

Herald added a project: Restricted Project. · View Herald TranscriptSep 25 2019, 5:02 PM

Herald added subscribers: tschuett, hiraditya, aprantl. · View Herald Transcript

rahmanl mentioned this in D68073: Propeller code layout optimizations.Sep 25 2019, 11:48 PM

MaskRay added a subscriber: MaskRay.Sep 26 2019, 4:32 AM

A few comments after a quick high level scan of patch. Also - please mark all these related patches as parent and child versions in Phabricator.

llvm/include/llvm/CodeGen/MachineBasicBlock.h
139 ↗	(On Diff #221865)	comment
820 ↗	(On Diff #221865)	please run clang-format on this patch
llvm/include/llvm/Transforms/Utils/ModuleUtils.h
111 ↗	(On Diff #221865)	What happens if unique module id is not actually unique?
112 ↗	(On Diff #221865)	I don't see any calls using this new parameter in this patch - should this change be included with a different patch in the set?
llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
2988 ↗	(On Diff #221865)	Should this be uncommented, or removed?
llvm/lib/CodeGen/MachineBlockPlacement.cpp
3128 ↗	(On Diff #221865)	Only a whitespace change in this file, remove from patch.
llvm/lib/MC/MCDwarf.cpp
1481 ↗	(On Diff #221865)	add comment summarizing when dedup is needed
1711 ↗	(On Diff #221865)	Add parameter name constants to constant parameters here and in other calls to this function.
llvm/lib/Target/X86/X86FrameLowering.cpp
465 ↗	(On Diff #221865)	change to early return. Also a comment would be good
llvm/lib/Transforms/Utils/ModuleUtils.cpp
274 ↗	(On Diff #221865)	Confusing comment, as the first 2 sentences seem contradictory (module id not guaranteed to be unique, so we use it).

MaskRay added a reviewer: MaskRay.Sep 26 2019, 8:28 PM

MaskRay added inline comments.

llvm/lib/CodeGen/MachineBasicBlock.cpp
81 ↗	(On Diff #221865)	`std::string(getNumber(), 'a')`
llvm/lib/MC/ELFObjectWriter.cpp
1374 ↗	(On Diff #221865)	R_*_SIZE is x86 specific. If you want to use it on other architectures, it will require an ABI change.

aprantl added inline comments.Sep 27 2019, 10:52 AM

llvm/include/llvm/CodeGen/AsmPrinterHandler.h
71 ↗	(On Diff #221865)	Please doxygenify all comments in this patch according to https://llvm.org/docs/CodingStandards.html#doxygen-use-in-documentation-comments.
llvm/test/DebugInfo/X86/basicblock-sections-cfi.ll
85 ↗	(On Diff #221865)	Please try to remove all string attributes that aren't strictly necessary, they are a maintenance burden.

clang-format, comments, minor fixes suggested.

tmsriram marked 9 inline comments as done and an inline comment as not done.Sep 27 2019, 4:57 PM

tmsriram added inline comments.

llvm/include/llvm/Transforms/Utils/ModuleUtils.h
111 ↗	(On Diff #221865)	It does not make the situation any worse and we would could end up with two or more internal linkage functions having the same name. Profile attribution would not be accurate and could cause sub-optimal performance decisions particularly if this function is hot. Maybe we could warn about this and let the user pick another name for this.
112 ↗	(On Diff #221865)	It is used in clang/lib/CodeGen/CodeGenModule.cpp where the function name is formed. Is this ok?
llvm/lib/CodeGen/MachineBasicBlock.cpp
81 ↗	(On Diff #221865)	Great catch!
llvm/lib/MC/ELFObjectWriter.cpp
1374 ↗	(On Diff #221865)	Currently guarded with a -mrelocate-with-symbols option. Anything better we could do here?

tejohnson added inline comments.Oct 4 2019, 11:38 AM

llvm/include/llvm/Transforms/Utils/ModuleUtils.h
111 ↗	(On Diff #221865)	I think just clarify what the consequences of a collision are in the comments (would be a bigger problem if there was a possible correctness issue).
112 ↗	(On Diff #221865)	Can you connect these patches by parent/child relationships in phabricator?

Main changes:

Support for Basic block sections with exceptions
Support for selectively enabling basic block sections for a subset of basic blocks
Bug fixes with debug info and cfi

Herald added subscribers: aheejin, sbc100. · View Herald TranscriptJan 16 2020, 4:55 PM

aprantl added inline comments.Jan 17 2020, 11:13 AM

llvm/lib/CodeGen/AsmPrinter/EHStreamer.h
73 ↗	(On Diff #238663)	///
76 ↗	(On Diff #238663)	///

Fix comments.

tmsriram marked 2 inline comments as done.Jan 17 2020, 2:19 PM

tmsriram added inline comments.

llvm/lib/CodeGen/AsmPrinter/EHStreamer.h
76 ↗	(On Diff #238663)	This seems alright, other structs use the same style for per-field comments.

bcain added a subscriber: bcain.Jan 17 2020, 2:55 PM

tmsriram added a child revision: D73310: Allow Module name to be used to generate a unique Module ID.Jan 23 2020, 4:51 PM

Try to avoid monolithic patch like this. Please consider splitting it into a few smaller incremental patches with (possibly) independent testing. Logically, it can be split into 1) IR support; 2) machine BB level support; 3) debug support 4) CFI support 5) exception and 6) the 'main tranformation' part if there is one'.

Splitting the LLVM patch for Basic Block Sections Support into many smaller patches as follows:

Base patch for basic block sections support - This itself is split into Base_1 and Base_2
CFI Support for block sections
DebugInfo Support for basic block sections
Exceptions Support for basic block sections
Other smaller patches

We will link the dependent patches as children of this parent patch.

This is the parent patch and is the base patch (Base_1)

tmsriram added a reviewer: davidxl.Jan 29 2020, 2:36 PM

davidxl added inline comments.Jan 29 2020, 3:04 PM

llvm/include/llvm/Target/TargetMachine.h
262	nit: naming consistency: getBasicBlockSections -->getBBSections
268	isFunctionBBSectionsList.
275	getBBSectionSet

tmsriram added a child revision: D73674: Propeller: LLVM support for basic block sections (Base Patch - Part 2).Jan 29 2020, 4:57 PM

Include llc options for basic block sections.

tmsriram added a child revision: D73739: Exception support for basic block sections.Jan 30 2020, 2:03 PM

Change "BasicBlockSections" to "BBSections".

tmsriram marked 3 inline comments as done.Jan 30 2020, 2:51 PM

MaskRay added inline comments.Jan 30 2020, 4:56 PM

llvm/include/llvm/Target/TargetOptions.h
70	Newer code should use `enum class` instead of namespace+unscoped enumeration
71	Order enum members by the extent they enable basic block sections.
llvm/lib/CodeGen/CodeGenPrepare.cpp
450 ↗	(On Diff #241591)	We probably can make TargetPassConfig required so we don't have to check `TM` nullness everywhere. Created D73754

Support the basicblock-sections=<file> option in llc too for feature completeness here and to add tests.

Fix a typo.

Refactore getBBSectionsList function into a common place so that it can be shared with clang and lld.

Herald added a subscriber: mgorny. · View Herald TranscriptFeb 4 2020, 4:50 PM

tmsriram added a reviewer: efriedma.Feb 6 2020, 11:03 AM

I'm too far away from this part of the code to really assess if this is the proper way of plugin into the codegen unfortunately, I'd ask @echristo maybe?

llvm/include/llvm/CodeGen/CommandFlags.inc
26	Is this needed in this file?
llvm/include/llvm/IR/BasicBlock.h
436 ↗	(On Diff #242470)	These methods aren't defined (or used) anywhere.
llvm/include/llvm/ProfileData/PropellerProf.h
20 ↗	(On Diff #242470)	The "propeller-specific" aspect of it seems unrelated to "adding support for basic block sections"

Address reviewer comments. Delete changes to BasicBlock.h.

tmsriram marked 2 inline comments as done.Feb 6 2020, 11:51 AM

tmsriram added inline comments.

llvm/include/llvm/ProfileData/PropellerProf.h
20 ↗	(On Diff #242470)	We can rename this to BBSectionsList.h or something similar. This allows basic block sections for a specific set of basic blocks which is useful for Propeller.

Rebase and rename PropellerProf to BBSectionsProf.

Use MemoryBuffer::getFile instead of fstream.

I'm not really happy with the way functions lists are handled:

It probably makes sense to encode as a function attribute, rather than sticking booleans directly onto llvm::Function . Otherwise, we need to figure out a serialization story; we try to keep IR serialization as complete as possible, even as late as CodeGenPrepare.

Attaching the attributes should be done by a separate module pass, I think, not by jamming it into CodeGenPrepare. CodeGenPrepare is an optimization, this isn't (at least, not in the same sense).

Not sure it makes sense to stick the list of functions into TargetMachine, as opposed to just modifying the module when the file is parsed.

MaskRay mentioned this in D68049: Propeller: Clang options for basic block sections .Feb 10 2020, 9:07 PM

Remove usage of "propeller" from this patch, this is only about support for basic block sections.

(tests missing?)

llvm/include/llvm/ProfileData/BBSectionsProf.h
1 ↗	(On Diff #243943)	License blurb missing
llvm/lib/ProfileData/BBSectionsProf.cpp
1 ↗	(On Diff #243943)	License blurb missing

tmsriram added a child revision: D68049: Propeller: Clang options for basic block sections .Feb 11 2020, 12:36 PM

tmsriram removed a child revision: D73310: Allow Module name to be used to generate a unique Module ID.

Deleted changes to CodeGenPrepare.cpp, Function.*.

BBSections is now handled in a separate pass so this base patch becomes very simple.

tmsriram added a child revision: D68065: Propeller: LLD Support for Basic Block Sections.Feb 26 2020, 4:42 PM

tmsriram mentioned this in D68065: Propeller: LLD Support for Basic Block Sections.Feb 26 2020, 4:51 PM

Rebase.

@efriedma : Hi Eli, this is the parent patch of D73674. Appreciate if you could take a look at this too, thanks!

I'm not completely comfortable passing the BB-sections list to the backend as a file path; generally we try to allow the "frontend" (clang/llc/etc.) to control all file I/O. But I'm not sure what the alternative looks like. I guess we could pass it as MemoryBuffer?

Otherwise looks fine.

In D68063#1908455, @efriedma wrote:

I'm not completely comfortable passing the BB-sections list to the backend as a file path; generally we try to allow the "frontend" (clang/llc/etc.) to control all file I/O. But I'm not sure what the alternative looks like. I guess we could pass it as MemoryBuffer?

Otherwise looks fine.

Looking into this, it is not possible to keep the unique_ptr<MemoryBuffer> in TargetOptions.h as the assignment operator is deleted.
I may have to store the MemoryBuffer in TargetMachine or find another home for it, in which case I may need special handling for both llc and LTO.
I looked at how some other profile files are stored and noticed that for SampleProfileLoader, the file path is stored as a string in PassBuilder.h and the file seems to be read in the backend.

Thoughts? Thanks!

Address Reviewer comments.

Use MemoryBuffer in TargetOptions.h instead of storing the profile file as string. Make it shared_ptr to allow copying of Options.

Couple of inline comments. I'm trying pretty hard to be able to get rid of TargetOptions.h some day if possible. Any thoughts on ways to do this without?

llvm/include/llvm/Target/TargetOptions.h
73–74	Can you describe the use case behind this in the comments?
281–283	Why would you want to be able to copy it again?

Also the commit message is awesome, but would be good to get the commit message represented as comments in lots of the final code if possible :)

To be clear I think this is close to being acceptable, I'm just asking some questions before hitting the ack and getting things tidied up a bit so the next person doesn't have to ask :)

In D68063#1913547, @echristo wrote:

Also the commit message is awesome, but would be good to get the commit message represented as comments in lots of the final code if possible :)

Will do, I can go over the code and add these comments in appropriate places but if you have specific locations please mention.

llvm/include/llvm/Target/TargetOptions.h
73–74	Sure. Just to be clear, are you referring only to "List" or everything?
281–283	I am not copying Options but this is existing code. I can dig exactly where but Options is being copied and that disallows unique_ptr. shared_ptr seems fine as I only want one copy of the MemoryBuffer and that is guaranteed.

In D68063#1913544, @echristo wrote:

Couple of inline comments. I'm trying pretty hard to be able to get rid of TargetOptions.h some day if possible. Any thoughts on ways to do this without?

Missed this comment. I can dig more and maybe move it to TargetMachine.h potentially. This has to be done from llc and LTO too so not sure how that would work. Where were you planning on moving the existing fields of TargetOptions to? Thanks.

In LLVM core libraries, we generally want to accommodate use-cases that don't involve writing files to disk. This makes it easier to write tools targeting new use-cases. Here, for example, someone might want to try writing a JIT using a Propeller workflow. If SampleProfileLoader isn't supporting that, it should probably be fixed.

Mechanically, this seems fine... but it's also okay if we end up with a bit of similar code in multiple places. llc option parsing vs. lld option parsing is two distinct operations.

In D68063#1913593, @efriedma wrote:

In LLVM core libraries, we generally want to accommodate use-cases that don't involve writing files to disk. This makes it easier to write tools targeting new use-cases. Here, for example, someone might want to try writing a JIT using a Propeller workflow. If SampleProfileLoader isn't supporting that, it should probably be fixed.

Mechanically, this seems fine... but it's also okay if we end up with a bit of similar code in multiple places. llc option parsing vs. lld option parsing is two distinct operations.

Sure, I made the MemoryBuffer a shared_ptr in TargetOptions and moves the IO out of the LLVM core. The duplicated code in lld and llc is just the getFile and Error check. Thanks!

To be clear I think this is close to being acceptable

Agreed. I am still waiting for the resolution to my (very old) comment about:

namespace BasicBlockSection {
  enum SectionMode {

A scoped enumeration (enum class) will be clearer.

The summary is very long and includes lots of stuff not touched by this patch. The relevant paragraphs should be moved to a subsequent patch I think.

In D68063#1913667, @MaskRay wrote:

To be clear I think this is close to being acceptable

Agreed. I am still waiting for the resolution to my (very old) comment about:

Then, let me end your wait right away! :)

namespace BasicBlockSection {
  enum SectionMode {
A scoped enumeration (enum class) will be clearer.

I put it in its own namespace like many other enums in that file which are not scoped. If you insist, I can remove the namespace and make it scoped.

The summary is very long and includes lots of stuff not touched by this patch. The relevant paragraphs should be moved to a subsequent patch I think.

IMO, The ideal place to put the summary would be in BBSectionsPrepare.cpp which is the pass that does the actual analysis. I will do that.

Address Reviewer comments:

Make BasicBlock a scoped enum and reorder fields
More comments on all the enum values
Rebase

@eli.friedman @echristo Is this patch alright? I have added detailed comments as suggested by Eric to BBSectionsPrepare.cpp in https://reviews.llvm.org/D73674#change-lCLJRdamtrcE Thanks!

LGTM

This revision is now accepted and ready to land.Mar 11 2020, 5:31 PM

Please still consider my suggestion that removes unrelated paragraphs from the summary :)

echristo accepted this revision.Mar 11 2020, 5:59 PM

tmsriram edited the summary of this revision. (Show Details)Mar 12 2020, 1:02 PM

In D68063#1918478, @MaskRay wrote:

Please still consider my suggestion that removes unrelated paragraphs from the summary :)

Edited to keep it focussed on bb sections and labels.

Fix the comment for the option basicblock-sections.

Closed by commit rG4dfe92e46542: Basic Block Sections Support. (authored by tmsriram). · Explain WhyMar 14 2020, 6:48 PM

This revision was automatically updated to reflect the committed changes.

tmsriram mentioned this in rGdf082ac45aa0: Basic Block Sections support in LLVM..Mar 16 2020, 4:25 PM

tmsriram mentioned this in D78851: Debug Info Support for Basic Block Sections.Apr 24 2020, 8:26 PM

tmsriram mentioned this in rGe0bca46b0854: Options for Basic Block Sections, enabled in D68063 and D73674..Jun 2 2020, 1:05 AM

rahmanl removed a child revision: D73739: Exception support for basic block sections.Jul 15 2020, 12:10 PM

Allen added a subscriber: Allen.Sep 9 2022, 9:32 AM

Herald added a project: Restricted Project. · View Herald TranscriptSep 9 2022, 9:32 AM

Herald added a subscriber: StephenFan. · View Herald Transcript

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

CommandFlags.inc

34 lines

Target/

TargetMachine.h

14 lines

TargetOptions.h

31 lines

Diff 250395

llvm/include/llvm/CodeGen/CommandFlags.inc

Show All 17 Lines
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/MC/MCTargetOptionsCommandFlags.inc"		#include "llvm/MC/MCTargetOptionsCommandFlags.inc"
#include "llvm/MC/SubtargetFeature.h"		#include "llvm/MC/SubtargetFeature.h"
#include "llvm/Support/CodeGen.h"		#include "llvm/Support/CodeGen.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/Host.h"		#include "llvm/Support/Host.h"
#include "llvm/Target/TargetMachine.h"		#include "llvm/Target/TargetMachine.h"
#include "llvm/Target/TargetOptions.h"		#include "llvm/Target/TargetOptions.h"
#include <string>		#include <string>
		mehdi_aminiUnsubmitted Done Reply Inline Actions Is this needed in this file? mehdi_amini: Is this needed in this file?
using namespace llvm;		using namespace llvm;

static cl::opt<std::string>		static cl::opt<std::string>
MArch("march",		MArch("march",
cl::desc("Architecture to generate code for (see --version)"));		cl::desc("Architecture to generate code for (see --version)"));

static cl::opt<std::string>		static cl::opt<std::string>
MCPU("mcpu",		MCPU("mcpu",
▲ Show 20 Lines • Show All 198 Lines • ▼ Show 20 Lines	static cl::opt<bool> DataSections("data-sections",
cl::desc("Emit data into separate sections"),		cl::desc("Emit data into separate sections"),
cl::init(false));		cl::init(false));

static cl::opt<bool>		static cl::opt<bool>
FunctionSections("function-sections",		FunctionSections("function-sections",
cl::desc("Emit functions into separate sections"),		cl::desc("Emit functions into separate sections"),
cl::init(false));		cl::init(false));

		static cl::opt<std::string>
		BBSections("basicblock-sections",
		cl::desc("Emit basic blocks into separate sections"),
		cl::value_desc("all \| <function list (file)> \| labels \| none"),
		cl::init("none"));

static cl::opt<unsigned> TLSSize("tls-size",		static cl::opt<unsigned> TLSSize("tls-size",
cl::desc("Bit size of immediate TLS offsets"),		cl::desc("Bit size of immediate TLS offsets"),
cl::init(0));		cl::init(0));

static cl::opt<bool> EmulatedTLS("emulated-tls",		static cl::opt<bool> EmulatedTLS("emulated-tls",
cl::desc("Use emulated TLS model"),		cl::desc("Use emulated TLS model"),
cl::init(false));		cl::init(false));

static cl::opt<bool>		static cl::opt<bool>
UniqueSectionNames("unique-section-names",		UniqueSectionNames("unique-section-names",
cl::desc("Give unique names to every section"),		cl::desc("Give unique names to every section"),
cl::init(true));		cl::init(true));

		static cl::opt<bool> UniqueBBSectionNames(
		"unique-bb-section-names",
		cl::desc("Give unique names to every basic block section"),
		cl::init(false));

static cl::opt<llvm::EABI>		static cl::opt<llvm::EABI>
EABIVersion("meabi", cl::desc("Set EABI type (default depends on triple):"),		EABIVersion("meabi", cl::desc("Set EABI type (default depends on triple):"),
cl::init(EABI::Default),		cl::init(EABI::Default),
cl::values(clEnumValN(EABI::Default, "default",		cl::values(clEnumValN(EABI::Default, "default",
"Triple default EABI version"),		"Triple default EABI version"),
clEnumValN(EABI::EABI4, "4", "EABI version 4"),		clEnumValN(EABI::EABI4, "4", "EABI version 4"),
clEnumValN(EABI::EABI5, "5", "EABI version 5"),		clEnumValN(EABI::EABI5, "5", "EABI version 5"),
clEnumValN(EABI::GNU, "gnu", "EABI GNU")));		clEnumValN(EABI::GNU, "gnu", "EABI GNU")));
Show All 24 Lines	EnableDebugEntryValues("debug-entry-values",
cl::desc("Emit debug info about parameter's entry values"),		cl::desc("Emit debug info about parameter's entry values"),
cl::init(false));		cl::init(false));

static cl::opt<bool>		static cl::opt<bool>
ForceDwarfFrameSection("force-dwarf-frame-section",		ForceDwarfFrameSection("force-dwarf-frame-section",
cl::desc("Always emit a debug frame section."),		cl::desc("Always emit a debug frame section."),
cl::init(false));		cl::init(false));

		static llvm::BasicBlockSection
		getBBSectionsMode(llvm::TargetOptions &Options) {
		if (BBSections == "all")
		return BasicBlockSection::All;
		else if (BBSections == "labels")
		return BasicBlockSection::Labels;
		else if (BBSections == "none")
		return BasicBlockSection::None;
		else {
		ErrorOr<std::unique_ptr<MemoryBuffer>> MBOrErr =
		MemoryBuffer::getFile(BBSections);
		if (!MBOrErr) {
		errs() << "Error loading basic block sections function list file: "
		<< MBOrErr.getError().message() << "\n";
		} else {
		Options.BBSectionsFuncListBuf = std::move(*MBOrErr);
		}
		return BasicBlockSection::List;
		}
		}

// Common utility function tightly tied to the options listed here. Initializes		// Common utility function tightly tied to the options listed here. Initializes
// a TargetOptions object with CodeGen flags and returns it.		// a TargetOptions object with CodeGen flags and returns it.
static TargetOptions InitTargetOptionsFromCodeGenFlags() {		static TargetOptions InitTargetOptionsFromCodeGenFlags() {
TargetOptions Options;		TargetOptions Options;
Options.AllowFPOpFusion = FuseFPOps;		Options.AllowFPOpFusion = FuseFPOps;
Options.UnsafeFPMath = EnableUnsafeFPMath;		Options.UnsafeFPMath = EnableUnsafeFPMath;
Options.NoInfsFPMath = EnableNoInfsFPMath;		Options.NoInfsFPMath = EnableNoInfsFPMath;
Options.NoNaNsFPMath = EnableNoNaNsFPMath;		Options.NoNaNsFPMath = EnableNoNaNsFPMath;
Options.NoSignedZerosFPMath = EnableNoSignedZerosFPMath;		Options.NoSignedZerosFPMath = EnableNoSignedZerosFPMath;
Options.NoTrappingFPMath = EnableNoTrappingFPMath;		Options.NoTrappingFPMath = EnableNoTrappingFPMath;
Options.FPDenormalMode = DenormalFPMath;		Options.FPDenormalMode = DenormalFPMath;
Options.HonorSignDependentRoundingFPMathOption =		Options.HonorSignDependentRoundingFPMathOption =
EnableHonorSignDependentRoundingFPMath;		EnableHonorSignDependentRoundingFPMath;
if (FloatABIForCalls != FloatABI::Default)		if (FloatABIForCalls != FloatABI::Default)
Options.FloatABIType = FloatABIForCalls;		Options.FloatABIType = FloatABIForCalls;
Options.NoZerosInBSS = DontPlaceZerosInBSS;		Options.NoZerosInBSS = DontPlaceZerosInBSS;
Options.GuaranteedTailCallOpt = EnableGuaranteedTailCallOpt;		Options.GuaranteedTailCallOpt = EnableGuaranteedTailCallOpt;
Options.StackAlignmentOverride = OverrideStackAlignment;		Options.StackAlignmentOverride = OverrideStackAlignment;
Options.StackSymbolOrdering = StackSymbolOrdering;		Options.StackSymbolOrdering = StackSymbolOrdering;
Options.UseInitArray = !UseCtors;		Options.UseInitArray = !UseCtors;
Options.RelaxELFRelocations = RelaxELFRelocations;		Options.RelaxELFRelocations = RelaxELFRelocations;
Options.DataSections = DataSections;		Options.DataSections = DataSections;
Options.FunctionSections = FunctionSections;		Options.FunctionSections = FunctionSections;
		Options.BBSections = getBBSectionsMode(Options);
Options.UniqueSectionNames = UniqueSectionNames;		Options.UniqueSectionNames = UniqueSectionNames;
		Options.UniqueBBSectionNames = UniqueBBSectionNames;
Options.TLSSize = TLSSize;		Options.TLSSize = TLSSize;
Options.EmulatedTLS = EmulatedTLS;		Options.EmulatedTLS = EmulatedTLS;
Options.ExplicitEmulatedTLS = EmulatedTLS.getNumOccurrences() > 0;		Options.ExplicitEmulatedTLS = EmulatedTLS.getNumOccurrences() > 0;
Options.ExceptionModel = ExceptionModel;		Options.ExceptionModel = ExceptionModel;
Options.EmitStackSizeSection = EnableStackSizeSection;		Options.EmitStackSizeSection = EnableStackSizeSection;
Options.EmitAddrsig = EnableAddrsig;		Options.EmitAddrsig = EnableAddrsig;
Options.EmitCallSiteInfo = EmitCallSiteInfo;		Options.EmitCallSiteInfo = EmitCallSiteInfo;
Options.EnableDebugEntryValues = EnableDebugEntryValues;		Options.EnableDebugEntryValues = EnableDebugEntryValues;
▲ Show 20 Lines • Show All 137 Lines • Show Last 20 Lines

llvm/include/llvm/Target/TargetMachine.h

Show First 20 Lines • Show All 236 Lines • ▼ Show 20 Lines	public:
void setSupportsDefaultOutlining(bool Enable) {		void setSupportsDefaultOutlining(bool Enable) {
Options.SupportsDefaultOutlining = Enable;		Options.SupportsDefaultOutlining = Enable;
}		}

bool shouldPrintMachineCode() const { return Options.PrintMachineCode; }		bool shouldPrintMachineCode() const { return Options.PrintMachineCode; }

bool getUniqueSectionNames() const { return Options.UniqueSectionNames; }		bool getUniqueSectionNames() const { return Options.UniqueSectionNames; }

		/// Return true if unique basic block section names must be generated.
		bool getUniqueBBSectionNames() const { return Options.UniqueBBSectionNames; }

/// Return true if data objects should be emitted into their own section,		/// Return true if data objects should be emitted into their own section,
/// corresponds to -fdata-sections.		/// corresponds to -fdata-sections.
bool getDataSections() const {		bool getDataSections() const {
return Options.DataSections;		return Options.DataSections;
}		}

/// Return true if functions should be emitted into their own section,		/// Return true if functions should be emitted into their own section,
/// corresponding to -ffunction-sections.		/// corresponding to -ffunction-sections.
bool getFunctionSections() const {		bool getFunctionSections() const {
return Options.FunctionSections;		return Options.FunctionSections;
}		}

		/// If basic blocks should be emitted into their own section,
		/// corresponding to -fbasicblock-sections.
		llvm::BasicBlockSection getBBSectionsType() const {
		davidxlUnsubmitted Done Reply Inline Actions nit: naming consistency: getBasicBlockSections -->getBBSections davidxl: nit: naming consistency: getBasicBlockSections -->getBBSections
		return Options.BBSections;
		}

		/// Get the list of functions and basic block ids that need unique sections.
		const MemoryBuffer *getBBSectionsFuncListBuf() const {
		return Options.BBSectionsFuncListBuf.get();
		davidxlUnsubmitted Done Reply Inline Actions isFunctionBBSectionsList. davidxl: isFunctionBBSectionsList.
		}

/// Get a \c TargetIRAnalysis appropriate for the target.		/// Get a \c TargetIRAnalysis appropriate for the target.
///		///
/// This is used to construct the new pass manager's target IR analysis pass,		/// This is used to construct the new pass manager's target IR analysis pass,
/// set up appropriately for this target machine. Even the old pass manager		/// set up appropriately for this target machine. Even the old pass manager
/// uses this to answer queries about the IR.		/// uses this to answer queries about the IR.
		davidxlUnsubmitted Done Reply Inline Actions getBBSectionSet davidxl: getBBSectionSet
TargetIRAnalysis getTargetIRAnalysis();		TargetIRAnalysis getTargetIRAnalysis();

/// Return a TargetTransformInfo for a given function.		/// Return a TargetTransformInfo for a given function.
///		///
/// The returned TargetTransformInfo is specialized to the subtarget		/// The returned TargetTransformInfo is specialized to the subtarget
/// corresponding to \p F.		/// corresponding to \p F.
virtual TargetTransformInfo getTargetTransformInfo(const Function &F);		virtual TargetTransformInfo getTargetTransformInfo(const Function &F);

▲ Show 20 Lines • Show All 133 Lines • Show Last 20 Lines

llvm/include/llvm/Target/TargetOptions.h

Show All 10 Lines
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TARGET_TARGETOPTIONS_H		#ifndef LLVM_TARGET_TARGETOPTIONS_H
#define LLVM_TARGET_TARGETOPTIONS_H		#define LLVM_TARGET_TARGETOPTIONS_H

#include "llvm/MC/MCTargetOptions.h"		#include "llvm/MC/MCTargetOptions.h"

		#include <memory>

namespace llvm {		namespace llvm {
class MachineFunction;		class MachineFunction;
		class MemoryBuffer;
class Module;		class Module;

namespace FloatABI {		namespace FloatABI {
enum ABIType {		enum ABIType {
Default, // Target-specific (either soft or hard depending on triple, etc).		Default, // Target-specific (either soft or hard depending on triple, etc).
Soft, // Soft float.		Soft, // Soft float.
Hard // Hard float.		Hard // Hard float.
};		};
Show All 29 Lines	namespace FPDenormal {
enum DenormalMode {		enum DenormalMode {
IEEE, // IEEE 754 denormal numbers		IEEE, // IEEE 754 denormal numbers
PreserveSign, // the sign of a flushed-to-zero number is preserved in		PreserveSign, // the sign of a flushed-to-zero number is preserved in
// the sign of 0		// the sign of 0
PositiveZero // denormals are flushed to positive zero		PositiveZero // denormals are flushed to positive zero
};		};
}		}

		enum class BasicBlockSection {
		All, // Use Basic Block Sections for all basic blocks. A section
		MaskRayUnsubmitted Done Reply Inline Actions Newer code should use `enum class` instead of namespace+unscoped enumeration MaskRay: Newer code should use `enum class` instead of namespace+unscoped enumeration
		// for every basic block can significantly bloat object file sizes.
		MaskRayUnsubmitted Done Reply Inline Actions Order enum members by the extent they enable basic block sections. MaskRay: Order enum members by the extent they enable basic block sections.
		List, // Get list of functions & BBs from a file. Selectively enables
		// basic block sections for a subset of basic blocks which can be
		// used to control object size bloats from creating sections.
		echristoUnsubmitted Done Reply Inline Actions Can you describe the use case behind this in the comments? echristo: Can you describe the use case behind this in the comments?
		tmsriramAuthorUnsubmitted Done Reply Inline Actions Sure. Just to be clear, are you referring only to "List" or everything? tmsriram: Sure. Just to be clear, are you referring only to "List" or everything?
		Labels, // Do not use Basic Block Sections but label basic blocks. This
		// is useful when associating profile counts from virtual addresses
		// to basic blocks.
		None // Do not use Basic Block Sections.
		};

enum class EABI {		enum class EABI {
Unknown,		Unknown,
Default, // Default means not specified		Default, // Default means not specified
EABI4, // Target-specific (either 4, 5 or gnu depending on triple).		EABI4, // Target-specific (either 4, 5 or gnu depending on triple).
EABI5,		EABI5,
GNU		GNU
};		};

Show All 35 Lines	TargetOptions()
: PrintMachineCode(false), UnsafeFPMath(false), NoInfsFPMath(false),		: PrintMachineCode(false), UnsafeFPMath(false), NoInfsFPMath(false),
NoNaNsFPMath(false), NoTrappingFPMath(true),		NoNaNsFPMath(false), NoTrappingFPMath(true),
NoSignedZerosFPMath(false),		NoSignedZerosFPMath(false),
HonorSignDependentRoundingFPMathOption(false), NoZerosInBSS(false),		HonorSignDependentRoundingFPMathOption(false), NoZerosInBSS(false),
GuaranteedTailCallOpt(false), StackSymbolOrdering(true),		GuaranteedTailCallOpt(false), StackSymbolOrdering(true),
EnableFastISel(false), EnableGlobalISel(false), UseInitArray(false),		EnableFastISel(false), EnableGlobalISel(false), UseInitArray(false),
DisableIntegratedAS(false), RelaxELFRelocations(false),		DisableIntegratedAS(false), RelaxELFRelocations(false),
FunctionSections(false), DataSections(false),		FunctionSections(false), DataSections(false),
UniqueSectionNames(true), TrapUnreachable(false),		UniqueSectionNames(true), UniqueBBSectionNames(false),
NoTrapAfterNoreturn(false), TLSSize(0), EmulatedTLS(false),		TrapUnreachable(false), NoTrapAfterNoreturn(false), TLSSize(0),
ExplicitEmulatedTLS(false), EnableIPRA(false),		EmulatedTLS(false), ExplicitEmulatedTLS(false), EnableIPRA(false),
EmitStackSizeSection(false), EnableMachineOutliner(false),		EmitStackSizeSection(false), EnableMachineOutliner(false),
SupportsDefaultOutlining(false), EmitAddrsig(false),		SupportsDefaultOutlining(false), EmitAddrsig(false),
EmitCallSiteInfo(false), EnableDebugEntryValues(false),		EmitCallSiteInfo(false), EnableDebugEntryValues(false),
ForceDwarfFrameSection(false) {}		ForceDwarfFrameSection(false) {}

/// PrintMachineCode - This flag is enabled when the -print-machineinstrs		/// PrintMachineCode - This flag is enabled when the -print-machineinstrs
/// option is specified on the command line, and should enable debugging		/// option is specified on the command line, and should enable debugging
/// output from the code generator.		/// output from the code generator.
▲ Show 20 Lines • Show All 92 Lines • ▼ Show 20 Lines	public:
/// Emit functions into separate sections.		/// Emit functions into separate sections.
unsigned FunctionSections : 1;		unsigned FunctionSections : 1;

/// Emit data into separate sections.		/// Emit data into separate sections.
unsigned DataSections : 1;		unsigned DataSections : 1;

unsigned UniqueSectionNames : 1;		unsigned UniqueSectionNames : 1;

		/// Use unique names for basic block sections.
		unsigned UniqueBBSectionNames : 1;

/// Emit target-specific trap instruction for 'unreachable' IR instructions.		/// Emit target-specific trap instruction for 'unreachable' IR instructions.
unsigned TrapUnreachable : 1;		unsigned TrapUnreachable : 1;

/// Do not emit a trap instruction for 'unreachable' IR instructions behind		/// Do not emit a trap instruction for 'unreachable' IR instructions behind
/// noreturn calls, even if TrapUnreachable is true.		/// noreturn calls, even if TrapUnreachable is true.
unsigned NoTrapAfterNoreturn : 1;		unsigned NoTrapAfterNoreturn : 1;

/// Bit size of immediate TLS offsets (0 == use the default).		/// Bit size of immediate TLS offsets (0 == use the default).
Show All 16 Lines	public:
unsigned EnableMachineOutliner : 1;		unsigned EnableMachineOutliner : 1;

/// Set if the target supports default outlining behaviour.		/// Set if the target supports default outlining behaviour.
unsigned SupportsDefaultOutlining : 1;		unsigned SupportsDefaultOutlining : 1;

/// Emit address-significance table.		/// Emit address-significance table.
unsigned EmitAddrsig : 1;		unsigned EmitAddrsig : 1;

		/// Emit basic blocks into separate sections.
		BasicBlockSection BBSections = BasicBlockSection::None;

		/// Memory Buffer that contains information on sampled basic blocks and used
		/// to selectively generate basic block sections.
		std::shared_ptr<MemoryBuffer> BBSectionsFuncListBuf;
		echristoUnsubmitted Done Reply Inline Actions Why would you want to be able to copy it again? echristo: Why would you want to be able to copy it again?
		tmsriramAuthorUnsubmitted Done Reply Inline Actions I am not copying Options but this is existing code. I can dig exactly where but Options is being copied and that disallows unique_ptr. shared_ptr seems fine as I only want one copy of the MemoryBuffer and that is guaranteed. tmsriram: I am not copying Options but this is existing code. I can dig exactly where but Options is…

/// The flag enables call site info production. It is used only for debug		/// The flag enables call site info production. It is used only for debug
/// info, and it is restricted only to optimized code. This can be used for		/// info, and it is restricted only to optimized code. This can be used for
/// something else, so that should be controlled in the frontend.		/// something else, so that should be controlled in the frontend.
unsigned EmitCallSiteInfo : 1;		unsigned EmitCallSiteInfo : 1;
/// Emit debug info about parameter's entry values.		/// Emit debug info about parameter's entry values.
unsigned EnableDebugEntryValues : 1;		unsigned EnableDebugEntryValues : 1;

/// Emit DWARF debug frame section.		/// Emit DWARF debug frame section.
▲ Show 20 Lines • Show All 52 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Propeller: LLVM support for basic block sectionsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 250395

llvm/include/llvm/CodeGen/CommandFlags.inc

llvm/include/llvm/Target/TargetMachine.h

llvm/include/llvm/Target/TargetOptions.h

Propeller: LLVM support for basic block sections
ClosedPublic