This is an archive of the discontinued LLVM Phabricator instance.

Differential D124665

[flang][driver] Re-organise the code-gen actions (nfc)
ClosedPublic

Authored by awarzynski on Apr 29 2022, 2:43 AM.

Details

Reviewers

rovka
kiranchandramohan
Leporacanthicus
unterumarmung
ekieri
sscalpone

Commits

rGbb177edc44f4: [flang][driver] Re-organise the code-gen actions (nfc)

Summary

All frontend actions that generate code (MLIR, LLVM IR/BC,
Assembly/Object Code) are re-factored as essentially one action,
CodeGenAction, with minor specialisations. To facilitate all this,
CodeGenAction is extended to hold TargetMachine and backend action
type (MLIR vs LLVM IR vs LLVM BC vs Assembly vs Object Code).

CodeGenAction is no longer a pure abstract class and the
corresponding ExecuteAction is implemented so that it covers all use
cases. All this allows much better code re-use.

Key functionality is extracted into some helpful hooks:

SetUpTargetMachine
GetOutputStream
EmitObjectCodeHelper
EmitBCHelper

I hope that this clarifies the overall structure. I suspect that we may
need to revisit this again as the functionality grows in complexity.
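The structure described in the summary can be sketched roughly as follows. This is a minimal, standard-library-only illustration and not the code from the patch: `outputExtension`, `isBinaryOutput` and `outputName` are hypothetical names introduced here, and the real `CodeGenAction` derives from `FrontendAction` and also holds an `llvm::TargetMachine`. The enumerators and the extension/binary mapping follow the enum and the `GetOutputStream` switch shown in the diff.

```cpp
#include <cassert>
#include <string>

// Enumerators as they appear in the patch.
enum class BackendActionTy {
  Backend_EmitAssembly, // native assembly files
  Backend_EmitObj,      // native object files
  Backend_EmitBC,       // LLVM bitcode files
  Backend_EmitLL,       // human-readable LLVM assembly
  Backend_EmitMLIR      // MLIR files
};

// Hypothetical helper: output file extension for each backend action.
inline std::string outputExtension(BackendActionTy action) {
  switch (action) {
  case BackendActionTy::Backend_EmitAssembly:
    return "s";
  case BackendActionTy::Backend_EmitObj:
    return "o";
  case BackendActionTy::Backend_EmitBC:
    return "bc";
  case BackendActionTy::Backend_EmitLL:
    return "ll";
  case BackendActionTy::Backend_EmitMLIR:
    return "mlir";
  }
  return ""; // not reached for valid enumerators
}

// Hypothetical helper: only bitcode and object files are binary outputs.
inline bool isBinaryOutput(BackendActionTy action) {
  return action == BackendActionTy::Backend_EmitBC ||
         action == BackendActionTy::Backend_EmitObj;
}

// One concrete base class; the stored action selects the behaviour.
class CodeGenAction {
public:
  virtual ~CodeGenAction() = default;

  // Hypothetical stand-in for the shared ExecuteAction logic.
  std::string outputName(const std::string &stem) const {
    return stem + "." + outputExtension(action);
  }

protected:
  explicit CodeGenAction(BackendActionTy act) : action{act} {}

private:
  BackendActionTy action;
};

// Specialisations only pick the action; they add no logic of their own.
class EmitLLVMAction : public CodeGenAction {
public:
  EmitLLVMAction() : CodeGenAction(BackendActionTy::Backend_EmitLL) {}
};

class EmitObjAction : public CodeGenAction {
public:
  EmitObjAction() : CodeGenAction(BackendActionTy::Backend_EmitObj) {}
};
```

With this shape, adding a new output kind means adding an enumerator, one `case`, and a one-line subclass, rather than a new `ExecuteAction` override.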

Depends on D124664

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

awarzynski created this revision. Apr 29 2022, 2:43 AM

Herald added projects: Restricted Project, Restricted Project. Apr 29 2022, 2:43 AM

Herald added subscribers: Chia-hungDuan, rriddle.

awarzynski requested review of this revision. Apr 29 2022, 2:43 AM

Herald added subscribers: stephenneuendorffer, jdoerfert. Apr 29 2022, 2:43 AM

@rovka This is a somewhat "bolder" change than what you suggested in https://reviews.llvm.org/D123211. WDYT?

Harbormaster completed remote builds in B161951: Diff 426004. Apr 29 2022, 3:13 AM

awarzynski added a child revision: D124667: [flang][driver] Add support for consuming LLVM IR/BC files. Apr 29 2022, 3:50 AM

awarzynski added a child revision: D124669: [flang][driver] Add support for -save-temps. Apr 29 2022, 5:07 AM

I have only nits and questions about the patch:

flang/include/flang/Frontend/FrontendActions.h
175–179	Do we really need `Backend_` prefix here? Seems obsolete.
flang/lib/Frontend/FrontendActions.cpp
419	Why not define the destructor in the header?
517–540	I'd suggest unifying variable names in the function
539	I'm not quite familiar with the legacy pass manager, so I have a question: will `CodeGenPasses` delete the pass that is created here?

@unterumarmung , thanks for taking a look!

flang/include/flang/Frontend/FrontendActions.h
175–179	Naming is hard :) Consistency with Clang means that navigation will be easier for people familiar with it. It also means that design-wise the drivers are similar. These actions use either LLVM or MLIR to run the final phase of code-gen/lowering. From the point of view of Flang, LLVM and MLIR are backends for the Flang Fortran frontend. So, these actions are "backend" actions. Also, note that e.g. `EmitObj` is also defined in FrontendOptions.h. It's a completely different enum (so technically there would be no name clash), but wouldn't unique names be a bit less ambiguous?
flang/lib/Frontend/FrontendActions.cpp
419	Good question! When you define a destructor for `CodeGenAction` in the header file, it will be available in every translation unit that includes that header, e.g. ExecuteCompilerInvocation.cpp. This would mean that the corresponding symbol would be defined in the `flangFrontendTool` library. However, that's not desirable. The destructor for `CodeGenAction` needs to know how to destroy its member variables, e.g. `llvm::TargetMachine`. So, it depends on the library that defines the corresponding destructor. But `flangFrontendTool` is meant as a lightweight utility lib. It should only depend on `flangFrontend` (which defines `CodeGenAction`) and be oblivious to its implementation details. That's a lot of text for one line of code! Does it make sense?
517–540	Sadly, the coding style in the driver is a bit of a mix. We started with Flang's C++ style as that was the only style that was in use in LLVM Flang back then. However, lowering and code-gen, which were introduced later, use the MLIR style (from Allocatable.cpp): // Coding style: https://mlir.llvm.org/getting_started/DeveloperGuide/ As the driver also depends on Clang and uses various LLVM libs (e.g. for machine code generation), it in fact mixes 3 different styles. Not ideal and a ton of confusion - most/all of my own making! I like your suggestion, but I'd prefer to address it in a separate patch. In fact, I intend to suggest refactoring the driver to use the MLIR style soon (probably this week). This should fix the consistency in other places too. Would that be OK with you?
539	I don't consider myself an expert :) Sadly, there's not a single hint in the method declaration. However, there are multiple `delete`s in LegacyPassManager.cpp. That's a good sign! I am also assuming that Clang would definitely be doing the right thing: CodeGenPasses.add. `llc` and `opt` do the same, see here and here. So based on that, I assume that this is OK.
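The destructor point (line 419 above) can be illustrated with a small single-file sketch. It is an assumption-laden stand-in, not the patch itself: `Machine` plays the role of `llvm::TargetMachine`, `Action` the role of `CodeGenAction`, and `liveMachineCount` exists only so the effect can be observed. Note also that the patch's header includes `llvm/Target/TargetMachine.h`, whereas this sketch uses a forward declaration to make the dependency visible; the shared idea is that the destructor is defined out of line, in the one place that knows how to destroy the member.

```cpp
#include <cassert>
#include <memory>

// --- "Header" part: what every includer sees ---------------------------
struct Machine; // forward declaration only; stand-in for llvm::TargetMachine

class Action {
public:
  Action();
  ~Action(); // declared here, defined where Machine is a complete type
  bool hasMachine() const { return machine != nullptr; }

private:
  std::unique_ptr<Machine> machine;
};

// --- "Source" part: the only place that knows Machine's definition -----
static int liveMachines = 0; // counts Machine objects currently alive

struct Machine {
  Machine() { ++liveMachines; }
  ~Machine() { --liveMachines; }
};

Action::Action() : machine(std::make_unique<Machine>()) {}

// Defaulted out of line: the code that destroys `machine` is emitted here,
// so users of the header never need Machine's destructor themselves.
Action::~Action() = default;

// Hypothetical observer used to check that the member is really destroyed.
int liveMachineCount() { return liveMachines; }
```

Had `~Action()` been defined inside the class body, every translation unit that destroys an `Action` would need to instantiate `std::unique_ptr<Machine>`'s deleter and therefore depend on `Machine`'s destructor.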

rovka added inline comments. May 2 2022, 12:48 AM

flang/include/flang/Frontend/FrontendActions.h
175–179	Actually, since this isn't part of a class, I think the coding style says we should use something like CGAT_EmitBlah (although CG_EmitBlah might also be ok). I don't feel strongly either way, so use whatever names you prefer :)
flang/lib/Frontend/FrontendActions.cpp
511–533	Nit: ObjectCode makes me think more about bc and object files than about assembly and object files (I'm not sure MachineCode is much better, since assembly is not technically machine code, but I'm looking for something that suggests that we're running the actual backend passes and generating some kind of machine/target-specific representation). Also a small comment saying that it only works for Backend_EmitAssembly and Backend_EmitObj would be nice (or better yet, start the body with an assert about that).
556–569
631	Nit: I would move this above the other `if`, so it's closer to the EmitLL handling.

unterumarmung accepted this revision. May 2 2022, 9:38 AM

unterumarmung added inline comments.

flang/include/flang/Frontend/FrontendActions.h
175–179	@awarzynski, thank you for the explanation! I agree with the chosen naming now.
flang/lib/Frontend/FrontendActions.cpp
419	Sure it does! Thank you for the explanation
517–540	Sure!
539	Seems reasonable!

This revision is now accepted and ready to land. May 2 2022, 9:38 AM

awarzynski mentioned this in D124664: [flang][driver] Define the default frontend driver triple. May 3 2022, 2:11 AM

awarzynski added inline comments. May 3 2022, 3:22 AM

flang/include/flang/Frontend/FrontendActions.h
175–179	> Actually, since this isn't part of a class, I think the coding style says we should use something like CGAT_EmitBlah (although CG_EmitBlah might also be ok). I don't feel strongly either way, so use whatever names you prefer :)
Glad to see that at least one of us _actually_ read the coding style guideline :) This clearly needs updating then. I will try `BackendActionTy` for the enum and then the actual fields can be kept intact. Unless I misunderstood the guideline 🤔. WDYT?
flang/lib/Frontend/FrontendActions.cpp
511–533	Good points! I feel that `GenerateMachineCode` would be a bit too similar to GenerateLLVMIR. But that is a member method that will change the internal state of the corresponding `FrontendAction` (it sets `CodeGenAction::llvmModule`). This is a free function though. Let me try `GenerateMachineCodeOrAssemblyImpl` instead - this way we also make sure that machine-code vs assembly distinction is addressed.
> Also a small comment saying that it only works for Backend_EmitAssembly and Backend_EmitObj would be nice (or better yet, start the body with an assert about that).
Will add a comment, an assert and rename to make this limitation clear :)
556–569	Let me try `GenerateLLVMBCImpl` instead, just to differentiate this a bit from `GenerateLLVMIR` (see my earlier comment). WDYT?
631	Good point, moved!

Address comments from Diana

Moved some code around, simplified a bit, added comments, renamed CodeGenActionTy as BackendActionTy.

Herald added a reviewer: sscalpone. May 3 2022, 3:24 AM

Harbormaster completed remote builds in B162408: Diff 426629. May 3 2022, 3:39 AM

Rebase on top of main

Harbormaster completed remote builds in B162704: Diff 427027. May 4 2022, 9:22 AM

Just one microscopic issue, otherwise looks great, thanks!

flang/lib/Frontend/FrontendActions.cpp
525	Nit: && "" ? :) Are you going to put a message there? If not you can just remove that part.

awarzynski added inline comments. May 5 2022, 6:22 AM

flang/lib/Frontend/FrontendActions.cpp
525	I swear there used to be some very useful message there before 🤔 :) Good catch, will update before merging!

This revision was landed with ongoing or failed builds. May 5 2022, 7:06 AM

Closed by commit rGbb177edc44f4: [flang][driver] Re-organise the code-gen actions (nfc) (authored by awarzynski).

This revision was automatically updated to reflect the committed changes.

awarzynski added a commit: rGbb177edc44f4: [flang][driver] Re-organise the code-gen actions (nfc).

awarzynski mentioned this in D125027: [flang][driver] Add missing parentheses in an assert. May 5 2022, 11:02 AM

awarzynski mentioned this in rGc12ef70d2b0a: [flang][driver] Add missing parentheses in an assert. May 5 2022, 11:03 AM
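The assert at the top of `GenerateMachineCodeOrAssemblyImpl` in this revision has the shape `(a) || (b) && "msg"`. What exactly the follow-up D125027 changed is not shown on this page, so the connection is an assumption based on its title. The sketch below only illustrates the precedence rule involved: `&&` binds tighter than `||`, so the message string attaches to the second operand alone. The function names `asParsed` and `asIntended` are hypothetical, and the enum is a reduced copy of the one in the patch.

```cpp
#include <cassert>

// Reduced copy of the patch's enum; three enumerators are enough here.
enum class BackendActionTy {
  Backend_EmitAssembly,
  Backend_EmitObj,
  Backend_EmitBC
};

// How the compiler groups `(a) || (b) && "msg"`: && binds tighter than ||.
inline bool asParsed(BackendActionTy act) {
  return (act == BackendActionTy::Backend_EmitObj) ||
         ((act == BackendActionTy::Backend_EmitAssembly) &&
          "Unsupported action");
}

// Grouping that attaches the message to the whole condition.
inline bool asIntended(BackendActionTy act) {
  return ((act == BackendActionTy::Backend_EmitObj) ||
          (act == BackendActionTy::Backend_EmitAssembly)) &&
         "Unsupported action";
}
```

Because a string literal always converts to `true`, both groupings evaluate to the same value for every input. The extra parentheses therefore matter for readability and for compilers that warn about mixing `&&` and `||` without grouping, not for runtime behaviour.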

Revision Contents

Path	Size
flang/include/flang/Frontend/FrontendActions.h	51 lines
flang/lib/Frontend/FrontendActions.cpp	242 lines
flang/lib/FrontendTool/ExecuteCompilerInvocation.cpp	6 lines

Diff 427307

flang/include/flang/Frontend/FrontendActions.h

	Show All 10 Lines

	#include "flang/Frontend/FrontendAction.h"			#include "flang/Frontend/FrontendAction.h"
	#include "flang/Parser/parsing.h"			#include "flang/Parser/parsing.h"
	#include "flang/Semantics/semantics.h"			#include "flang/Semantics/semantics.h"

	#include "mlir/IR/BuiltinOps.h"			#include "mlir/IR/BuiltinOps.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/IR/Module.h"			#include "llvm/IR/Module.h"
				#include "llvm/Target/TargetMachine.h"
	#include <memory>			#include <memory>

	namespace Fortran::frontend {			namespace Fortran::frontend {

	// TODO: This is a copy from f18.cpp. It doesn't really belong here and should			// TODO: This is a copy from f18.cpp. It doesn't really belong here and should
	// be moved to a more suitable place in future.			// be moved to a more suitable place in future.
	struct MeasurementVisitor {			struct MeasurementVisitor {
	template <typename A> bool Pre(const A &) { return true; }			template <typename A> bool Pre(const A &) { return true; }
	▲ Show 20 Lines • Show All 138 Lines • ▼ Show 20 Lines

	class DebugDumpAllAction : public PrescanAndSemaDebugAction {			class DebugDumpAllAction : public PrescanAndSemaDebugAction {
	void ExecuteAction() override;			void ExecuteAction() override;
	};			};

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// CodeGen Actions			// CodeGen Actions
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
				/// Represents the type of "backend" action to perform by the corresponding
				/// CodeGenAction. Note that from Flang's perspective, both LLVM and MLIR are
				/// "backends" that are used for generating LLVM IR/BC, assembly files or
				/// machine code. This enum captures "what" exactly one of these backends is to
				/// do. The names are similar to what is used in Clang - this allows us to
				/// maintain some level of consistency/similarity between the drivers.
				enum class BackendActionTy {
				Backend_EmitAssembly, ///< Emit native assembly files
				Backend_EmitObj, ///< Emit native object files
				Backend_EmitBC, ///< Emit LLVM bitcode files
				Backend_EmitLL, ///< Emit human-readable LLVM assembly
				Backend_EmitMLIR ///< Emit MLIR files
				};

	/// Abstract base class for actions that generate code (MLIR, LLVM IR, assembly			/// Abstract base class for actions that generate code (MLIR, LLVM IR, assembly
	/// and machine code). Every action that inherits from this class will at			/// and machine code). Every action that inherits from this class will at
	/// least run the prescanning, parsing, semantic checks and lower the parse			/// least run the prescanning, parsing, semantic checks and lower the parse
	/// tree to an MLIR module.			/// tree to an MLIR module.
	class CodeGenAction : public FrontendAction {			class CodeGenAction : public FrontendAction {

	void ExecuteAction() override = 0;			void ExecuteAction() override;
	/// Runs prescan, parsing, sema and lowers to MLIR.			/// Runs prescan, parsing, sema and lowers to MLIR.
	bool BeginSourceFileAction() override;			bool BeginSourceFileAction() override;
				void SetUpTargetMachine();

	protected:			protected:
				CodeGenAction(BackendActionTy act) : action{act} {};
	/// @name MLIR			/// @name MLIR
	/// {			/// {
	std::unique_ptr<mlir::ModuleOp> mlirModule;			std::unique_ptr<mlir::ModuleOp> mlirModule;
	std::unique_ptr<mlir::MLIRContext> mlirCtx;			std::unique_ptr<mlir::MLIRContext> mlirCtx;
	/// }			/// }

	/// @name LLVM IR			/// @name LLVM IR
	std::unique_ptr<llvm::LLVMContext> llvmCtx;			std::unique_ptr<llvm::LLVMContext> llvmCtx;
	std::unique_ptr<llvm::Module> llvmModule;			std::unique_ptr<llvm::Module> llvmModule;

	/// Generates an LLVM IR module from CodeGenAction::mlirModule and saves it			/// Generates an LLVM IR module from CodeGenAction::mlirModule and saves it
	/// in CodeGenAction::llvmModule.			/// in CodeGenAction::llvmModule.
	void GenerateLLVMIR();			void GenerateLLVMIR();

				BackendActionTy action;

				std::unique_ptr<llvm::TargetMachine> TM;
	/// }			/// }
				public:
				~CodeGenAction() override;
	};			};

	class EmitMLIRAction : public CodeGenAction {			class EmitMLIRAction : public CodeGenAction {
	void ExecuteAction() override;			public:
				EmitMLIRAction() : CodeGenAction(BackendActionTy::Backend_EmitMLIR) {}
	};			};

	class EmitLLVMAction : public CodeGenAction {			class EmitLLVMAction : public CodeGenAction {
	void ExecuteAction() override;			public:
				EmitLLVMAction() : CodeGenAction(BackendActionTy::Backend_EmitLL) {}
	};			};

	class EmitLLVMBitcodeAction : public CodeGenAction {			class EmitLLVMBitcodeAction : public CodeGenAction {
	void ExecuteAction() override;			public:
				EmitLLVMBitcodeAction() : CodeGenAction(BackendActionTy::Backend_EmitBC) {}
	};			};

	class BackendAction : public CodeGenAction {			class EmitObjAction : public CodeGenAction {
	public:			public:
	enum class BackendActionTy {			EmitObjAction() : CodeGenAction(BackendActionTy::Backend_EmitObj) {}
	Backend_EmitAssembly, ///< Emit native assembly files
	Backend_EmitObj ///< Emit native object files
	};			};

	BackendAction(BackendActionTy act) : action{act} {};			class EmitAssemblyAction : public CodeGenAction {
				public:
	private:			EmitAssemblyAction() : CodeGenAction(BackendActionTy::Backend_EmitAssembly) {}
	void ExecuteAction() override;

	BackendActionTy action;
	};			};

	} // namespace Fortran::frontend			} // namespace Fortran::frontend

	#endif // LLVM_FLANG_FRONTEND_FRONTENDACTIONS_H			#endif // LLVM_FLANG_FRONTEND_FRONTENDACTIONS_H

flang/lib/Frontend/FrontendActions.cpp

Show All 30 Lines

#include "mlir/Pass/PassManager.h" #include "mlir/Pass/PassManager.h"

#include "mlir/Target/LLVMIR/ModuleTranslation.h" #include "mlir/Target/LLVMIR/ModuleTranslation.h"

#include "clang/Basic/DiagnosticFrontend.h" #include "clang/Basic/DiagnosticFrontend.h"

#include "llvm/ADT/StringRef.h" #include "llvm/ADT/StringRef.h"

#include "llvm/Analysis/TargetLibraryInfo.h" #include "llvm/Analysis/TargetLibraryInfo.h"

#include "llvm/Analysis/TargetTransformInfo.h" #include "llvm/Analysis/TargetTransformInfo.h"

#include "llvm/Bitcode/BitcodeWriterPass.h" #include "llvm/Bitcode/BitcodeWriterPass.h"

#include "llvm/IR/LegacyPassManager.h" #include "llvm/IR/LegacyPassManager.h"

#include "llvm/IRReader/IRReader.h"

#include "llvm/MC/TargetRegistry.h" #include "llvm/MC/TargetRegistry.h"

#include "llvm/Passes/PassBuilder.h" #include "llvm/Passes/PassBuilder.h"

#include "llvm/Support/ErrorHandling.h" #include "llvm/Support/ErrorHandling.h"

#include "llvm/Support/SourceMgr.h"

#include "llvm/Target/TargetMachine.h" #include "llvm/Target/TargetMachine.h"

#include <clang/Basic/Diagnostic.h> #include <clang/Basic/Diagnostic.h>

#include <memory> #include <memory>

using namespace Fortran::frontend; using namespace Fortran::frontend;

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

// Custom BeginSourceFileAction // Custom BeginSourceFileAction

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

bool PrescanAction::BeginSourceFileAction() { return RunPrescan(); } bool PrescanAction::BeginSourceFileAction() { return RunPrescan(); }

bool PrescanAndParseAction::BeginSourceFileAction() { bool PrescanAndParseAction::BeginSourceFileAction() {

return RunPrescan() && RunParse(); return RunPrescan() && RunParse();

} }

bool PrescanAndSemaAction::BeginSourceFileAction() { bool PrescanAndSemaAction::BeginSourceFileAction() {

return RunPrescan() && RunParse() && RunSemanticChecks() && return RunPrescan() && RunParse() && RunSemanticChecks() &&

▲ Show 20 Lines • Show All 345 Lines • ▼ Show 20 Lines void GetSymbolsSourcesAction::ExecuteAction() {

// Report and exit if fatal semantic errors are present // Report and exit if fatal semantic errors are present

if (reportFatalSemanticErrors()) { if (reportFatalSemanticErrors()) {

return; return;

} }

ci.semantics().DumpSymbolsSources(llvm::outs()); ci.semantics().DumpSymbolsSources(llvm::outs());

} }

//===----------------------------------------------------------------------===//

// CodeGenActions

//===----------------------------------------------------------------------===//

CodeGenAction::~CodeGenAction() = default;

#include "flang/Tools/CLOptions.inc" #include "flang/Tools/CLOptions.inc"

// Lower the previously generated MLIR module into an LLVM IR module // Lower the previously generated MLIR module into an LLVM IR module

void CodeGenAction::GenerateLLVMIR() { void CodeGenAction::GenerateLLVMIR() {

assert(mlirModule && "The MLIR module has not been generated yet."); assert(mlirModule && "The MLIR module has not been generated yet.");

CompilerInstance &ci = this->instance(); CompilerInstance &ci = this->instance();

Show All 26 Lines void CodeGenAction::GenerateLLVMIR() {

if (!llvmModule) { if (!llvmModule) {

unsigned diagID = ci.diagnostics().getCustomDiagID( unsigned diagID = ci.diagnostics().getCustomDiagID(

clang::DiagnosticsEngine::Error, "failed to create the LLVM module"); clang::DiagnosticsEngine::Error, "failed to create the LLVM module");

ci.diagnostics().Report(diagID); ci.diagnostics().Report(diagID);

return; return;

} }

void EmitLLVMAction::ExecuteAction() { void CodeGenAction::SetUpTargetMachine() {

CompilerInstance &ci = this->instance();

GenerateLLVMIR();

// If set, use the predefined outupt stream to print the generated module.

if (!ci.IsOutputStreamNull()) {

llvmModule->print(

ci.GetOutputStream(), /*AssemblyAnnotationWriter=*/nullptr);

return;

}

// No predefined output stream was set. Create an output file and dump the

// generated module there.

std::unique_ptr<llvm::raw_ostream> os = ci.CreateDefaultOutputFile(

/*Binary=*/false, /*InFile=*/GetCurrentFileOrBufferName(), "ll");

if (!os) {

unsigned diagID = ci.diagnostics().getCustomDiagID(

clang::DiagnosticsEngine::Error, "failed to create the output file");

ci.diagnostics().Report(diagID);

return;

}

llvmModule->print(*os, /*AssemblyAnnotationWriter=*/nullptr);

}

void EmitLLVMBitcodeAction::ExecuteAction() {

CompilerInstance &ci = this->instance(); CompilerInstance &ci = this->instance();

// Generate an LLVM module if it's not already present (it will already be

// present if the input file is an LLVM IR/BC file).

if (!llvmModule)

GenerateLLVMIR();

// Set the triple based on the CompilerInvocation set-up // Set the triple based on the CompilerInvocation set-up

const std::string &theTriple = ci.invocation().targetOpts().triple; const std::string &theTriple = ci.invocation().targetOpts().triple;

if (llvmModule->getTargetTriple() != theTriple) { if (llvmModule->getTargetTriple() != theTriple) {

ci.diagnostics().Report(clang::diag::warn_fe_override_module) << theTriple; ci.diagnostics().Report(clang::diag::warn_fe_override_module) << theTriple;

llvmModule->setTargetTriple(theTriple); llvmModule->setTargetTriple(theTriple);

} }

// Create `Target` // Create `Target`

std::string error; std::string error;

const llvm::Target *theTarget = const llvm::Target *theTarget =

llvm::TargetRegistry::lookupTarget(theTriple, error); llvm::TargetRegistry::lookupTarget(theTriple, error);

assert(theTarget && "Failed to create Target"); assert(theTarget && "Failed to create Target");

// Create and configure `TargetMachine` // Create `TargetMachine`

std::unique_ptr<llvm::TargetMachine> TM( TM.reset(theTarget->createTargetMachine(theTriple, /*CPU=*/"",

theTarget->createTargetMachine(theTriple, /*CPU=*/"", /*Features=*/"",

/*Features=*/"", llvm::TargetOptions(), llvm::None)); llvm::TargetOptions(), llvm::None));

assert(TM && "Failed to create TargetMachine"); assert(TM && "Failed to create TargetMachine");

llvmModule->setDataLayout(TM->createDataLayout()); llvmModule->setDataLayout(TM->createDataLayout());

// Generate an output file

std::unique_ptr<llvm::raw_ostream> os = ci.CreateDefaultOutputFile(

/*Binary=*/true, /*InFile=*/GetCurrentFileOrBufferName(), "bc");

if (!os) {

unsigned diagID = ci.diagnostics().getCustomDiagID(

clang::DiagnosticsEngine::Error, "failed to create the output file");

ci.diagnostics().Report(diagID);

return;

} }

// Set-up the pass manager static std::unique_ptr<llvm::raw_pwrite_stream>

llvm::ModulePassManager MPM; GetOutputStream(CompilerInstance &ci, llvm::StringRef inFile,

llvm::ModuleAnalysisManager MAM; BackendActionTy action) {

llvm::PassBuilder PB(TM.get()); switch (action) {

PB.registerModuleAnalyses(MAM); case BackendActionTy::Backend_EmitAssembly:

MPM.addPass(llvm::BitcodeWriterPass(*os)); return ci.CreateDefaultOutputFile(

/*Binary=*/false, inFile, /*extension=*/"s");

case BackendActionTy::Backend_EmitLL:

return ci.CreateDefaultOutputFile(

/*Binary=*/false, inFile, /*extension=*/"ll");

case BackendActionTy::Backend_EmitMLIR:

return ci.CreateDefaultOutputFile(

/*Binary=*/false, inFile, /*extension=*/"mlir");

case BackendActionTy::Backend_EmitBC:

return ci.CreateDefaultOutputFile(

/*Binary=*/true, inFile, /*extension=*/"bc");

case BackendActionTy::Backend_EmitObj:

return ci.CreateDefaultOutputFile(

/*Binary=*/true, inFile, /*extension=*/"o");

}

// Run the passes llvm_unreachable("Invalid action!");

MPM.run(*llvmModule, MAM);

} }

void EmitMLIRAction::ExecuteAction() { /// Generate target-specific machine-code or assembly file from the input LLVM

CompilerInstance &ci = this->instance(); /// module.

///

/// \param [in] diags Diagnostics engine for reporting errors

/// \param [in] TM Target machine to aid the code-gen pipeline set-up

/// \param [in] act Backend act to run (assembly vs machine-code generation)

/// \param [in] llvmModule LLVM module to lower to assembly/machine-code

/// \param [out] os Output stream to emit the generated code to

static void GenerateMachineCodeOrAssemblyImpl(clang::DiagnosticsEngine &diags,

llvm::TargetMachine &TM,

BackendActionTy act,

llvm::Module &llvmModule,

llvm::raw_pwrite_stream &os) {

assert((act == BackendActionTy::Backend_EmitObj) ||

(act == BackendActionTy::Backend_EmitAssembly) &&

"Unsupported action");

// Print the output. If a pre-defined output stream exists, dump the MLIR // Set-up the pass manager, i.e create an LLVM code-gen pass pipeline.

// content there. // Currently only the legacy pass manager is supported.

if (!ci.IsOutputStreamNull()) { // TODO: Switch to the new PM once it's available in the backend.

mlirModule->print(ci.GetOutputStream()); llvm::legacy::PassManager CodeGenPasses;

return; CodeGenPasses.add(

} createTargetTransformInfoWrapperPass(TM.getTargetIRAnalysis()));

rovka:

llvm_unreachable("Invalid action!");

}

- void EmitObjectCodeHelper(CompilerInstance &ci, llvm::TargetMachine &TM,

+ void GenerateMachineCode(CompilerInstance &ci, llvm::TargetMachine &TM,

CodeGenActionTy action, llvm::Module &llvmModule,

Nit: ObjectCode makes me think more about bc and object files than about assembly and object files (I'm not sure MachineCode is much better, since assembly is not technically machine code, but I'm looking for something that suggests that we're running the actual backend passes and generating some kind of machine/target-specific representation).

Also a small comment saying that it only works for Backend_EmitAssembly and Backend_EmitObj would be nice (or better yet, start the body with an assert about that).

awarzynski:

Good points!

I feel that GenerateMachineCode would be a bit too similar to GenerateLLVMIR. But that is a member method that will change the internal state of the corresponding FrontendAction (it sets CodeGenAction::llvmModule). This is a free function though. Let me try GenerateMachineCodeOrAssemblyImpl instead - this way we also make sure that machine-code vs assembly distinction is addressed.

> Also a small comment saying that it only works for Backend_EmitAssembly and Backend_EmitObj would be nice (or better yet, start the body with an assert about that).

Will add a comment, an assert and rename to make this limitation clear :)

// ... otherwise, print to a file. llvm::Triple triple(llvmModule.getTargetTriple());

std::unique_ptr<llvm::raw_pwrite_stream> os{ci.CreateDefaultOutputFile( std::unique_ptr<llvm::TargetLibraryInfoImpl> TLII =

/*Binary=*/true, /*InFile=*/GetCurrentFileOrBufferName(), "mlir")}; std::make_unique<llvm::TargetLibraryInfoImpl>(triple);

if (!os) { assert(TLII && "Failed to create TargetLibraryInfo");

unsigned diagID = ci.diagnostics().getCustomDiagID( CodeGenPasses.add(new llvm::TargetLibraryInfoWrapperPass(*TLII));

clang::DiagnosticsEngine::Error, "failed to create the output file");

unterumarmung:

// TODO: Switch to the new PM once it's available in the backend.

- llvm::legacy::PassManager CodeGenPasses;

- CodeGenPasses.add(

+ llvm::legacy::PassManager codeGenPasses;

+ codeGenPasses.add(

createTargetTransformInfoWrapperPass(TM.getTargetIRAnalysis()));

llvm::Triple triple(llvmModule.getTargetTriple());

- std::unique_ptr<llvm::TargetLibraryInfoImpl> TLII =

+ std::unique_ptr<llvm::TargetLibraryInfoImpl> tlii =

std::make_unique<llvm::TargetLibraryInfoImpl>(triple);

- assert(TLII && "Failed to create TargetLibraryInfo");

- CodeGenPasses.add(new llvm::TargetLibraryInfoWrapperPass(*TLII));

+ assert(tlii && "Failed to create TargetLibraryInfo");

+ CodeGenPasses.add(new llvm::TargetLibraryInfoWrapperPass(*tlii));

llvm::CodeGenFileType cgft = (action == CodeGenActionTy::Backend_EmitAssembly)

? llvm::CodeGenFileType::CGFT_AssemblyFile

: llvm::CodeGenFileType::CGFT_ObjectFile;

if (TM.addPassesToEmitFile(CodeGenPasses,

ci.IsOutputStreamNull() ? os : ci.GetOutputStream(), nullptr, cgft)) {

unsigned diagID =

ci.diagnostics().getCustomDiagID(clang::DiagnosticsEngine::Error,

"emission of this file type is not supported");

ci.diagnostics().Report(diagID);

return;

}

// Run the code-gen passes

- CodeGenPasses.run(llvmModule);

+ codeGenPasses.run(llvmModule);

}

void EmitBCHelper(llvm::TargetMachine &TM, llvm::Module &llvmModule,

I'd suggest unifying the variable names in the function

awarzynski (Author)

Done

Sadly, the coding style in the driver is a bit of a mix.

We started with Flang's C++ style as that was the only style in use in LLVM Flang back then. However, lowering and code-gen, which were introduced later, use the MLIR style (from Allocatable.cpp):

// Coding style: https://mlir.llvm.org/getting_started/DeveloperGuide/

As the driver also depends on Clang and uses various LLVM libs (e.g. for machine code generation), it in fact mixes 3 different styles. Not ideal and a ton of confusion - most/all of my own making!

I like your suggestion, but I'd prefer to address it in a separate patch. In fact, I intend to suggest refactoring the driver to use the MLIR style soon (probably this week). This should fix the consistency in other places too. Would that be OK with you?

unterumarmung

Not Done

Sure!

ci.diagnostics().Report(diagID); llvm::CodeGenFileType cgft = (act == BackendActionTy::Backend_EmitAssembly)

? llvm::CodeGenFileType::CGFT_AssemblyFile

: llvm::CodeGenFileType::CGFT_ObjectFile;

if (TM.addPassesToEmitFile(CodeGenPasses, os, nullptr, cgft)) {

unsigned diagID =

diags.getCustomDiagID(clang::DiagnosticsEngine::Error,

"emission of this file type is not supported");

diags.Report(diagID);

return; return;

} }

mlirModule->print(*os); // Run the passes

CodeGenPasses.run(llvmModule);

} }

void BackendAction::ExecuteAction() { /// Generate LLVM byte code file from the input LLVM module.

CompilerInstance &ci = this->instance(); ///

// Generate an LLVM module if it's not already present (it will already be /// \param [in] TM Target machine to aid the code-gen pipeline set-up

// present if the input file is an LLVM IR/BC file). /// \param [in] llvmModule LLVM module to lower to assembly/machine-code

if (!llvmModule) /// \param [out] os Output stream to emit the generated code to

GenerateLLVMIR(); static void GenerateLLVMBCImpl(llvm::TargetMachine &TM,

llvm::Module &llvmModule,

llvm::raw_pwrite_stream &os) {

// Set-up the pass manager

llvm::ModulePassManager MPM;

llvm::ModuleAnalysisManager MAM;

llvm::PassBuilder PB(&TM);

PB.registerModuleAnalyses(MAM);

MPM.addPass(llvm::BitcodeWriterPass(os));

rovka

Not Done

CodeGenPasses.run(llvmModule);

}

- void EmitBCHelper(llvm::TargetMachine &TM, llvm::Module &llvmModule,

+ void GenerateLLVMBC(llvm::TargetMachine &TM, llvm::Module &llvmModule,

llvm::raw_pwrite_stream &os) {

awarzynski (Author)

Done

Let me try GenerateLLVMBCImpl instead, just to differentiate this a bit from GenerateLLVMIR (see my earlier comment). WDYT?

// Set the triple based on the CompilerInvocation set-up // Run the passes

const std::string &theTriple = ci.invocation().targetOpts().triple; MPM.run(llvmModule, MAM);

if (llvmModule->getTargetTriple() != theTriple) {

ci.diagnostics().Report(clang::diag::warn_fe_override_module) << theTriple;

llvmModule->setTargetTriple(theTriple);

} }

// Create `Target` void CodeGenAction::ExecuteAction() {

std::string error; CompilerInstance &ci = this->instance();

const llvm::Target *theTarget =

llvm::TargetRegistry::lookupTarget(theTriple, error);

assert(theTarget && "Failed to create Target");

// Create `TargetMachine`

std::unique_ptr<llvm::TargetMachine> TM(

theTarget->createTargetMachine(theTriple, /*CPU=*/"",

/*Features=*/"", llvm::TargetOptions(), llvm::None));

assert(TM && "Failed to create TargetMachine");

llvmModule->setDataLayout(TM->createDataLayout());

// If the output stream is a file, generate it and define the corresponding
// output stream. If a pre-defined output stream is available, we will use
// that instead.
//
// NOTE: `os` is a smart pointer that will be destroyed at the end of this
// method. However, it won't be written to until `CodeGenPasses` is
// destroyed. By defining `os` before `CodeGenPasses`, we make sure that the
// output stream won't be destroyed before it is written to. This only
// applies when an output file is used (i.e. there is no pre-defined output
// stream).
// TODO: Revisit once the new PM is ready (i.e. when `CodeGenPasses` is
// updated to use it).
std::unique_ptr<llvm::raw_pwrite_stream> os;
if (ci.IsOutputStreamNull()) {

// Get the output buffer/file os = GetOutputStream(ci, GetCurrentFileOrBufferName(), action);

switch (action) {

case BackendActionTy::Backend_EmitAssembly:

os = ci.CreateDefaultOutputFile(

/*Binary=*/false, /*InFile=*/GetCurrentFileOrBufferName(), "s");

break;

case BackendActionTy::Backend_EmitObj:

os = ci.CreateDefaultOutputFile(

/*Binary=*/true, /*InFile=*/GetCurrentFileOrBufferName(), "o");

break;

}

if (!os) {
unsigned diagID = ci.diagnostics().getCustomDiagID(
clang::DiagnosticsEngine::Error, "failed to create the output file");
ci.diagnostics().Report(diagID);
return;
}

// Create an LLVM code-gen pass pipeline. Currently only the legacy pass if (action == BackendActionTy::Backend_EmitMLIR) {

// manager is supported. mlirModule->print(ci.IsOutputStreamNull() ? *os : ci.GetOutputStream());

// TODO: Switch to the new PM once it's available in the backend. return;

llvm::legacy::PassManager CodeGenPasses; }

CodeGenPasses.add(

createTargetTransformInfoWrapperPass(TM->getTargetIRAnalysis()));

llvm::Triple triple(theTriple);

std::unique_ptr<llvm::TargetLibraryInfoImpl> TLII = // Generate an LLVM module if it's not already present (it will already be

std::make_unique<llvm::TargetLibraryInfoImpl>(triple); // present if the input file is an LLVM IR/BC file).

assert(TLII && "Failed to create TargetLibraryInfo"); if (!llvmModule)

CodeGenPasses.add(new llvm::TargetLibraryInfoWrapperPass(*TLII)); GenerateLLVMIR();

llvm::CodeGenFileType cgft = (action == BackendActionTy::Backend_EmitAssembly) if (action == BackendActionTy::Backend_EmitLL) {

? llvm::CodeGenFileType::CGFT_AssemblyFile llvmModule->print(ci.IsOutputStreamNull() ? *os : ci.GetOutputStream(),

: llvm::CodeGenFileType::CGFT_ObjectFile; /*AssemblyAnnotationWriter=*/nullptr);

if (TM->addPassesToEmitFile(CodeGenPasses,

ci.IsOutputStreamNull() ? *os : ci.GetOutputStream(), nullptr,

cgft)) {

unsigned diagID =

ci.diagnostics().getCustomDiagID(clang::DiagnosticsEngine::Error,

"emission of this file type is not supported");

ci.diagnostics().Report(diagID);

return; return;

} }

// Run the code-gen passes SetUpTargetMachine();

CodeGenPasses.run(*llvmModule); if (action == BackendActionTy::Backend_EmitBC) {

GenerateLLVMBCImpl(*TM, *llvmModule, *os);

return;

}

if (action == BackendActionTy::Backend_EmitAssembly ||

action == BackendActionTy::Backend_EmitObj) {

GenerateMachineCodeOrAssemblyImpl(

ci.diagnostics(), *TM, action, *llvmModule,

ci.IsOutputStreamNull() ? *os : ci.GetOutputStream());

return;

}

} }

rovka

Not Done

Nit: I would move this above the other if, so it's closer to the EmitLL handling.

awarzynski (Author)

Done

Good point, moved!
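For readers trying to follow the resulting control flow through the flattened diff above, here is a rough standalone sketch of the order in which the new CodeGenAction::ExecuteAction appears to dispatch, as far as it can be read from the right-hand column. The enum and executeActionSteps are hypothetical stand-ins written for this example; the strings only name the steps, and the output-stream set-up is left out for brevity.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for the action enum; only the names come from the diff.
enum class BackendActionTy {
  Backend_EmitMLIR,
  Backend_EmitLL,
  Backend_EmitBC,
  Backend_EmitAssembly,
  Backend_EmitObj
};

// Returns the sequence of steps taken for a given action. `hasLLVMModule`
// models the case where the input was already an LLVM IR/BC file.
std::vector<std::string> executeActionSteps(BackendActionTy action,
                                            bool hasLLVMModule) {
  std::vector<std::string> steps;

  // MLIR output needs neither an LLVM module nor a target machine.
  if (action == BackendActionTy::Backend_EmitMLIR) {
    steps.push_back("print MLIR");
    return steps;
  }

  // Generate an LLVM module only if it is not already present.
  if (!hasLLVMModule)
    steps.push_back("GenerateLLVMIR");

  if (action == BackendActionTy::Backend_EmitLL) {
    steps.push_back("print LLVM IR");
    return steps;
  }

  // Everything below needs a target machine.
  steps.push_back("SetUpTargetMachine");

  // Bitcode is handled first, so that it sits next to the EmitLL handling.
  if (action == BackendActionTy::Backend_EmitBC) {
    steps.push_back("GenerateLLVMBCImpl");
    return steps;
  }

  // Remaining actions: Backend_EmitAssembly and Backend_EmitObj.
  steps.push_back("GenerateMachineCodeOrAssemblyImpl");
  return steps;
}
```

The early returns keep each output kind self-contained: every action stops at the first stage that can produce its output, so the more expensive set-up is only reached by the actions that need it.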

void InitOnlyAction::ExecuteAction() {
CompilerInstance &ci = this->instance();
unsigned DiagID =
ci.diagnostics().getCustomDiagID(clang::DiagnosticsEngine::Warning,
"Use `-init-only` for testing purposes only");
ci.diagnostics().Report(DiagID);
}

flang/lib/FrontendTool/ExecuteCompilerInvocation.cpp

case ParseSyntaxOnly:
return std::make_unique<ParseSyntaxOnlyAction>();
case EmitMLIR:
return std::make_unique<EmitMLIRAction>();
case EmitLLVM:
return std::make_unique<EmitLLVMAction>();
case EmitLLVMBitcode:
return std::make_unique<EmitLLVMBitcodeAction>();
case EmitObj:
- return std::make_unique<BackendAction>(
- BackendAction::BackendActionTy::Backend_EmitObj);
+ return std::make_unique<EmitObjAction>();
case EmitAssembly:
- return std::make_unique<BackendAction>(
- BackendAction::BackendActionTy::Backend_EmitAssembly);
+ return std::make_unique<EmitAssemblyAction>();
case DebugUnparse:
return std::make_unique<DebugUnparseAction>();
case DebugUnparseNoSema:
return std::make_unique<DebugUnparseNoSemaAction>();
case DebugUnparseWithSymbols:
return std::make_unique<DebugUnparseWithSymbolsAction>();
case DebugDumpSymbols:
return std::make_unique<DebugDumpSymbolsAction>();

Revision Contents

Diff 427307