This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
Common/
-
TargetOptionsCommandFlags.cpp
-
ELF/
-
LTO.cpp
-
include/lld/Common/
-
lld/
-
Common/
-
TargetOptionsCommandFlags.h
-
test/ELF/lto/
-
ELF/
-
lto/
1
cpu-string.ll

Differential D42183

[LLD] [ELF] Pass CPU string to LTO pipeline.
Needs ReviewPublic

Authored by pbhatu on Jan 17 2018, 7:09 AM.

Download Raw Diff

Details

Reviewers

ruiu
davide
• espindola

Summary

Previously an empty CPU string was passed to the LTO engine which
resulted in a generic CPU for which certain features like NOPL
were disabled. This fixes that.

Diff Detail

Event Timeline

pbhatu created this revision.Jan 17 2018, 7:09 AM

Herald added subscribers: llvm-commits, inglorion, emaste. · View Herald TranscriptJan 17 2018, 7:09 AM

GGanesh added a subscriber: GGanesh.Jan 17 2018, 7:25 AM

You should add a testcase as Rafael pointed out. Overall it looks fine.
We used mllvm in lld to pass this kind of options to the backend but this seems justifiable.
BTW, if you're using lld or the gold plugin you might consider implementing the linker counterpart as well (if you have your proprietary linker, just ignore this last bit).

This revision now requires changes to proceed.Jan 17 2018, 10:22 AM

I added a test case. Since znver1 supports NOPW, it should be generated when we specify it as the CPU.

In D42183#978862, @davide wrote:

You should add a testcase as Rafael pointed out. Overall it looks fine.
We used mllvm in lld to pass this kind of options to the backend but this seems justifiable.
BTW, if you're using lld or the gold plugin you might consider implementing the linker counterpart as well (if you have your proprietary linker, just ignore this last bit).

Thanks for the prompt review Davide and Rafael!

I don't think the mllvm option works as intended. The specified CPU(through mllvm) is definitely being used in the middle end for optimizations during LTO. However, it is not picked up when we initialize the target machine for the backend. This patch tries to fix that.

Also, I'm a bit confused by your last statement. What do you mean by implementing the linker counterpart as well? I think I'm missing the larger picture.

Addressed comments: clang-formatted the patch and removed the -m option from the test case.

I do not have commit access, could you please do it instead?

As of r322227 I would have though the target-cpu attribute would control the NOPL creation and override the CPU string.

Committed as r323801.

craig.topper added inline comments.Jan 30 2018, 11:02 AM

test/ELF/lto/cpu-string.ll
10	This CHECK line doesn't prove anything. nop would nopw. This should be a CHECK-NOT nopw.

• espindola resigned from this revision.Feb 26 2018, 2:44 PM

Herald added a subscriber: arichardson. · View Herald TranscriptFeb 26 2018, 2:44 PM

Revision Contents

Path

Size

Common/

TargetOptionsCommandFlags.cpp

2 lines

ELF/

LTO.cpp

1 line

include/

lld/

Common/

TargetOptionsCommandFlags.h

1 line

test/

ELF/

lto/

cpu-string.ll

23 lines

Diff 131927

Common/TargetOptionsCommandFlags.cpp

	Show All 24 Lines
	// would lead to multiple definitions of the command line flags.			// would lead to multiple definitions of the command line flags.
	llvm::TargetOptions lld::InitTargetOptionsFromCodeGenFlags() {			llvm::TargetOptions lld::InitTargetOptionsFromCodeGenFlags() {
	return ::InitTargetOptionsFromCodeGenFlags();			return ::InitTargetOptionsFromCodeGenFlags();
	}			}

	llvm::Optional<llvm::CodeModel::Model> lld::GetCodeModelFromCMModel() {			llvm::Optional<llvm::CodeModel::Model> lld::GetCodeModelFromCMModel() {
	return getCodeModel();			return getCodeModel();
	}			}

				std::string lld::GetCPUStr() { return ::getCPUStr(); }

ELF/LTO.cpp

Show First 20 Lines • Show All 81 Lines • ▼ Show 20 Lines	static std::unique_ptr<lto::LTO> createLTO() {
else if (Config->Pic)		else if (Config->Pic)
Conf.RelocModel = Reloc::PIC_;		Conf.RelocModel = Reloc::PIC_;
else		else
Conf.RelocModel = Reloc::Static;		Conf.RelocModel = Reloc::Static;
Conf.CodeModel = GetCodeModelFromCMModel();		Conf.CodeModel = GetCodeModelFromCMModel();
Conf.DisableVerify = Config->DisableVerify;		Conf.DisableVerify = Config->DisableVerify;
Conf.DiagHandler = diagnosticHandler;		Conf.DiagHandler = diagnosticHandler;
Conf.OptLevel = Config->LTOO;		Conf.OptLevel = Config->LTOO;
		Conf.CPU = GetCPUStr();

// Set up a custom pipeline if we've been asked to.		// Set up a custom pipeline if we've been asked to.
Conf.OptPipeline = Config->LTONewPmPasses;		Conf.OptPipeline = Config->LTONewPmPasses;
Conf.AAPipeline = Config->LTOAAPipeline;		Conf.AAPipeline = Config->LTOAAPipeline;

// Set up optimization remarks if we've been asked to.		// Set up optimization remarks if we've been asked to.
Conf.RemarksFilename = Config->OptRemarksFilename;		Conf.RemarksFilename = Config->OptRemarksFilename;
Conf.RemarksWithHotness = Config->OptRemarksWithHotness;		Conf.RemarksWithHotness = Config->OptRemarksWithHotness;
▲ Show 20 Lines • Show All 126 Lines • Show Last 20 Lines

include/lld/Common/TargetOptionsCommandFlags.h

	Show All 12 Lines

	#include "llvm/ADT/Optional.h"			#include "llvm/ADT/Optional.h"
	#include "llvm/Support/CodeGen.h"			#include "llvm/Support/CodeGen.h"
	#include "llvm/Target/TargetOptions.h"			#include "llvm/Target/TargetOptions.h"

	namespace lld {			namespace lld {
	llvm::TargetOptions InitTargetOptionsFromCodeGenFlags();			llvm::TargetOptions InitTargetOptionsFromCodeGenFlags();
	llvm::Optional<llvm::CodeModel::Model> GetCodeModelFromCMModel();			llvm::Optional<llvm::CodeModel::Model> GetCodeModelFromCMModel();
				std::string GetCPUStr();
	}			}

test/ELF/lto/cpu-string.ll

This file was added.

				; REQUIRES: x86
				; RUN: llvm-as %s -o %t.o

				; RUN: ld.lld %t.o -o %t.so -shared
				; RUN: llvm-objdump -d -section=".text" -no-leading-addr -no-show-raw-insn %t.so \| FileCheck %s

				; RUN: ld.lld -mllvm -mcpu=znver1 %t.o -o %m.so -shared
				; RUN: llvm-objdump -d -section=".text" -no-leading-addr -no-show-raw-insn %m.so \| FileCheck -check-prefix=ZNVER1 %s

				; CHECK: nop
				craig.topperUnsubmitted Not Done Reply Inline Actions This CHECK line doesn't prove anything. nop would nopw. This should be a CHECK-NOT nopw. craig.topper: This CHECK line doesn't prove anything. nop would nopw. This should be a CHECK-NOT nopw.

				; ZNVER1: nopw{{.*}}

				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				define void @foo() #0 {
				entry:
				call void asm sideeffect ".p2align 4, 0x90", "~{dirflag},~{fpsr},~{flags}"()
				ret void
				}

				attributes #0 = { "no-frame-pointer-elim"="true" "target-cpu"="znver1"}