Details

Reviewers

dmgreen
samparker
simon_tatham
ostannard

Commits

rGf95e6c065342: [ARM] Allow "-march=foo+fp" to vary with foo
rGa1bb4fb79d86: [ARM] Allow "-march=foo+fp" to vary with foo
rL362601: [ARM] Allow "-march=foo+fp" to vary with foo
rC362601: [ARM] Allow "-march=foo+fp" to vary with foo
rL362600: [ARM] Allow "-march=foo+fp" to vary with foo

Summary

Now, when clang processes an argument of the form "-march=foo+x+y+z",
then instead of calling getArchExtFeature() for each of the extension
names "x", "y", "z" and appending the returned string to its list of
low-level subtarget features, it will call appendArchExtFeatures()
which does the appending itself.

The difference is that appendArchExtFeatures can add _more_ than one
low-level feature name to the output feature list if it has to, and
also, it gets told some information about what base architecture and
CPU the extension is going to go with, which means that "+fp" can now
mean something different for different CPUs. Namely, "+fp" now selects
whatever the _default_ FPU is for the selected CPU and/or
architecture, as defined in the ARM_ARCH or ARM_CPU_NAME macros in
ARMTargetParser.def.
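
To illustrate the difference in shape between the two interfaces, here is a minimal self-contained sketch. It is not the LLVM code: the function names, the architecture names and the `toy-` feature strings are all hypothetical, and the table stands in for the defaults that really live in ARMTargetParser.def. It only shows that an "append" style interface can emit several features and can let "fp" depend on the base architecture.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Hypothetical per-architecture default FPU features (toy data, not the
// contents of ARMTargetParser.def).
static const std::map<std::string, std::vector<std::string>> ToyDefaultFPU = {
    {"toy-arch-sp", {"+toy-fpu", "-toy-fp64"}},
    {"toy-arch-dp", {"+toy-fpu", "+toy-fp64"}},
    {"toy-arch-nofpu", {}},
};

// Old shape: one extension name maps to at most one feature string,
// with no knowledge of the architecture it is attached to.
std::string toyGetArchExtFeature(const std::string &Ext) {
  if (Ext == "dsp")
    return "+dsp";
  if (Ext == "nodsp")
    return "-dsp";
  return ""; // unknown, or needs architecture context
}

// New shape: the function appends to the feature list itself, so it may add
// more than one entry, and it is told which architecture is in use.
bool toyAppendArchExtFeatures(const std::string &Arch, const std::string &Ext,
                              std::vector<std::string> &Features) {
  std::string Standard = toyGetArchExtFeature(Ext);
  if (!Standard.empty()) {
    Features.push_back(Standard);
    return true;
  }
  if (Ext == "fp") {
    auto It = ToyDefaultFPU.find(Arch);
    if (It == ToyDefaultFPU.end() || It->second.empty())
      return false; // no default FPU for this architecture
    Features.insert(Features.end(), It->second.begin(), It->second.end());
    return true;
  }
  return false; // unrecognised extension name
}
```

With this toy table, "fp" on `toy-arch-sp` and on `toy-arch-dp` appends different feature sets, and on `toy-arch-nofpu` it is rejected.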

On the clang side, I adjust DecodeARMFeatures to call the new
appendArchExtFeatures function in place of getArchExtFeature. This
means DecodeARMFeatures needs to be passed a CPU name and an ArchKind,
which meant changing its call sites to make those available, and also
sawing getLLVMArchSuffixForARM in half so that you can get an ArchKind
enum value out of it instead of a string.
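
The "sawing in half" refactoring can be sketched in isolation as follows. This is only an illustration of the pattern, with a hypothetical enum, hypothetical CPU names and hypothetical suffix strings; the real functions take a CPU, an Arch and a Triple and consult the LLVM target parser.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for llvm::ARM::ArchKind.
enum class ToyArchKind { Invalid, V7, V8 };

// First half: compute the enum value. Callers that need to pass an
// architecture kind to other functions can now call this directly.
ToyArchKind toyArchKindFor(const std::string &CPU) {
  if (CPU == "toy-cpu-a")
    return ToyArchKind::V7;
  if (CPU == "toy-cpu-b")
    return ToyArchKind::V8;
  return ToyArchKind::Invalid;
}

// Second half: the original string-returning function becomes a thin
// wrapper, so its existing callers see no change in behaviour.
std::string toyArchSuffixFor(const std::string &CPU) {
  switch (toyArchKindFor(CPU)) {
  case ToyArchKind::V7:
    return "v7";
  case ToyArchKind::V8:
    return "v8";
  case ToyArchKind::Invalid:
    break;
  }
  return ""; // empty string signals an invalid architecture
}
```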

Also, I add support here for the extension name "+fp.dp", which will
automatically look through the FPU list for something that looks just
like the default FPU except for also supporting double precision.
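
The lookup described above can be sketched without any LLVM dependencies. The struct, the table layout and the `InvalidFPU` sentinel below are assumptions made for illustration; only the three restriction values (None, D16, SP_D16) are taken from the patch itself.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Restriction values as named in the patch's comments.
enum class Restriction { None, D16, SP_D16 };

// Hypothetical, simplified FPU table entry.
struct ToyFPU {
  int Version;
  bool Neon;
  Restriction Restr;
};

constexpr int InvalidFPU = -1; // hypothetical "nothing found" sentinel

// Return the index of an FPU that matches Table[Input] in every field except
// that SP_D16 is replaced by D16, or InvalidFPU if there is none.
int toyFindDoublePrecisionFPU(const std::vector<ToyFPU> &Table, int Input) {
  assert(Input >= 0 && static_cast<std::size_t>(Input) < Table.size());
  const ToyFPU &In = Table[static_cast<std::size_t>(Input)];

  // Only a single-precision-only FPU can be upgraded.
  if (In.Restr != Restriction::SP_D16)
    return InvalidFPU;

  for (std::size_t I = 0; I < Table.size(); ++I) {
    const ToyFPU &C = Table[I];
    if (C.Version == In.Version && C.Neon == In.Neon &&
        C.Restr == Restriction::D16)
      return static_cast<int>(I);
  }
  return InvalidFPU;
}
```

Returning a sentinel in both the "already double precision" and the "no match" case mirrors the behaviour discussed later in the review, where either outcome makes "fp.dp" an invalid extension name.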

Diff Detail

Event Timeline

simon_tatham created this revision. Apr 15 2019, 5:57 AM

Herald added projects: Restricted Project, Restricted Project. Apr 15 2019, 5:57 AM

Herald added subscribers: llvm-commits, cfe-commits, hiraditya and 2 others.

Harbormaster completed remote builds in B30540: Diff 195155. Apr 15 2019, 5:58 AM

simon_tatham added a parent revision: D60696: [TableGen] New default operand "undef_tied_input". Apr 15 2019, 6:10 AM

simon_tatham added a child revision: D60698: [ARM] add target arch definitions for 8.1-M and MVE.

This needs some tests. I'm also not quite sure when you'd use bare "+fp", if it's the default anyway.

llvm/lib/Support/ARMTargetParser.cpp
487–490	I think `isNegated` is probably more in line with existing naming.
515–518	Doesn't this mean `+nofp.dp` disables all floats? That seems surprising behaviour to me, but I can't find any existing tests covering it.
527–528	Doesn't this silently disable the FPU entirely if we decide "fp.dp" is useless? That seems unlikely to be what a user wants, especially without a diagnostic. Could you also expand on the comment a bit more. I had to look up exactly what FPURestrictions existed to get this, and I'm not even 100% sure I'm right now.

The aim of this change is that it will apply to the v8.1-M (mainline) architecture introduced in D60698, in which +fp won't be the default: -march=armv8.1m.main by itself gives you the base 8.1-M architecture without any FP, -march=armv8.1m.main+fp gives you the optional single-precision FP extension on top of that, and +fp.dp gives you double precision as well.

llvm/lib/Support/ARMTargetParser.cpp
487–490	Hmmm. I thought that would be a confusing name because it hides the fact that the function strips off the `no` prefix. (The use of 'was' was intended to hint that by the time the function returns, it's not true any more!) Perhaps `stripNegationPrefix` (returning bool to indicate success)?
515–518	Hmmm, that's a good point. What would a user expect in that situation? If double-precision FP was the default for that architecture and a single-precision version existed, then perhaps `nofp.dp` should fall back to that, but what if it's double or nothing?
527–528	I don't think it silently disables it, does it? Returning false from this function is a failure indication that ends up back in `checkARMArchName` in `clang/lib/Driver/ToolChains/Arch/ARM.cpp`, which will generate a diagnostic. For example, if I try `-march=armv6m+fp.dp` then I see `error: the clang compiler does not support '-march=armv6m+fp.dp'`

t.p.northover added inline comments. Apr 16 2019, 3:15 AM

llvm/lib/Support/ARMTargetParser.cpp
487–490	Ah yes, I see. I think your alternative is probably better.
515–518	I think I'd go for a diagnostic in that case. There's already a way to strip out the FPU then (`+nofp`).
527–528	Ah, I missed the only way return true could happen and assumed the return value was vestigial. Sorry.
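
The naming point discussed for lines 487–490 is that the helper both reports and mutates. A minimal sketch of that contract, using `std::string` instead of LLVM's `StringRef` and a hypothetical function name, might look like this:

```cpp
#include <cassert>
#include <string>

// Sketch only: strips a leading "no" from Name in place and reports whether
// it did so. After a true return, Name no longer carries the prefix, which
// is why a pure-predicate name such as "isNegated" would be misleading.
bool stripNegationPrefixSketch(std::string &Name) {
  if (Name.compare(0, 2, "no") == 0) {
    Name.erase(0, 2);
    return true;
  }
  return false;
}
```

Note that a helper of this shape strips "no" from any name that starts with it, so it relies on no positive extension name beginning with those two letters.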

dnsampaio added a subscriber: dnsampaio. Apr 18 2019, 3:17 AM

SjoerdMeijer mentioned this in rC362100: Follow up of r362096. May 30 2019, 11:10 AM

SjoerdMeijer mentioned this in rL362100: Follow up of r362096.

SjoerdMeijer mentioned this in rGd74c2131c31b: Follow up of r362096.

SjoerdMeijer commandeered this revision. May 31 2019, 1:10 AM

SjoerdMeijer edited reviewers, added: simon_tatham; removed: SjoerdMeijer.

This addresses @t.p.northover comment.

This still needs tests adding.

This revision now requires changes to proceed. May 31 2019, 6:32 AM

Ah yes, the schoolboy error! ;-) Actually, there was a test, but in a different patch; I will move it here.

dnsampaio removed a subscriber: dnsampaio. May 31 2019, 7:08 AM

This time with tests.

ostannard added inline comments. Jun 3 2019, 3:03 AM

llvm/lib/Support/ARMTargetParser.cpp
551	Could you also add tests for the error cases here? I think these are:
+fp.dp, but the FPU is already double-precision
+fp.dp, but no double-precision FPU exists (are there any FPUs which cause this?)
+[no]fp or +[no]fp.dp for a CPU/arch which doesn't have any FPUs.
I also don't see any tests for the negated forms of either feature.

Hi Oliver, thanks for your comments!

This was the easy one, they have been added:

I also don't see any tests for the negated forms of either feature.

The trouble began with this:

+fp.dp, but the FPU is already double-precision
+fp.dp, but no double-precision FPU exists (are there any FPUs which cause this?)
+[no]fp or +[no]fp.dp for a CPU/arch which doesn't have any FPUs.

Because I found that basically none of this worked. The main reason was that we were always passing "generic" as the CPU name. So that we at least have a chance of seeing a sensible CPU name, I have swapped the order of parsing -march and -mcpu: we now parse -mcpu first and pass the result to checkARMArchName, which will eventually call appendArchExtFeatures. I think that makes more sense when we use the CPU name to query getDefaultFPU.

Then, about the fancier diagnostics (e.g. "fp.dp, but the FPU is already double-precision"): I've removed any attempt to emit clever diagnostics. I don't think that, in general, we provide this level of service. That is, we would need to do a lot more work here to avoid a meaningless, confusing, and thus useless "-march=... not supported" error message when +fp.dp is given on -march and, e.g., the CPU already enables it.

Fair enough, I don't think we currently try to diagnose any other invalid combinations of features. LGTM.

This revision is now accepted and ready to land. Jun 5 2019, 3:53 AM

Closed by commit rL362600: [ARM] Allow "-march=foo+fp" to vary with foo (authored by SjoerdMeijer). Jun 5 2019, 6:09 AM

This revision was automatically updated to reflect the committed changes.

Herald added a subscriber: kristina. Jun 5 2019, 6:09 AM

Diff 202433

clang/lib/Driver/ToolChains/Arch/ARM.h

	//===--- ARM.h - ARM-specific (not AArch64) Tool Helpers --------- C++ --===//			//===--- ARM.h - ARM-specific (not AArch64) Tool Helpers --------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_CLANG_LIB_DRIVER_TOOLCHAINS_ARCH_ARM_H			#ifndef LLVM_CLANG_LIB_DRIVER_TOOLCHAINS_ARCH_ARM_H
	#define LLVM_CLANG_LIB_DRIVER_TOOLCHAINS_ARCH_ARM_H			#define LLVM_CLANG_LIB_DRIVER_TOOLCHAINS_ARCH_ARM_H

	#include "clang/Driver/ToolChain.h"			#include "clang/Driver/ToolChain.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/ADT/Triple.h"			#include "llvm/ADT/Triple.h"
	#include "llvm/Option/Option.h"			#include "llvm/Option/Option.h"
				#include "llvm/Support/TargetParser.h"
	#include <string>			#include <string>
	#include <vector>			#include <vector>

	namespace clang {			namespace clang {
	namespace driver {			namespace driver {
	namespace tools {			namespace tools {
	namespace arm {			namespace arm {

	std::string getARMTargetCPU(StringRef CPU, llvm::StringRef Arch,			std::string getARMTargetCPU(StringRef CPU, llvm::StringRef Arch,
	const llvm::Triple &Triple);			const llvm::Triple &Triple);
	const std::string getARMArch(llvm::StringRef Arch, const llvm::Triple &Triple);			const std::string getARMArch(llvm::StringRef Arch, const llvm::Triple &Triple);
	StringRef getARMCPUForMArch(llvm::StringRef Arch, const llvm::Triple &Triple);			StringRef getARMCPUForMArch(llvm::StringRef Arch, const llvm::Triple &Triple);
				llvm::ARM::ArchKind getLLVMArchKindForARM(StringRef CPU, StringRef Arch,
				const llvm::Triple &Triple);
	StringRef getLLVMArchSuffixForARM(llvm::StringRef CPU, llvm::StringRef Arch,			StringRef getLLVMArchSuffixForARM(llvm::StringRef CPU, llvm::StringRef Arch,
	const llvm::Triple &Triple);			const llvm::Triple &Triple);

	void appendBE8LinkFlag(const llvm::opt::ArgList &Args,			void appendBE8LinkFlag(const llvm::opt::ArgList &Args,
	llvm::opt::ArgStringList &CmdArgs,			llvm::opt::ArgStringList &CmdArgs,
	const llvm::Triple &Triple);			const llvm::Triple &Triple);
	enum class ReadTPMode {			enum class ReadTPMode {
	Invalid,			Invalid,
	Show All 31 Lines

clang/lib/Driver/ToolChains/Arch/ARM.cpp

Show First 20 Lines • Show All 66 Lines • ▼ Show 20 Lines	static void getARMFPUFeatures(const Driver &D, const Arg *A,
std::vector<StringRef> &Features) {		std::vector<StringRef> &Features) {
unsigned FPUID = llvm::ARM::parseFPU(FPU);		unsigned FPUID = llvm::ARM::parseFPU(FPU);
if (!llvm::ARM::getFPUFeatures(FPUID, Features))		if (!llvm::ARM::getFPUFeatures(FPUID, Features))
D.Diag(clang::diag::err_drv_clang_unsupported) << A->getAsString(Args);		D.Diag(clang::diag::err_drv_clang_unsupported) << A->getAsString(Args);
}		}

// Decode ARM features from string like +[no]featureA+[no]featureB+...		// Decode ARM features from string like +[no]featureA+[no]featureB+...
static bool DecodeARMFeatures(const Driver &D, StringRef text,		static bool DecodeARMFeatures(const Driver &D, StringRef text,
		StringRef CPU, llvm::ARM::ArchKind ArchKind,
std::vector<StringRef> &Features) {		std::vector<StringRef> &Features) {
SmallVector<StringRef, 8> Split;		SmallVector<StringRef, 8> Split;
text.split(Split, StringRef("+"), -1, false);		text.split(Split, StringRef("+"), -1, false);

for (StringRef Feature : Split) {		for (StringRef Feature : Split) {
StringRef FeatureName = llvm::ARM::getArchExtFeature(Feature);		if (!appendArchExtFeatures(CPU, ArchKind, Feature, Features))
if (!FeatureName.empty())
Features.push_back(FeatureName);
else
return false;		return false;
}		}
return true;		return true;
}		}

static void DecodeARMFeaturesFromCPU(const Driver &D, StringRef CPU,		static void DecodeARMFeaturesFromCPU(const Driver &D, StringRef CPU,
std::vector<StringRef> &Features) {		std::vector<StringRef> &Features) {
CPU = CPU.split("+").first;		CPU = CPU.split("+").first;
Show All 9 Lines
// to handle -march=native correctly.		// to handle -march=native correctly.
static void checkARMArchName(const Driver &D, const Arg *A, const ArgList &Args,		static void checkARMArchName(const Driver &D, const Arg *A, const ArgList &Args,
llvm::StringRef ArchName,		llvm::StringRef ArchName,
std::vector<StringRef> &Features,		std::vector<StringRef> &Features,
const llvm::Triple &Triple) {		const llvm::Triple &Triple) {
std::pair<StringRef, StringRef> Split = ArchName.split("+");		std::pair<StringRef, StringRef> Split = ArchName.split("+");

std::string MArch = arm::getARMArch(ArchName, Triple);		std::string MArch = arm::getARMArch(ArchName, Triple);
if (llvm::ARM::parseArch(MArch) == llvm::ARM::ArchKind::INVALID ||		llvm::ARM::ArchKind ArchKind = llvm::ARM::parseArch(MArch);
(Split.second.size() && !DecodeARMFeatures(D, Split.second, Features)))		if (ArchKind == llvm::ARM::ArchKind::INVALID ||
		(Split.second.size() && !DecodeARMFeatures(
		D, Split.second, "generic", ArchKind, Features)))
D.Diag(clang::diag::err_drv_clang_unsupported) << A->getAsString(Args);		D.Diag(clang::diag::err_drv_clang_unsupported) << A->getAsString(Args);
}		}

// Check -mcpu=. Needs ArchName to handle -mcpu=generic.		// Check -mcpu=. Needs ArchName to handle -mcpu=generic.
static void checkARMCPUName(const Driver &D, const Arg *A, const ArgList &Args,		static void checkARMCPUName(const Driver &D, const Arg *A, const ArgList &Args,
llvm::StringRef CPUName, llvm::StringRef ArchName,		llvm::StringRef CPUName, llvm::StringRef ArchName,
std::vector<StringRef> &Features,		std::vector<StringRef> &Features,
const llvm::Triple &Triple) {		const llvm::Triple &Triple) {
std::pair<StringRef, StringRef> Split = CPUName.split("+");		std::pair<StringRef, StringRef> Split = CPUName.split("+");

std::string CPU = arm::getARMTargetCPU(CPUName, ArchName, Triple);		std::string CPU = arm::getARMTargetCPU(CPUName, ArchName, Triple);
if (arm::getLLVMArchSuffixForARM(CPU, ArchName, Triple).empty() ||		llvm::ARM::ArchKind ArchKind =
(Split.second.size() && !DecodeARMFeatures(D, Split.second, Features)))		arm::getLLVMArchKindForARM(CPU, ArchName, Triple);
		if (ArchKind == llvm::ARM::ArchKind::INVALID ||
		(Split.second.size() && !DecodeARMFeatures(
		D, Split.second, CPU, ArchKind, Features)))
D.Diag(clang::diag::err_drv_clang_unsupported) << A->getAsString(Args);		D.Diag(clang::diag::err_drv_clang_unsupported) << A->getAsString(Args);
}		}

bool arm::useAAPCSForMachO(const llvm::Triple &T) {		bool arm::useAAPCSForMachO(const llvm::Triple &T) {
// The backend is hardwired to assume AAPCS for M-class processors, ensure		// The backend is hardwired to assume AAPCS for M-class processors, ensure
// the frontend matches that.		// the frontend matches that.
return T.getEnvironment() == llvm::Triple::EABI ||		return T.getEnvironment() == llvm::Triple::EABI ||
T.getOS() == llvm::Triple::UnknownOS || isARMMProfile(T);		T.getOS() == llvm::Triple::UnknownOS || isARMMProfile(T);
▲ Show 20 Lines • Show All 483 Lines • ▼ Show 20 Lines	if (MCPU == "native")
return llvm::sys::getHostCPUName();		return llvm::sys::getHostCPUName();
else		else
return MCPU;		return MCPU;
}		}

return getARMCPUForMArch(Arch, Triple);		return getARMCPUForMArch(Arch, Triple);
}		}

/// getLLVMArchSuffixForARM - Get the LLVM arch name to use for a particular		/// getLLVMArchSuffixForARM - Get the LLVM ArchKind value to use for a
/// CPU (or Arch, if CPU is generic).		/// particular CPU (or Arch, if CPU is generic). This is needed to
// FIXME: This is redundant with -mcpu, why does LLVM use this.		/// pass to functions like llvm::ARM::getDefaultFPU which need an
StringRef arm::getLLVMArchSuffixForARM(StringRef CPU, StringRef Arch,		/// ArchKind as well as a CPU name.
		llvm::ARM::ArchKind arm::getLLVMArchKindForARM(StringRef CPU, StringRef Arch,
const llvm::Triple &Triple) {		const llvm::Triple &Triple) {
llvm::ARM::ArchKind ArchKind;		llvm::ARM::ArchKind ArchKind;
if (CPU == "generic") {		if (CPU == "generic") {
std::string ARMArch = tools::arm::getARMArch(Arch, Triple);		std::string ARMArch = tools::arm::getARMArch(Arch, Triple);
ArchKind = llvm::ARM::parseArch(ARMArch);		ArchKind = llvm::ARM::parseArch(ARMArch);
if (ArchKind == llvm::ARM::ArchKind::INVALID)		if (ArchKind == llvm::ARM::ArchKind::INVALID)
// In case of generic Arch, i.e. "arm",		// In case of generic Arch, i.e. "arm",
// extract arch from default cpu of the Triple		// extract arch from default cpu of the Triple
ArchKind = llvm::ARM::parseCPUArch(Triple.getARMCPUForArch(ARMArch));		ArchKind = llvm::ARM::parseCPUArch(Triple.getARMCPUForArch(ARMArch));
} else {		} else {
// FIXME: horrible hack to get around the fact that Cortex-A7 is only an		// FIXME: horrible hack to get around the fact that Cortex-A7 is only an
// armv7k triple if it's actually been specified via "-arch armv7k".		// armv7k triple if it's actually been specified via "-arch armv7k".
ArchKind = (Arch == "armv7k" || Arch == "thumbv7k")		ArchKind = (Arch == "armv7k" || Arch == "thumbv7k")
? llvm::ARM::ArchKind::ARMV7K		? llvm::ARM::ArchKind::ARMV7K
: llvm::ARM::parseCPUArch(CPU);		: llvm::ARM::parseCPUArch(CPU);
}		}
		return ArchKind;
		}

		/// getLLVMArchSuffixForARM - Get the LLVM arch name to use for a particular
		/// CPU (or Arch, if CPU is generic).
		// FIXME: This is redundant with -mcpu, why does LLVM use this.
		StringRef arm::getLLVMArchSuffixForARM(StringRef CPU, StringRef Arch,
		const llvm::Triple &Triple) {
		llvm::ARM::ArchKind ArchKind = getLLVMArchKindForARM(CPU, Arch, Triple);
if (ArchKind == llvm::ARM::ArchKind::INVALID)		if (ArchKind == llvm::ARM::ArchKind::INVALID)
return "";		return "";
return llvm::ARM::getSubArch(ArchKind);		return llvm::ARM::getSubArch(ArchKind);
}		}

void arm::appendBE8LinkFlag(const ArgList &Args, ArgStringList &CmdArgs,		void arm::appendBE8LinkFlag(const ArgList &Args, ArgStringList &CmdArgs,
const llvm::Triple &Triple) {		const llvm::Triple &Triple) {
if (Args.hasArg(options::OPT_r))		if (Args.hasArg(options::OPT_r))
return;		return;

// ARMv7 (and later) and ARMv6-M do not support BE-32, so instruct the linker		// ARMv7 (and later) and ARMv6-M do not support BE-32, so instruct the linker
// to generate BE-8 executables.		// to generate BE-8 executables.
if (arm::getARMSubArchVersionNumber(Triple) >= 7 || arm::isARMMProfile(Triple))		if (arm::getARMSubArchVersionNumber(Triple) >= 7 || arm::isARMMProfile(Triple))
CmdArgs.push_back("--be8");		CmdArgs.push_back("--be8");
}		}

clang/test/Driver/armv8.1m.main.c

	// RUN: %clang -target arm-arm-none-eabi -march=armv8.1-m.main+dsp -### %s 2> %t			// RUN: %clang -target arm-arm-none-eabi -march=armv8.1-m.main+dsp -### %s 2> %t
	// RUN: FileCheck --check-prefix=CHECK-DSP < %t %s			// RUN: FileCheck --check-prefix=CHECK-DSP < %t %s
	// CHECK-DSP: "-target-feature" "+dsp"			// CHECK-DSP: "-target-feature" "+dsp"

				// RUN: %clang -target arm-arm-none-eabi -march=armv8.1-m.main+fp -### %s 2> %t
				// RUN: FileCheck --check-prefix=CHECK-FP < %t %s
				// CHECK-FP: "-target-feature" "+fp-armv8"
				// CHECK-FP-NOT: "-target-feature" "+fp64"
				// CHECK-FP-NOT: "-target-feature" "+d32"
				// CHECK-FP: "-target-feature" "+fullfp16"

				// RUN: %clang -target arm-arm-none-eabi -march=armv8.1-m.main+fp.dp -### %s 2> %t
				// RUN: FileCheck --check-prefix=CHECK-FPDP < %t %s
				// CHECK-FPDP: "-target-feature" "+fp-armv8"
				// CHECK-FPDP: "-target-feature" "+fullfp16"
				// CHECK-FPDP: "-target-feature" "+fp64"
				// CHECK-FPDP-NOT: "-target-feature" "+d32"

	// RUN: %clang -target arm-arm-none-eabi -march=armv8.1-m.main+mve -### %s 2> %t			// RUN: %clang -target arm-arm-none-eabi -march=armv8.1-m.main+mve -### %s 2> %t
	// RUN: FileCheck --check-prefix=CHECK-MVE < %t %s			// RUN: FileCheck --check-prefix=CHECK-MVE < %t %s
	// CHECK-MVE: "-target-feature" "+mve"			// CHECK-MVE: "-target-feature" "+mve"

	// RUN: %clang -target arm-arm-none-eabi -march=armv8.1-m.main+mve.fp -### %s 2> %t			// RUN: %clang -target arm-arm-none-eabi -march=armv8.1-m.main+mve.fp -### %s 2> %t
	// RUN: FileCheck --check-prefix=CHECK-MVEFP < %t %s			// RUN: FileCheck --check-prefix=CHECK-MVEFP < %t %s
	// CHECK-MVEFP: "-target-feature" "+mve.fp"			// CHECK-MVEFP: "-target-feature" "+mve.fp"
	// CHECK-MVEFP-NOT: "-target-feature" "+fp64"			// CHECK-MVEFP-NOT: "-target-feature" "+fp64"

				// RUN: %clang -target arm-arm-none-eabi -march=armv8.1-m.main+mve.fp+fp.dp -### %s 2> %t
				// RUN: FileCheck --check-prefix=CHECK-MVEFP_DP < %t %s
				// CHECK-MVEFP_DP: "-target-feature" "+mve.fp"
				// CHECK-MVEFP_DP: "-target-feature" "+fp64"

	double foo (double a) { return a; }			double foo (double a) { return a; }

clang/test/Driver/armv8.1m.main.s

	# REQUIRES: arm-registered-target			# REQUIRES: arm-registered-target
	# RUN: not %clang -c -target arm-none-none-eabi -march=armv8-m.main -o /dev/null %s 2>%t			# RUN: not %clang -c -target arm-none-none-eabi -march=armv8-m.main -o /dev/null %s 2>%t
	# RUN: FileCheck --check-prefix=ERROR-V8M < %t %s			# RUN: FileCheck --check-prefix=ERROR-V8M < %t %s
	# RUN: not %clang -c -target arm-none-none-eabi -march=armv8.1-m.main -o /dev/null %s 2>%t			# RUN: not %clang -c -target arm-none-none-eabi -march=armv8.1-m.main -o /dev/null %s 2>%t
	# RUN: FileCheck --check-prefix=ERROR-V81M < %t %s			# RUN: FileCheck --check-prefix=ERROR-V81M < %t %s
	# RUN: not %clang -c -target arm-none-none-eabi -march=armv8.1-m.main+dsp -o /dev/null %s 2>%t			# RUN: not %clang -c -target arm-none-none-eabi -march=armv8.1-m.main+dsp -o /dev/null %s 2>%t
	# RUN: FileCheck --check-prefix=ERROR-V81M_DSP < %t %s			# RUN: FileCheck --check-prefix=ERROR-V81M_DSP < %t %s
				# RUN: not %clang -c -target arm-none-none-eabi -march=armv8.1-m.main+fp -o /dev/null %s 2>%t
				# RUN: FileCheck --check-prefix=ERROR-V81M_FP < %t %s
				# RUN: not %clang -c -target arm-none-none-eabi -march=armv8.1-m.main+fp.dp -o /dev/null %s 2>%t
				# RUN: FileCheck --check-prefix=ERROR-V81M_FPDP < %t %s
	# RUN: not %clang -c -target arm-none-none-eabi -march=armv8.1-m.main+mve -o /dev/null %s 2>%t			# RUN: not %clang -c -target arm-none-none-eabi -march=armv8.1-m.main+mve -o /dev/null %s 2>%t
	# RUN: FileCheck --check-prefix=ERROR-V81M_MVE < %t %s			# RUN: FileCheck --check-prefix=ERROR-V81M_MVE < %t %s
				# RUN: not %clang -c -target arm-none-none-eabi -march=armv8.1-m.main+mve+fp -o /dev/null %s 2>%t
				# RUN: FileCheck --check-prefix=ERROR-V81M_MVE_FP < %t %s
	# RUN: not %clang -c -target arm-none-none-eabi -march=armv8.1-m.main+mve.fp -o /dev/null %s 2>%t			# RUN: not %clang -c -target arm-none-none-eabi -march=armv8.1-m.main+mve.fp -o /dev/null %s 2>%t
	# RUN: FileCheck --check-prefix=ERROR-V81M_MVEFP < %t %s			# RUN: FileCheck --check-prefix=ERROR-V81M_MVEFP < %t %s

	.syntax unified			.syntax unified
	.thumb			.thumb
	.text			.text

	csinc r0, r1, r2, eq			csinc r0, r1, r2, eq
	Show All 16 Lines
	# ERROR-V81M: :[[@LINE-2]]:1: error			# ERROR-V81M: :[[@LINE-2]]:1: error
	# ERROR-V81M_DSP: :[[@LINE-3]]:1: error			# ERROR-V81M_DSP: :[[@LINE-3]]:1: error
	# ERROR-V81M_MVE: :[[@LINE-4]]:1: error			# ERROR-V81M_MVE: :[[@LINE-4]]:1: error

	vcmp.f64 d0,d1			vcmp.f64 d0,d1
	# ERROR-V8M: :[[@LINE-1]]:1: error			# ERROR-V8M: :[[@LINE-1]]:1: error
	# ERROR-V81M: :[[@LINE-2]]:1: error			# ERROR-V81M: :[[@LINE-2]]:1: error
	# ERROR-V81M_DSP: :[[@LINE-3]]:1: error			# ERROR-V81M_DSP: :[[@LINE-3]]:1: error
	# ERROR-V81M_MVE: :[[@LINE-4]]:1: error			# ERROR-V81M_FP: :[[@LINE-4]]:1: error
	# ERROR-V81M_MVEFP: :[[@LINE-5]]:1: error			# ERROR-V81M_MVE: :[[@LINE-5]]:1: error
				# ERROR-V81M_MVE_FP: :[[@LINE-6]]:1: error
				# ERROR-V81M_MVEFP: :[[@LINE-7]]:1: error

	asrl r0, r1, r2			asrl r0, r1, r2
	# ERROR-V8M: :[[@LINE-1]]:1: error			# ERROR-V8M: :[[@LINE-1]]:1: error
	# ERROR-V81M: :[[@LINE-2]]:1: error			# ERROR-V81M: :[[@LINE-2]]:1: error
	# ERROR-V81M_DSP: :[[@LINE-3]]:1: error			# ERROR-V81M_DSP: :[[@LINE-3]]:1: error
				# ERROR-V81M_FP: :[[@LINE-4]]:1: error
				# ERROR-V81M_FPDP: :[[@LINE-5]]:1: error

	vcadd.i8 q0, q1, q2, #90			vcadd.i8 q0, q1, q2, #90
	# ERROR-V8M: :[[@LINE-1]]:1: error			# ERROR-V8M: :[[@LINE-1]]:1: error
	# ERROR-V81M: :[[@LINE-2]]:1: error			# ERROR-V81M: :[[@LINE-2]]:1: error
	# ERROR-V81M_DSP: :[[@LINE-3]]:1: error			# ERROR-V81M_DSP: :[[@LINE-3]]:1: error
				# ERROR-V81M_FP: :[[@LINE-4]]:1: error
				# ERROR-V81M_FPDP: :[[@LINE-5]]:1: error

llvm/include/llvm/Support/ARMTargetParser.h

Show First 20 Lines • Show All 234 Lines • ▼ Show 20 Lines	bool getExtensionFeatures(unsigned Extensions,
std::vector<StringRef> &Features);		std::vector<StringRef> &Features);

StringRef getArchName(ArchKind AK);		StringRef getArchName(ArchKind AK);
unsigned getArchAttr(ArchKind AK);		unsigned getArchAttr(ArchKind AK);
StringRef getCPUAttr(ArchKind AK);		StringRef getCPUAttr(ArchKind AK);
StringRef getSubArch(ArchKind AK);		StringRef getSubArch(ArchKind AK);
StringRef getArchExtName(unsigned ArchExtKind);		StringRef getArchExtName(unsigned ArchExtKind);
StringRef getArchExtFeature(StringRef ArchExt);		StringRef getArchExtFeature(StringRef ArchExt);
		bool appendArchExtFeatures(StringRef CPU, ARM::ArchKind AK, StringRef ArchExt,
		std::vector<StringRef> &Features);
StringRef getHWDivName(unsigned HWDivKind);		StringRef getHWDivName(unsigned HWDivKind);

// Information by Name		// Information by Name
unsigned getDefaultFPU(StringRef CPU, ArchKind AK);		unsigned getDefaultFPU(StringRef CPU, ArchKind AK);
unsigned getDefaultExtensions(StringRef CPU, ArchKind AK);		unsigned getDefaultExtensions(StringRef CPU, ArchKind AK);
StringRef getDefaultCPU(StringRef Arch);		StringRef getDefaultCPU(StringRef Arch);
StringRef getCanonicalArchName(StringRef Arch);		StringRef getCanonicalArchName(StringRef Arch);
StringRef getFPUSynonym(StringRef FPU);		StringRef getFPUSynonym(StringRef FPU);
Show All 20 Lines

llvm/lib/Support/ARMTargetParser.cpp

	Show First 20 Lines • Show All 478 Lines • ▼ Show 20 Lines
	StringRef ARM::getArchExtName(unsigned ArchExtKind) {			StringRef ARM::getArchExtName(unsigned ArchExtKind) {
	for (const auto AE : ARCHExtNames) {			for (const auto AE : ARCHExtNames) {
	if (ArchExtKind == AE.ID)			if (ArchExtKind == AE.ID)
	return AE.getName();			return AE.getName();
	}			}
	return StringRef();			return StringRef();
	}			}

	StringRef ARM::getArchExtFeature(StringRef ArchExt) {			static bool stripNegationPrefix(StringRef &Name) {
	if (ArchExt.startswith("no")) {			if (Name.startswith("no")) {
	StringRef ArchExtBase(ArchExt.substr(2));			Name = Name.substr(2);
	for (const auto AE : ARCHExtNames) {			return true;
	if (AE.NegFeature && ArchExtBase == AE.getName())
	return StringRef(AE.NegFeature);
	}			}
				return false;
	}			}

				StringRef ARM::getArchExtFeature(StringRef ArchExt) {
				bool Negated = stripNegationPrefix(ArchExt);
	for (const auto AE : ARCHExtNames) {			for (const auto AE : ARCHExtNames) {
	if (AE.Feature && ArchExt == AE.getName())			if (AE.Feature && ArchExt == AE.getName())
	return StringRef(AE.Feature);			return StringRef(Negated ? AE.NegFeature : AE.Feature);
	}			}

	return StringRef();			return StringRef();
	}			}

				static unsigned findDoublePrecisionFPU(unsigned InputFPUKind) {
				const ARM::FPUName &InputFPU = ARM::FPUNames[InputFPUKind];

				// If the input FPU already supports double-precision, then there
				// isn't any different FPU we can return here.
				//
				// The current available FPURestriction values are None (no
				// restriction), D16 (only 16 d-regs) and SP_D16 (16 d-regs
				// and single precision only); there's no value representing
				// SP restriction without D16. So this test just means 'is it
				// SP only?'.
				if (InputFPU.Restriction != ARM::FPURestriction::SP_D16)
				return ARM::FK_INVALID;

				// Otherwise, look for an FPU entry with all the same fields, except
				// that SP_D16 has been replaced with just D16, representing adding
				// double precision and not changing anything else.
				for (const ARM::FPUName &CandidateFPU : ARM::FPUNames) {
				if (CandidateFPU.FPUVer == InputFPU.FPUVer &&
				CandidateFPU.NeonSupport == InputFPU.NeonSupport &&
				CandidateFPU.Restriction == ARM::FPURestriction::D16) {
				return CandidateFPU.ID;
				}
				}

				// nothing found
				return ARM::FK_INVALID;
				}

				bool ARM::appendArchExtFeatures(
				StringRef CPU, ARM::ArchKind AK, StringRef ArchExt,
				std::vector<StringRef> &Features) {
				StringRef StandardFeature = getArchExtFeature(ArchExt);
				if (!StandardFeature.empty()) {
				Features.push_back(StandardFeature);
				return true;
				}

				bool Negated = stripNegationPrefix(ArchExt);

				if (ArchExt == "fp" || ArchExt == "fp.dp") {
				unsigned FPUKind;

				if (ArchExt == "fp.dp") {
				unsigned DoubleFPUKind = findDoublePrecisionFPU(getDefaultFPU(CPU, AK));

				// If the default FPU already supports double-precision, or if
				// there is no double-prec FPU that extends it, then "fp.dp"
				// doesn't have a separate meaning, and we treat it as an
				// invalid extension name.
				if (DoubleFPUKind == FK_INVALID)
				return false;

				// If there _is_ a separate double-precision FPU, then "nofp.dp"
				// should disable just the double-precision extension, leaving
				// the base FPU still enabled if it previously was.
				if (Negated) {
				Features.push_back("-fp64");
				return true;
				}

				// Otherwise, select the double-precision FPU.
				FPUKind = DoubleFPUKind;
				} else if (Negated) {
				FPUKind = ARM::FK_NONE;
				} else {
				FPUKind = getDefaultFPU(CPU, AK);
				if (FPUKind == ARM::FK_NONE)
				return false;
				}
				return ARM::getFPUFeatures(FPUKind, Features);
				}

				return false;
				}

	StringRef ARM::getHWDivName(unsigned HWDivKind) {			StringRef ARM::getHWDivName(unsigned HWDivKind) {
	for (const auto D : HWDivNames) {			for (const auto D : HWDivNames) {
	if (HWDivKind == D.ID)			if (HWDivKind == D.ID)
	return D.getName();			return D.getName();
	}			}
	return StringRef();			return StringRef();
	}			}

	▲ Show 20 Lines • Show All 82 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[ARM] Allow "-march=foo+fp" to vary with foo.
ClosedPublic
