This is an archive of the discontinued LLVM Phabricator instance.

Add denormal-fp-math attribute for f16
AbandonedPublic

Authored by dcandler on Jul 7 2022, 8:53 AM.

Download Raw Diff

Details

Reviewers

arsenm
spatel
echristo
andrew.w.kaylor
cameron.mcinally

Summary

Denormal flushing behavior is currently controlled with the
denormal-fp-math attribute, with a denormal-fp-math-f32 variant for
targets such as AMDGPU where f32 denormals are controlled separately
from f16/f64. However there are other targets such as Arm (and I
think x86) where f16 denormals can be distinct from f32/f64. As the
attributes are now used for constant folding, this can lead to
incorrect folded values for half precision floats on those targets.

This patch adds a denormal-fp-math-f16 attribute, which functions
identically to denormal-fp-math-f32, but overrides the denormal
handling mode for f16 only. Constant folding tests have been
expanded to include half floats, and check both f16 and f32
variants of the attribute.

Diff Detail

Event Timeline

dcandler created this revision.Jul 7 2022, 8:53 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 7 2022, 8:53 AM

Herald added subscribers: jsji, kosarev, jdoerfert and 4 others. · View Herald Transcript

dcandler requested review of this revision.Jul 7 2022, 8:53 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptJul 7 2022, 8:53 AM

Herald added subscribers: llvm-commits, cfe-commits, MaskRay, wdng. · View Herald Transcript

Missing ARM changes that demonstrate the use of the control?

Harbormaster completed remote builds in B174173: Diff 442919.Jul 7 2022, 9:54 AM

There are currently no Arm specific changes, this is just being able to more accurately describe the floating point environment via attributes in the case where singles and doubles should be flushed, but not halves.

With three precisions to control, an alternative may be to specify them individually (denormal-fp-math-f64, denormal-fp-math-f32 and denormal-fp-math-f16) so that one doesn't override another, but that would be a much larger and more intrusive change.

In D129298#3638759, @dcandler wrote:

There are currently no Arm specific changes, this is just being able to more accurately describe the floating point environment via attributes in the case where singles and doubles should be flushed, but not halves.

But presumably this corresponds with a directive in the assembly output needed to get a consistent FP mode at start. e.g. ARMAsmPrinter has checks for denormal-fp-math and emits something from it. I would expect a similar check and corresponding test if you can change these separately

Reverse ping

Sorry for the quiet on this. I'm going to abandon this for the moment, as what I eventually found was that there was some ambiguity in the ARM ABI regarding half-floats which would be better to address first, so that the attributes can map directly. There is currently only one ARM build attribute for denormals which reads as though it affects all precisions, but may not have been updated after half-float support was added. Since that maps to denormal-fp-math, which also controls all precisions, both may need splitting rather than just the function level attribute.

dcandler mentioned this in D125807: [ConstantFolding] Pre-commit tests showing denormal handling during folding.Mar 15 2023, 3:20 AM

simonwallis2 added a subscriber: simonwallis2.Mar 15 2023, 4:16 AM

Herald added a subscriber: StephenFan. · View Herald TranscriptMar 15 2023, 4:16 AM

Revision Contents

Path

Size

clang/

docs/

UsersManual.rst

2 lines

include/

clang/

Basic/

CodeGenOptions.h

3 lines

Driver/

Options.td

2 lines

lib/

CodeGen/

CGCall.cpp

4 lines

Driver/

ToolChains/

Clang.cpp

24 lines

Frontend/

CompilerInvocation.cpp

13 lines

llvm/

docs/

LangRef.rst

7 lines

include/

llvm/

CodeGen/

CommandFlags.h

1 line

Target/

TargetOptions.h

8 lines

lib/

CodeGen/

CommandFlags.cpp

17 lines

IR/

Function.cpp

11 lines

test/

Transforms/

InstSimplify/

constant-fold-fp-denormal.ll

715 lines

Diff 442919

clang/docs/UsersManual.rst

Show First 20 Lines • Show All 1,303 Lines • ▼ Show 20 Lines	.. csv-table:: Floating Point Semantic Modes
:widths: 15, 30, 30		:widths: 15, 30, 30

"ffp-exception-behavior", "{ignore, strict, may_trap}",		"ffp-exception-behavior", "{ignore, strict, may_trap}",
"fenv_access", "{off, on}", "(none)"		"fenv_access", "{off, on}", "(none)"
"frounding-math", "{dynamic, tonearest, downward, upward, towardzero}"		"frounding-math", "{dynamic, tonearest, downward, upward, towardzero}"
"ffp-contract", "{on, off, fast, fast-honor-pragmas}"		"ffp-contract", "{on, off, fast, fast-honor-pragmas}"
"fdenormal-fp-math", "{IEEE, PreserveSign, PositiveZero}"		"fdenormal-fp-math", "{IEEE, PreserveSign, PositiveZero}"
"fdenormal-fp-math-fp32", "{IEEE, PreserveSign, PositiveZero}"		"fdenormal-fp-math-fp32", "{IEEE, PreserveSign, PositiveZero}"
		"fdenormal-fp-math-fp16", "{IEEE, PreserveSign, PositiveZero}"
"fmath-errno", "{on, off}"		"fmath-errno", "{on, off}"
"fhonor-nans", "{on, off}"		"fhonor-nans", "{on, off}"
"fhonor-infinities", "{on, off}"		"fhonor-infinities", "{on, off}"
"fsigned-zeros", "{on, off}"		"fsigned-zeros", "{on, off}"
"freciprocal-math", "{on, off}"		"freciprocal-math", "{on, off}"
"allow_approximate_fns", "{on, off}"		"allow_approximate_fns", "{on, off}"
"fassociative-math", "{on, off}"		"fassociative-math", "{on, off}"

This table describes the option settings that correspond to the three		This table describes the option settings that correspond to the three
floating point semantic models: precise (the default), strict, and fast.		floating point semantic models: precise (the default), strict, and fast.


.. csv-table:: Floating Point Models		.. csv-table:: Floating Point Models
:header: "Mode", "Precise", "Strict", "Fast"		:header: "Mode", "Precise", "Strict", "Fast"
:widths: 25, 15, 15, 15		:widths: 25, 15, 15, 15

"except_behavior", "ignore", "strict", "ignore"		"except_behavior", "ignore", "strict", "ignore"
"fenv_access", "off", "on", "off"		"fenv_access", "off", "on", "off"
"rounding_mode", "tonearest", "dynamic", "tonearest"		"rounding_mode", "tonearest", "dynamic", "tonearest"
"contract", "on", "off", "fast"		"contract", "on", "off", "fast"
"denormal_fp_math", "IEEE", "IEEE", "PreserveSign"		"denormal_fp_math", "IEEE", "IEEE", "PreserveSign"
"denormal_fp32_math", "IEEE","IEEE", "PreserveSign"		"denormal_fp32_math", "IEEE","IEEE", "PreserveSign"
		"denormal_fp16_math", "IEEE","IEEE", "PreserveSign"
"support_math_errno", "on", "on", "off"		"support_math_errno", "on", "on", "off"
"no_honor_nans", "off", "off", "on"		"no_honor_nans", "off", "off", "on"
"no_honor_infinities", "off", "off", "on"		"no_honor_infinities", "off", "off", "on"
"no_signed_zeros", "off", "off", "on"		"no_signed_zeros", "off", "off", "on"
"allow_reciprocal", "off", "off", "on"		"allow_reciprocal", "off", "off", "on"
"allow_approximate_fns", "off", "off", "on"		"allow_approximate_fns", "off", "off", "on"
"allow_reassociation", "off", "off", "on"		"allow_reassociation", "off", "off", "on"

▲ Show 20 Lines • Show All 2,871 Lines • Show Last 20 Lines

clang/include/clang/Basic/CodeGenOptions.h

Show First 20 Lines • Show All 190 Lines • ▼ Show 20 Lines	public:
std::string DIBugsReportFilePath;		std::string DIBugsReportFilePath;

/// The floating-point denormal mode to use.		/// The floating-point denormal mode to use.
llvm::DenormalMode FPDenormalMode = llvm::DenormalMode::getIEEE();		llvm::DenormalMode FPDenormalMode = llvm::DenormalMode::getIEEE();

/// The floating-point denormal mode to use, for float.		/// The floating-point denormal mode to use, for float.
llvm::DenormalMode FP32DenormalMode = llvm::DenormalMode::getIEEE();		llvm::DenormalMode FP32DenormalMode = llvm::DenormalMode::getIEEE();

		/// The floating-point denormal mode to use, for half float.
		llvm::DenormalMode FP16DenormalMode = llvm::DenormalMode::getIEEE();

/// The float precision limit to use, if non-empty.		/// The float precision limit to use, if non-empty.
std::string LimitFloatPrecision;		std::string LimitFloatPrecision;

struct BitcodeFileToLink {		struct BitcodeFileToLink {
/// The filename of the bitcode file to link in.		/// The filename of the bitcode file to link in.
std::string Filename;		std::string Filename;
/// If true, we set attributes functions in the bitcode library according to		/// If true, we set attributes functions in the bitcode library according to
/// our CodeGenOptions, much as we set attrs on functions that we generate		/// our CodeGenOptions, much as we set attrs on functions that we generate
▲ Show 20 Lines • Show All 283 Lines • Show Last 20 Lines

clang/include/clang/Driver/Options.td

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 5,501 Lines • ▼ Show 20 Lines	def cfguard : Flag<["-"], "cfguard">,
HelpText<"Emit Windows Control Flow Guard tables and checks">,		HelpText<"Emit Windows Control Flow Guard tables and checks">,
MarshallingInfoFlag<CodeGenOpts<"ControlFlowGuard">>;		MarshallingInfoFlag<CodeGenOpts<"ControlFlowGuard">>;
def ehcontguard : Flag<["-"], "ehcontguard">,		def ehcontguard : Flag<["-"], "ehcontguard">,
HelpText<"Emit Windows EH Continuation Guard tables">,		HelpText<"Emit Windows EH Continuation Guard tables">,
MarshallingInfoFlag<CodeGenOpts<"EHContGuard">>;		MarshallingInfoFlag<CodeGenOpts<"EHContGuard">>;

def fdenormal_fp_math_f32_EQ : Joined<["-"], "fdenormal-fp-math-f32=">,		def fdenormal_fp_math_f32_EQ : Joined<["-"], "fdenormal-fp-math-f32=">,
Group<f_Group>;		Group<f_Group>;
		def fdenormal_fp_math_f16_EQ : Joined<["-"], "fdenormal-fp-math-f16=">,
		Group<f_Group>;

} // let Flags = [CC1Option, NoDriverOption]		} // let Flags = [CC1Option, NoDriverOption]

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Dependency Output Options		// Dependency Output Options
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

let Flags = [CC1Option, NoDriverOption] in {		let Flags = [CC1Option, NoDriverOption] in {
▲ Show 20 Lines • Show All 1,348 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGCall.cpp

Show First 20 Lines • Show All 1,842 Lines • ▼ Show 20 Lines	if (AttrOnCallSite) {
if (CodeGenOpts.FPDenormalMode != llvm::DenormalMode::getIEEE())		if (CodeGenOpts.FPDenormalMode != llvm::DenormalMode::getIEEE())
FuncAttrs.addAttribute("denormal-fp-math",		FuncAttrs.addAttribute("denormal-fp-math",
CodeGenOpts.FPDenormalMode.str());		CodeGenOpts.FPDenormalMode.str());
if (CodeGenOpts.FP32DenormalMode != CodeGenOpts.FPDenormalMode) {		if (CodeGenOpts.FP32DenormalMode != CodeGenOpts.FPDenormalMode) {
FuncAttrs.addAttribute(		FuncAttrs.addAttribute(
"denormal-fp-math-f32",		"denormal-fp-math-f32",
CodeGenOpts.FP32DenormalMode.str());		CodeGenOpts.FP32DenormalMode.str());
}		}
		if (CodeGenOpts.FP16DenormalMode != CodeGenOpts.FPDenormalMode) {
		FuncAttrs.addAttribute("denormal-fp-math-f16",
		CodeGenOpts.FP16DenormalMode.str());
		}

if (LangOpts.getDefaultExceptionMode() == LangOptions::FPE_Ignore)		if (LangOpts.getDefaultExceptionMode() == LangOptions::FPE_Ignore)
FuncAttrs.addAttribute("no-trapping-math", "true");		FuncAttrs.addAttribute("no-trapping-math", "true");

// TODO: Are these all needed?		// TODO: Are these all needed?
// unsafe/inf/nan/nsz are handled by instruction-level FastMathFlags.		// unsafe/inf/nan/nsz are handled by instruction-level FastMathFlags.
if (LangOpts.NoHonorInfs)		if (LangOpts.NoHonorInfs)
FuncAttrs.addAttribute("no-infs-fp-math", "true");		FuncAttrs.addAttribute("no-infs-fp-math", "true");
▲ Show 20 Lines • Show All 3,758 Lines • Show Last 20 Lines

clang/lib/Driver/ToolChains/Clang.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,757 Lines • ▼ Show 20 Lines	static void RenderFloatingPointOptions(const ToolChain &TC, const Driver &D,
// -ffp-exception-behavior options: strict, maytrap, ignore		// -ffp-exception-behavior options: strict, maytrap, ignore
StringRef FPExceptionBehavior = "";		StringRef FPExceptionBehavior = "";
// -ffp-eval-method options: double, extended, source		// -ffp-eval-method options: double, extended, source
StringRef FPEvalMethod = "";		StringRef FPEvalMethod = "";
const llvm::DenormalMode DefaultDenormalFPMath =		const llvm::DenormalMode DefaultDenormalFPMath =
TC.getDefaultDenormalModeForType(Args, JA);		TC.getDefaultDenormalModeForType(Args, JA);
const llvm::DenormalMode DefaultDenormalFP32Math =		const llvm::DenormalMode DefaultDenormalFP32Math =
TC.getDefaultDenormalModeForType(Args, JA, &llvm::APFloat::IEEEsingle());		TC.getDefaultDenormalModeForType(Args, JA, &llvm::APFloat::IEEEsingle());
		const llvm::DenormalMode DefaultDenormalFP16Math =
		TC.getDefaultDenormalModeForType(Args, JA, &llvm::APFloat::IEEEhalf());

llvm::DenormalMode DenormalFPMath = DefaultDenormalFPMath;		llvm::DenormalMode DenormalFPMath = DefaultDenormalFPMath;
llvm::DenormalMode DenormalFP32Math = DefaultDenormalFP32Math;		llvm::DenormalMode DenormalFP32Math = DefaultDenormalFP32Math;
		llvm::DenormalMode DenormalFP16Math = DefaultDenormalFP16Math;
// CUDA and HIP don't rely on the frontend to pass an ffp-contract option.		// CUDA and HIP don't rely on the frontend to pass an ffp-contract option.
// If one wasn't given by the user, don't pass it here.		// If one wasn't given by the user, don't pass it here.
StringRef FPContract;		StringRef FPContract;
if (!JA.isDeviceOffloading(Action::OFK_Cuda) &&		if (!JA.isDeviceOffloading(Action::OFK_Cuda) &&
!JA.isOffloading(Action::OFK_HIP))		!JA.isOffloading(Action::OFK_HIP))
FPContract = "on";		FPContract = "on";
bool StrictFPModel = false;		bool StrictFPModel = false;

Show All 20 Lines	case options::OPT_ffp_model_EQ: {
// -fno_fast_math restores default denormal and fpcontract handling		// -fno_fast_math restores default denormal and fpcontract handling
FPContract = "on";		FPContract = "on";
DenormalFPMath = llvm::DenormalMode::getIEEE();		DenormalFPMath = llvm::DenormalMode::getIEEE();

// FIXME: The target may have picked a non-IEEE default mode here based on		// FIXME: The target may have picked a non-IEEE default mode here based on
// -cl-denorms-are-zero. Should the target consider -fp-model interaction?		// -cl-denorms-are-zero. Should the target consider -fp-model interaction?
DenormalFP32Math = llvm::DenormalMode::getIEEE();		DenormalFP32Math = llvm::DenormalMode::getIEEE();

		DenormalFP16Math = llvm::DenormalMode::getIEEE();

StringRef Val = A->getValue();		StringRef Val = A->getValue();
if (OFastEnabled && !Val.equals("fast")) {		if (OFastEnabled && !Val.equals("fast")) {
// Only -ffp-model=fast is compatible with OFast, ignore.		// Only -ffp-model=fast is compatible with OFast, ignore.
D.Diag(clang::diag::warn_drv_overriding_flag_option)		D.Diag(clang::diag::warn_drv_overriding_flag_option)
<< Args.MakeArgString("-ffp-model=" + Val)		<< Args.MakeArgString("-ffp-model=" + Val)
<< "-Ofast";		<< "-Ofast";
break;		break;
}		}
▲ Show 20 Lines • Show All 93 Lines • ▼ Show 20 Lines	for (const Arg *A : Args) {
case options::OPT_fdenormal_fp_math_f32_EQ:		case options::OPT_fdenormal_fp_math_f32_EQ:
DenormalFP32Math = llvm::parseDenormalFPAttribute(A->getValue());		DenormalFP32Math = llvm::parseDenormalFPAttribute(A->getValue());
if (!DenormalFP32Math.isValid()) {		if (!DenormalFP32Math.isValid()) {
D.Diag(diag::err_drv_invalid_value)		D.Diag(diag::err_drv_invalid_value)
<< A->getAsString(Args) << A->getValue();		<< A->getAsString(Args) << A->getValue();
}		}
break;		break;

		case options::OPT_fdenormal_fp_math_f16_EQ:
		DenormalFP16Math = llvm::parseDenormalFPAttribute(A->getValue());
		if (!DenormalFP16Math.isValid()) {
		D.Diag(diag::err_drv_invalid_value)
		<< A->getAsString(Args) << A->getValue();
		}
		break;

// Validate and pass through -ffp-contract option.		// Validate and pass through -ffp-contract option.
case options::OPT_ffp_contract: {		case options::OPT_ffp_contract: {
StringRef Val = A->getValue();		StringRef Val = A->getValue();
if (PreciseFPModel) {		if (PreciseFPModel) {
// -ffp-model=precise enables ffp-contract=on.		// -ffp-model=precise enables ffp-contract=on.
// -ffp-model=precise sets PreciseFPModel to on and Val to		// -ffp-model=precise sets PreciseFPModel to on and Val to
// "precise". FPContract is set.		// "precise". FPContract is set.
;		;
▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines	case options::OPT_fno_unsafe_math_optimizations:
SignedZeros = true;		SignedZeros = true;
ApproxFunc = false;		ApproxFunc = false;
TrappingMath = true;		TrappingMath = true;
FPExceptionBehavior = "strict";		FPExceptionBehavior = "strict";

// The target may have opted to flush by default, so force IEEE.		// The target may have opted to flush by default, so force IEEE.
DenormalFPMath = llvm::DenormalMode::getIEEE();		DenormalFPMath = llvm::DenormalMode::getIEEE();
DenormalFP32Math = llvm::DenormalMode::getIEEE();		DenormalFP32Math = llvm::DenormalMode::getIEEE();
		DenormalFP16Math = llvm::DenormalMode::getIEEE();
break;		break;

case options::OPT_Ofast:		case options::OPT_Ofast:
// If -Ofast is the optimization level, then -ffast-math should be enabled		// If -Ofast is the optimization level, then -ffast-math should be enabled
if (!OFastEnabled)		if (!OFastEnabled)
continue;		continue;
LLVM_FALLTHROUGH;		LLVM_FALLTHROUGH;
case options::OPT_ffast_math:		case options::OPT_ffast_math:
Show All 18 Lines	case options::OPT_fno_fast_math:
MathErrno = TC.IsMathErrnoDefault();		MathErrno = TC.IsMathErrnoDefault();
AssociativeMath = false;		AssociativeMath = false;
ReciprocalMath = false;		ReciprocalMath = false;
ApproxFunc = false;		ApproxFunc = false;
SignedZeros = true;		SignedZeros = true;
// -fno_fast_math restores default denormal and fpcontract handling		// -fno_fast_math restores default denormal and fpcontract handling
DenormalFPMath = DefaultDenormalFPMath;		DenormalFPMath = DefaultDenormalFPMath;
DenormalFP32Math = llvm::DenormalMode::getIEEE();		DenormalFP32Math = llvm::DenormalMode::getIEEE();
		DenormalFP16Math = llvm::DenormalMode::getIEEE();
if (!JA.isDeviceOffloading(Action::OFK_Cuda) &&		if (!JA.isDeviceOffloading(Action::OFK_Cuda) &&
!JA.isOffloading(Action::OFK_HIP))		!JA.isOffloading(Action::OFK_HIP))
if (FPContract == "fast") {		if (FPContract == "fast") {
FPContract = "on";		FPContract = "on";
D.Diag(clang::diag::warn_drv_overriding_flag_option)		D.Diag(clang::diag::warn_drv_overriding_flag_option)
<< "-ffp-contract=fast"		<< "-ffp-contract=fast"
<< "-ffp-contract=on";		<< "-ffp-contract=on";
}		}
break;		break;
}		}
if (StrictFPModel) {		if (StrictFPModel) {
// If -ffp-model=strict has been specified on command line but		// If -ffp-model=strict has been specified on command line but
// subsequent options conflict then emit warning diagnostic.		// subsequent options conflict then emit warning diagnostic.
if (HonorINFs && HonorNaNs && !AssociativeMath && !ReciprocalMath &&		if (HonorINFs && HonorNaNs && !AssociativeMath && !ReciprocalMath &&
SignedZeros && TrappingMath && RoundingFPMath && !ApproxFunc &&		SignedZeros && TrappingMath && RoundingFPMath && !ApproxFunc &&
DenormalFPMath == llvm::DenormalMode::getIEEE() &&		DenormalFPMath == llvm::DenormalMode::getIEEE() &&
DenormalFP32Math == llvm::DenormalMode::getIEEE() &&		DenormalFP32Math == llvm::DenormalMode::getIEEE() &&
		DenormalFP16Math == llvm::DenormalMode::getIEEE() &&
FPContract.equals("off"))		FPContract.equals("off"))
// OK: Current Arg doesn't conflict with -ffp-model=strict		// OK: Current Arg doesn't conflict with -ffp-model=strict
;		;
else {		else {
StrictFPModel = false;		StrictFPModel = false;
FPModel = "";		FPModel = "";
D.Diag(clang::diag::warn_drv_overriding_flag_option)		D.Diag(clang::diag::warn_drv_overriding_flag_option)
<< "-ffp-model=strict" <<		<< "-ffp-model=strict" <<
▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	static void RenderFloatingPointOptions(const ToolChain &TC, const Driver &D,
// Add f32 specific denormal mode flag if it's different.		// Add f32 specific denormal mode flag if it's different.
if (DenormalFP32Math != DenormalFPMath) {		if (DenormalFP32Math != DenormalFPMath) {
llvm::SmallString<64> DenormFlag;		llvm::SmallString<64> DenormFlag;
llvm::raw_svector_ostream ArgStr(DenormFlag);		llvm::raw_svector_ostream ArgStr(DenormFlag);
ArgStr << "-fdenormal-fp-math-f32=" << DenormalFP32Math;		ArgStr << "-fdenormal-fp-math-f32=" << DenormalFP32Math;
CmdArgs.push_back(Args.MakeArgString(ArgStr.str()));		CmdArgs.push_back(Args.MakeArgString(ArgStr.str()));
}		}

		// Add f16 specific denormal mode flag if it's different.
		if (DenormalFP16Math != DenormalFPMath) {
		llvm::SmallString<64> DenormFlag;
		llvm::raw_svector_ostream ArgStr(DenormFlag);
		ArgStr << "-fdenormal-fp-math-f16=" << DenormalFP16Math;
		CmdArgs.push_back(Args.MakeArgString(ArgStr.str()));
		}

if (!FPContract.empty())		if (!FPContract.empty())
CmdArgs.push_back(Args.MakeArgString("-ffp-contract=" + FPContract));		CmdArgs.push_back(Args.MakeArgString("-ffp-contract=" + FPContract));

if (!RoundingFPMath)		if (!RoundingFPMath)
CmdArgs.push_back(Args.MakeArgString("-fno-rounding-math"));		CmdArgs.push_back(Args.MakeArgString("-fno-rounding-math"));

if (RoundingFPMath && RoundingMathPresent)		if (RoundingFPMath && RoundingMathPresent)
CmdArgs.push_back(Args.MakeArgString("-frounding-math"));		CmdArgs.push_back(Args.MakeArgString("-frounding-math"));
▲ Show 20 Lines • Show All 5,365 Lines • Show Last 20 Lines

clang/lib/Frontend/CompilerInvocation.cpp

Show First 20 Lines • Show All 1,503 Lines • ▼ Show 20 Lines	#undef CODEGEN_OPTION_WITH_MARSHALLING
if (Opts.FPDenormalMode != llvm::DenormalMode::getIEEE())		if (Opts.FPDenormalMode != llvm::DenormalMode::getIEEE())
GenerateArg(Args, OPT_fdenormal_fp_math_EQ, Opts.FPDenormalMode.str(), SA);		GenerateArg(Args, OPT_fdenormal_fp_math_EQ, Opts.FPDenormalMode.str(), SA);

if ((Opts.FPDenormalMode != Opts.FP32DenormalMode) \|\|		if ((Opts.FPDenormalMode != Opts.FP32DenormalMode) \|\|
(Opts.FP32DenormalMode != llvm::DenormalMode::getIEEE()))		(Opts.FP32DenormalMode != llvm::DenormalMode::getIEEE()))
GenerateArg(Args, OPT_fdenormal_fp_math_f32_EQ, Opts.FP32DenormalMode.str(),		GenerateArg(Args, OPT_fdenormal_fp_math_f32_EQ, Opts.FP32DenormalMode.str(),
SA);		SA);

		if ((Opts.FPDenormalMode != Opts.FP16DenormalMode) \|\|
		(Opts.FP16DenormalMode != llvm::DenormalMode::getIEEE()))
		GenerateArg(Args, OPT_fdenormal_fp_math_f16_EQ, Opts.FP16DenormalMode.str(),
		SA);

if (Opts.StructReturnConvention == CodeGenOptions::SRCK_OnStack) {		if (Opts.StructReturnConvention == CodeGenOptions::SRCK_OnStack) {
OptSpecifier Opt =		OptSpecifier Opt =
T.isPPC32() ? OPT_maix_struct_return : OPT_fpcc_struct_return;		T.isPPC32() ? OPT_maix_struct_return : OPT_fpcc_struct_return;
GenerateArg(Args, Opt, SA);		GenerateArg(Args, Opt, SA);
} else if (Opts.StructReturnConvention == CodeGenOptions::SRCK_InRegs) {		} else if (Opts.StructReturnConvention == CodeGenOptions::SRCK_InRegs) {
OptSpecifier Opt =		OptSpecifier Opt =
T.isPPC32() ? OPT_msvr4_struct_return : OPT_freg_struct_return;		T.isPPC32() ? OPT_msvr4_struct_return : OPT_freg_struct_return;
GenerateArg(Args, Opt, SA);		GenerateArg(Args, Opt, SA);
▲ Show 20 Lines • Show All 334 Lines • ▼ Show 20 Lines	if (T.isOSAIX()) {
Diags.Report(diag::err_aix_unsupported_tls_model) << Name;		Diags.Report(diag::err_aix_unsupported_tls_model) << Name;
}		}
}		}

if (Arg *A = Args.getLastArg(OPT_fdenormal_fp_math_EQ)) {		if (Arg *A = Args.getLastArg(OPT_fdenormal_fp_math_EQ)) {
StringRef Val = A->getValue();		StringRef Val = A->getValue();
Opts.FPDenormalMode = llvm::parseDenormalFPAttribute(Val);		Opts.FPDenormalMode = llvm::parseDenormalFPAttribute(Val);
Opts.FP32DenormalMode = Opts.FPDenormalMode;		Opts.FP32DenormalMode = Opts.FPDenormalMode;
		Opts.FP16DenormalMode = Opts.FPDenormalMode;
if (!Opts.FPDenormalMode.isValid())		if (!Opts.FPDenormalMode.isValid())
Diags.Report(diag::err_drv_invalid_value) << A->getAsString(Args) << Val;		Diags.Report(diag::err_drv_invalid_value) << A->getAsString(Args) << Val;
}		}

if (Arg *A = Args.getLastArg(OPT_fdenormal_fp_math_f32_EQ)) {		if (Arg *A = Args.getLastArg(OPT_fdenormal_fp_math_f32_EQ)) {
StringRef Val = A->getValue();		StringRef Val = A->getValue();
Opts.FP32DenormalMode = llvm::parseDenormalFPAttribute(Val);		Opts.FP32DenormalMode = llvm::parseDenormalFPAttribute(Val);
if (!Opts.FP32DenormalMode.isValid())		if (!Opts.FP32DenormalMode.isValid())
Diags.Report(diag::err_drv_invalid_value) << A->getAsString(Args) << Val;		Diags.Report(diag::err_drv_invalid_value) << A->getAsString(Args) << Val;
}		}

		if (Arg *A = Args.getLastArg(OPT_fdenormal_fp_math_f16_EQ)) {
		StringRef Val = A->getValue();
		Opts.FP16DenormalMode = llvm::parseDenormalFPAttribute(Val);
		if (!Opts.FP16DenormalMode.isValid())
		Diags.Report(diag::err_drv_invalid_value) << A->getAsString(Args) << Val;
		}

// X86_32 has -fppc-struct-return and -freg-struct-return.		// X86_32 has -fppc-struct-return and -freg-struct-return.
// PPC32 has -maix-struct-return and -msvr4-struct-return.		// PPC32 has -maix-struct-return and -msvr4-struct-return.
if (Arg *A =		if (Arg *A =
Args.getLastArg(OPT_fpcc_struct_return, OPT_freg_struct_return,		Args.getLastArg(OPT_fpcc_struct_return, OPT_freg_struct_return,
OPT_maix_struct_return, OPT_msvr4_struct_return)) {		OPT_maix_struct_return, OPT_msvr4_struct_return)) {
// TODO: We might want to consider enabling these options on AIX in the		// TODO: We might want to consider enabling these options on AIX in the
// future.		// future.
if (T.isOSAIX())		if (T.isOSAIX())
▲ Show 20 Lines • Show All 2,793 Lines • Show Last 20 Lines

llvm/docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 2,165 Lines • ▼ Show 20 Lines
	``"denormal-fp-math-f32"``			``"denormal-fp-math-f32"``
	Same as ``"denormal-fp-math"``, but only controls the behavior of			Same as ``"denormal-fp-math"``, but only controls the behavior of
	the 32-bit float type (or vectors of 32-bit floats). If both are			the 32-bit float type (or vectors of 32-bit floats). If both are
	are present, this overrides ``"denormal-fp-math"``. Not all targets			are present, this overrides ``"denormal-fp-math"``. Not all targets
	support separately setting the denormal mode per type, and no			support separately setting the denormal mode per type, and no
	attempt is made to diagnose unsupported uses. Currently this			attempt is made to diagnose unsupported uses. Currently this
	attribute is respected by the AMDGPU and NVPTX backends.			attribute is respected by the AMDGPU and NVPTX backends.

				``"denormal-fp-math-f16"``
				Same as ``"denormal-fp-math"``, but only controls the behavior of
				the 16-bit float type (or vectors of 16-bit floats). If both are
				are present, this overrides ``"denormal-fp-math"``. Not all targets
				support separately setting the denormal mode per type, and no
				attempt is made to diagnose unsupported uses.

	``"thunk"``			``"thunk"``
	This attribute indicates that the function will delegate to some other			This attribute indicates that the function will delegate to some other
	function with a tail call. The prototype of a thunk should not be used for			function with a tail call. The prototype of a thunk should not be used for
	optimization purposes. The caller is expected to cast the thunk prototype to			optimization purposes. The caller is expected to cast the thunk prototype to
	match the thunk target prototype.			match the thunk target prototype.

	``"tls-load-hoist"``			``"tls-load-hoist"``
	This attribute indicates that the function will try to reduce redundant			This attribute indicates that the function will try to reduce redundant
	▲ Show 20 Lines • Show All 23,041 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/CommandFlags.h

	Show First 20 Lines • Show All 60 Lines • ▼ Show 20 Lines
	bool getEnableNoSignedZerosFPMath();			bool getEnableNoSignedZerosFPMath();

	bool getEnableApproxFuncFPMath();			bool getEnableApproxFuncFPMath();

	bool getEnableNoTrappingFPMath();			bool getEnableNoTrappingFPMath();

	DenormalMode::DenormalModeKind getDenormalFPMath();			DenormalMode::DenormalModeKind getDenormalFPMath();
	DenormalMode::DenormalModeKind getDenormalFP32Math();			DenormalMode::DenormalModeKind getDenormalFP32Math();
				DenormalMode::DenormalModeKind getDenormalFP16Math();

	bool getEnableHonorSignDependentRoundingFPMath();			bool getEnableHonorSignDependentRoundingFPMath();

	llvm::FloatABI::ABIType getFloatABIForCalls();			llvm::FloatABI::ABIType getFloatABIForCalls();

	llvm::FPOpFusion::FPOpFusionMode getFuseFPOps();			llvm::FPOpFusion::FPOpFusionMode getFuseFPOps();

	SwiftAsyncFramePointerMode getSwiftAsyncFramePointer();			SwiftAsyncFramePointerMode getSwiftAsyncFramePointer();
	▲ Show 20 Lines • Show All 109 Lines • Show Last 20 Lines

llvm/include/llvm/Target/TargetOptions.h

Show First 20 Lines • Show All 410 Lines • ▼ Show 20 Lines	namespace llvm {
private:		private:
/// Flushing mode to assume in default FP environment.		/// Flushing mode to assume in default FP environment.
DenormalMode FPDenormalMode;		DenormalMode FPDenormalMode;

/// Flushing mode to assume in default FP environment, for float/vector of		/// Flushing mode to assume in default FP environment, for float/vector of
/// float.		/// float.
DenormalMode FP32DenormalMode;		DenormalMode FP32DenormalMode;

		/// Flushing mode to assume in default FP environment, for half float/vector
		/// of half float.
		DenormalMode FP16DenormalMode;

public:		public:
void setFPDenormalMode(DenormalMode Mode) {		void setFPDenormalMode(DenormalMode Mode) {
FPDenormalMode = Mode;		FPDenormalMode = Mode;
}		}

void setFP32DenormalMode(DenormalMode Mode) {		void setFP32DenormalMode(DenormalMode Mode) {
FP32DenormalMode = Mode;		FP32DenormalMode = Mode;
}		}

		void setFP16DenormalMode(DenormalMode Mode) { FP16DenormalMode = Mode; }

DenormalMode getRawFPDenormalMode() const {		DenormalMode getRawFPDenormalMode() const {
return FPDenormalMode;		return FPDenormalMode;
}		}

DenormalMode getRawFP32DenormalMode() const {		DenormalMode getRawFP32DenormalMode() const {
return FP32DenormalMode;		return FP32DenormalMode;
}		}

		DenormalMode getRawFP16DenormalMode() const { return FP16DenormalMode; }

DenormalMode getDenormalMode(const fltSemantics &FPType) const;		DenormalMode getDenormalMode(const fltSemantics &FPType) const;

/// What exception model to use		/// What exception model to use
ExceptionHandling ExceptionModel = ExceptionHandling::None;		ExceptionHandling ExceptionModel = ExceptionHandling::None;

/// Machine level options.		/// Machine level options.
MCTargetOptions MCOptions;		MCTargetOptions MCOptions;

Show All 9 Lines

llvm/lib/CodeGen/CommandFlags.cpp

Show First 20 Lines • Show All 62 Lines • ▼ Show 20 Lines
CGOPT(bool, EnableNoInfsFPMath)		CGOPT(bool, EnableNoInfsFPMath)
CGOPT(bool, EnableNoNaNsFPMath)		CGOPT(bool, EnableNoNaNsFPMath)
CGOPT(bool, EnableNoSignedZerosFPMath)		CGOPT(bool, EnableNoSignedZerosFPMath)
CGOPT(bool, EnableApproxFuncFPMath)		CGOPT(bool, EnableApproxFuncFPMath)
CGOPT(bool, EnableNoTrappingFPMath)		CGOPT(bool, EnableNoTrappingFPMath)
CGOPT(bool, EnableAIXExtendedAltivecABI)		CGOPT(bool, EnableAIXExtendedAltivecABI)
CGOPT(DenormalMode::DenormalModeKind, DenormalFPMath)		CGOPT(DenormalMode::DenormalModeKind, DenormalFPMath)
CGOPT(DenormalMode::DenormalModeKind, DenormalFP32Math)		CGOPT(DenormalMode::DenormalModeKind, DenormalFP32Math)
		CGOPT(DenormalMode::DenormalModeKind, DenormalFP16Math)
CGOPT(bool, EnableHonorSignDependentRoundingFPMath)		CGOPT(bool, EnableHonorSignDependentRoundingFPMath)
CGOPT(FloatABI::ABIType, FloatABIForCalls)		CGOPT(FloatABI::ABIType, FloatABIForCalls)
CGOPT(FPOpFusion::FPOpFusionMode, FuseFPOps)		CGOPT(FPOpFusion::FPOpFusionMode, FuseFPOps)
CGOPT(SwiftAsyncFramePointerMode, SwiftAsyncFramePointer)		CGOPT(SwiftAsyncFramePointerMode, SwiftAsyncFramePointer)
CGOPT(bool, DontPlaceZerosInBSS)		CGOPT(bool, DontPlaceZerosInBSS)
CGOPT(bool, EnableGuaranteedTailCallOpt)		CGOPT(bool, EnableGuaranteedTailCallOpt)
CGOPT(bool, DisableTailCalls)		CGOPT(bool, DisableTailCalls)
CGOPT(bool, StackSymbolOrdering)		CGOPT(bool, StackSymbolOrdering)
▲ Show 20 Lines • Show All 179 Lines • ▼ Show 20 Lines	#define CGBINDOPT(NAME) \

static cl::opt<DenormalMode::DenormalModeKind> DenormalFP32Math(		static cl::opt<DenormalMode::DenormalModeKind> DenormalFP32Math(
"denormal-fp-math-f32",		"denormal-fp-math-f32",
cl::desc("Select which denormal numbers the code is permitted to require for float"),		cl::desc("Select which denormal numbers the code is permitted to require for float"),
cl::init(DenormalMode::Invalid),		cl::init(DenormalMode::Invalid),
DenormFlagEnumOptions);		DenormFlagEnumOptions);
CGBINDOPT(DenormalFP32Math);		CGBINDOPT(DenormalFP32Math);

		static cl::opt<DenormalMode::DenormalModeKind> DenormalFP16Math(
		"denormal-fp-math-f16",
		cl::desc("Select which denormal numbers the code is permitted to require "
		"for half float"),
		cl::init(DenormalMode::Invalid), DenormFlagEnumOptions);
		CGBINDOPT(DenormalFP16Math);

static cl::opt<bool> EnableHonorSignDependentRoundingFPMath(		static cl::opt<bool> EnableHonorSignDependentRoundingFPMath(
"enable-sign-dependent-rounding-fp-math", cl::Hidden,		"enable-sign-dependent-rounding-fp-math", cl::Hidden,
cl::desc("Force codegen to assume rounding mode can change dynamically"),		cl::desc("Force codegen to assume rounding mode can change dynamically"),
cl::init(false));		cl::init(false));
CGBINDOPT(EnableHonorSignDependentRoundingFPMath);		CGBINDOPT(EnableHonorSignDependentRoundingFPMath);

static cl::opt<FloatABI::ABIType> FloatABIForCalls(		static cl::opt<FloatABI::ABIType> FloatABIForCalls(
"float-abi", cl::desc("Choose float ABI type"),		"float-abi", cl::desc("Choose float ABI type"),
▲ Show 20 Lines • Show All 414 Lines • ▼ Show 20 Lines	if (DenormalFP32MathView->getNumOccurrences() > 0 &&
// FIXME: Command line flag should expose separate input/output modes.		// FIXME: Command line flag should expose separate input/output modes.
DenormalMode::DenormalModeKind DenormKind = getDenormalFP32Math();		DenormalMode::DenormalModeKind DenormKind = getDenormalFP32Math();

NewAttrs.addAttribute(		NewAttrs.addAttribute(
"denormal-fp-math-f32",		"denormal-fp-math-f32",
DenormalMode(DenormKind, DenormKind).str());		DenormalMode(DenormKind, DenormKind).str());
}		}

		if (DenormalFP16MathView->getNumOccurrences() > 0 &&
		!F.hasFnAttribute("denormal-fp-math-f16")) {
		// FIXME: Command line flag should expose separate input/output modes.
		DenormalMode::DenormalModeKind DenormKind = getDenormalFP16Math();

		NewAttrs.addAttribute("denormal-fp-math-f16",
		DenormalMode(DenormKind, DenormKind).str());
		}

if (TrapFuncNameView->getNumOccurrences() > 0)		if (TrapFuncNameView->getNumOccurrences() > 0)
for (auto &B : F)		for (auto &B : F)
for (auto &I : B)		for (auto &I : B)
if (auto *Call = dyn_cast<CallInst>(&I))		if (auto *Call = dyn_cast<CallInst>(&I))
if (const auto *F = Call->getCalledFunction())		if (const auto *F = Call->getCalledFunction())
if (F->getIntrinsicID() == Intrinsic::debugtrap \|\|		if (F->getIntrinsicID() == Intrinsic::debugtrap \|\|
F->getIntrinsicID() == Intrinsic::trap)		F->getIntrinsicID() == Intrinsic::trap)
Call->addFnAttr(		Call->addFnAttr(
Show All 13 Lines

llvm/lib/IR/Function.cpp

	Show First 20 Lines • Show All 669 Lines • ▼ Show 20 Lines
	}			}

	DenormalMode Function::getDenormalMode(const fltSemantics &FPType) const {			DenormalMode Function::getDenormalMode(const fltSemantics &FPType) const {
	if (&FPType == &APFloat::IEEEsingle()) {			if (&FPType == &APFloat::IEEEsingle()) {
	Attribute Attr = getFnAttribute("denormal-fp-math-f32");			Attribute Attr = getFnAttribute("denormal-fp-math-f32");
	StringRef Val = Attr.getValueAsString();			StringRef Val = Attr.getValueAsString();
	if (!Val.empty())			if (!Val.empty())
	return parseDenormalFPAttribute(Val);			return parseDenormalFPAttribute(Val);

	// If the f32 variant of the attribute isn't specified, try to use the
	// generic one.
	}			}
				if (&FPType == &APFloat::IEEEhalf()) {
				Attribute Attr = getFnAttribute("denormal-fp-math-f16");
				StringRef Val = Attr.getValueAsString();
				if (!Val.empty())
				return parseDenormalFPAttribute(Val);
				}
				// If the f32 or f16 variant of the attribute isn't specified, try to use
				// the generic one.

	Attribute Attr = getFnAttribute("denormal-fp-math");			Attribute Attr = getFnAttribute("denormal-fp-math");
	return parseDenormalFPAttribute(Attr.getValueAsString());			return parseDenormalFPAttribute(Attr.getValueAsString());
	}			}

	const std::string &Function::getGC() const {			const std::string &Function::getGC() const {
	assert(hasGC() && "Function has no collector");			assert(hasGC() && "Function has no collector");
	return getContext().getGC(*this);			return getContext().getGC(*this);
	▲ Show 20 Lines • Show All 1,367 Lines • Show Last 20 Lines

llvm/test/Transforms/InstSimplify/constant-fold-fp-denormal.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -instsimplify < %s \| FileCheck %s			; RUN: opt -S -instsimplify < %s \| FileCheck %s

	; Test cases for denormal handling mode when constant folding floating point			; Test cases for denormal handling mode when constant folding floating point
	; operations. Input and output modes are checked seperately.			; operations. Input and output modes are checked seperately.

	; ============================================================================ ;			; ============================================================================ ;
	; fadd tests			; fadd tests
	; Denormal operand added to normal operand produces denormal result.			; Denormal operand added to normal operand produces denormal result.
	; If denormal outputs should be flushed to zero, the result should be zero.			; If denormal outputs should be flushed to zero, the result should be zero.
	; If denormal inputs should be treated as zero, the result should be the			; If denormal inputs should be treated as zero, the result should be the
	; normal operand (a number plus zero is the same number).			; normal operand (a number plus zero is the same number).
	; ============================================================================ ;			; ============================================================================ ;

				define half @test_half_fadd_ieee() #0 {
				; CHECK-LABEL: @test_half_fadd_ieee(
				; CHECK-NEXT: ret half 0xH8200
				;
				; default ieee mode leaves result as a denormal
				%result = fadd half 0xH8400, 0xH0200
				ret half %result
				}

				define half @test_half_fadd_pzero_out() #1 {
				; CHECK-LABEL: @test_half_fadd_pzero_out(
				; CHECK-NEXT: ret half 0xH0000
				;
				; denormal result is flushed to positive zero
				%result = fadd half 0xH8400, 0xH0200
				ret half %result
				}

				define half @test_half_fadd_psign_out() #2 {
				; CHECK-LABEL: @test_half_fadd_psign_out(
				; CHECK-NEXT: ret half 0xH8000
				;
				; denormal result is flushed to sign preserved zero
				%result = fadd half 0xH8400, 0xH0200
				ret half %result
				}

				define half @test_half_fadd_pzero_in() #3 {
				; CHECK-LABEL: @test_half_fadd_pzero_in(
				; CHECK-NEXT: ret half 0xH8400
				;
				; denormal operand is treated as zero
				; normal operand added to zero results in the same operand as a result
				%result = fadd half 0xH8400, 0xH0200
				ret half %result
				}

				define half @test_half_fadd_psign_in() #4 {
				; CHECK-LABEL: @test_half_fadd_psign_in(
				; CHECK-NEXT: ret half 0xH8400
				;
				; denormal operand is treated as zero
				; normal operand added to zero results in the same operand as a result
				%result = fadd half 0xH8400, 0xH0200
				ret half %result
				}

				define half @test_half_fadd_pzero_f16_pzero_out() #9 {
				; CHECK-LABEL: @test_half_fadd_pzero_f16_pzero_out(
				; CHECK-NEXT: ret half 0xH0000
				;
				; f16 only attribute should flush half float output
				; same as pzero_out above
				%result = fadd half 0xH8400, 0xH0200
				ret half %result
				}

				define half @test_half_fadd_pzero_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_half_fadd_pzero_f32_pzero_out(
				; CHECK-NEXT: ret half 0xH8200
				;
				; f32 only attribute should not flush half float output
				; default ieee mode leaves result as a denormal
				%result = fadd half 0xH8400, 0xH0200
				ret half %result
				}

	define float @test_float_fadd_ieee() #0 {			define float @test_float_fadd_ieee() #0 {
	; CHECK-LABEL: @test_float_fadd_ieee(			; CHECK-LABEL: @test_float_fadd_ieee(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	;			;
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fadd float 0xB810000000000000, 0x3800000000000000			%result = fadd float 0xB810000000000000, 0x3800000000000000
	ret float %result			ret float %result
	}			}
	Show All 31 Lines
	; CHECK-NEXT: ret float 0xB810000000000000			; CHECK-NEXT: ret float 0xB810000000000000
	;			;
	; denormal operand is treated as zero			; denormal operand is treated as zero
	; normal operand added to zero results in the same operand as a result			; normal operand added to zero results in the same operand as a result
	%result = fadd float 0xB810000000000000, 0x3800000000000000			%result = fadd float 0xB810000000000000, 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fadd_pzero_f32_out() #5 {			define float @test_float_fadd_pzero_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_float_fadd_pzero_f32_out(			; CHECK-LABEL: @test_float_fadd_pzero_f16_pzero_out(
				; CHECK-NEXT: ret float 0xB800000000000000
				;
				; f16 only attribute should not flush float output
				; default ieee mode leaves result as a denormal
				%result = fadd float 0xB810000000000000, 0x3800000000000000
				ret float %result
				}

				define float @test_float_fadd_pzero_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_float_fadd_pzero_f32_pzero_out(
	; CHECK-NEXT: ret float 0.000000e+00			; CHECK-NEXT: ret float 0.000000e+00
	;			;
	; f32 only attribute should flush float output			; f32 only attribute should flush float output
	; default ieee mode leaves result as a denormal			; same as pzero_out above
	%result = fadd float 0xB810000000000000, 0x3800000000000000			%result = fadd float 0xB810000000000000, 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define double @test_double_fadd_ieee() #0 {			define double @test_double_fadd_ieee() #0 {
	; CHECK-LABEL: @test_double_fadd_ieee(			; CHECK-LABEL: @test_double_fadd_ieee(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	;			;
	Show All 35 Lines
	; CHECK-NEXT: ret double 0x8010000000000000			; CHECK-NEXT: ret double 0x8010000000000000
	;			;
	; denormal operand is treated as zero			; denormal operand is treated as zero
	; normal operand added to zero results in the same operand as a result			; normal operand added to zero results in the same operand as a result
	%result = fadd double 0x8010000000000000, 0x0008000000000000			%result = fadd double 0x8010000000000000, 0x0008000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fadd_f32_ieee() #5 {			define double @test_double_fadd_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_double_fadd_f32_ieee(			; CHECK-LABEL: @test_double_fadd_f16_pzero_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	;			;
	; f32 only attribute should not flush doubles			; f16 only attribute should not flush double output
				; default ieee mode leaves result as a denormal
				%result = fadd double 0x8010000000000000, 0x0008000000000000
				ret double %result
				}

				define double @test_double_fadd_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_double_fadd_f32_pzero_out(
				; CHECK-NEXT: ret double 0x8008000000000000
				;
				; f32 only attribute should not flush double output
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fadd double 0x8010000000000000, 0x0008000000000000			%result = fadd double 0x8010000000000000, 0x0008000000000000
	ret double %result			ret double %result
	}			}

	; ============================================================================ ;			; ============================================================================ ;
	; fsub tests			; fsub tests
	; Normal operand subtracted from denormal operand produces denormal result			; Normal operand subtracted from denormal operand produces denormal result
	; If denormal outputs should be flushed to zero, the result should be zero.			; If denormal outputs should be flushed to zero, the result should be zero.
	; If denormal inputs should be treated as zero, the result should be the			; If denormal inputs should be treated as zero, the result should be the
	; negated normal operand (zero minus the original operand).			; negated normal operand (zero minus the original operand).
	; ============================================================================ ;			; ============================================================================ ;

				define half @test_half_fsub_ieee() #0 {
				; CHECK-LABEL: @test_half_fsub_ieee(
				; CHECK-NEXT: ret half 0xH8200
				;
				; default ieee mode leaves result as a denormal
				%result = fsub half 0xH0200, 0xH0400
				ret half %result
				}

				define half @test_half_fsub_pzero_out() #1 {
				; CHECK-LABEL: @test_half_fsub_pzero_out(
				; CHECK-NEXT: ret half 0xH0000
				;
				; denormal result is flushed to positive zero
				%result = fsub half 0xH0200, 0xH0400
				ret half %result
				}

				define half @test_half_fsub_psign_out() #2 {
				; CHECK-LABEL: @test_half_fsub_psign_out(
				; CHECK-NEXT: ret half 0xH8000
				;
				; denormal result is flushed to sign preserved zero
				%result = fsub half 0xH0200, 0xH0400
				ret half %result
				}

				define half @test_half_fsub_pzero_in() #3 {
				; CHECK-LABEL: @test_half_fsub_pzero_in(
				; CHECK-NEXT: ret half 0xH8400
				;
				; denormal operand is treated as zero
				; normal operand subtracted from zero produces the same operand, negated
				%result = fsub half 0xH0200, 0xH0400
				ret half %result
				}

				define half @test_half_fsub_psign_in() #4 {
				; CHECK-LABEL: @test_half_fsub_psign_in(
				; CHECK-NEXT: ret half 0xH8400
				;
				; denormal operand is treated as zero
				; normal operand subtracted from zero produces the same operand, negated
				%result = fsub half 0xH0200, 0xH0400
				ret half %result
				}

				define half @test_half_fsub_pzero_f16_pzero_out() #9 {
				; CHECK-LABEL: @test_half_fsub_pzero_f16_pzero_out(
				; CHECK-NEXT: ret half 0xH0000
				;
				; f16 only attribute should flush half float output
				; same as pzero_out above
				%result = fsub half 0xH0200, 0xH0400
				ret half %result
				}

				define half @test_half_fsub_pzero_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_half_fsub_pzero_f32_pzero_out(
				; CHECK-NEXT: ret half 0xH8200
				;
				; f32 only attribute should not flush half float output
				; default ieee mode leaves result as a denormal
				%result = fsub half 0xH0200, 0xH0400
				ret half %result
				}

	define float @test_float_fsub_ieee() #0 {			define float @test_float_fsub_ieee() #0 {
	; CHECK-LABEL: @test_float_fsub_ieee(			; CHECK-LABEL: @test_float_fsub_ieee(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	;			;
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fsub float 0x3800000000000000, 0x3810000000000000			%result = fsub float 0x3800000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}
	Show All 31 Lines
	; CHECK-NEXT: ret float 0xB810000000000000			; CHECK-NEXT: ret float 0xB810000000000000
	;			;
	; denormal operand is treated as zero			; denormal operand is treated as zero
	; normal operand subtracted from zero produces the same operand, negated			; normal operand subtracted from zero produces the same operand, negated
	%result = fsub float 0x3800000000000000, 0x3810000000000000			%result = fsub float 0x3800000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fsub_pzero_f32_out() #5 {			define float @test_float_fsub_pzero_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_float_fsub_pzero_f32_out(			; CHECK-LABEL: @test_float_fsub_pzero_f16_pzero_out(
				; CHECK-NEXT: ret float 0xB800000000000000
				;
				; f16 only attribute should not flush float output
				; default ieee mode leaves result as a denormal
				%result = fsub float 0x3800000000000000, 0x3810000000000000
				ret float %result
				}

				define float @test_float_fsub_pzero_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_float_fsub_pzero_f32_pzero_out(
	; CHECK-NEXT: ret float 0.000000e+00			; CHECK-NEXT: ret float 0.000000e+00
	;			;
	; f32 only attribute should flush float output			; f32 only attribute should flush float output
	; same as pzero_out above			; same as pzero_out above
	%result = fsub float 0x3800000000000000, 0x3810000000000000			%result = fsub float 0x3800000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	Show All 39 Lines
	; CHECK-NEXT: ret double 0x8010000000000000			; CHECK-NEXT: ret double 0x8010000000000000
	;			;
	; denormal operand is treated as zero			; denormal operand is treated as zero
	; normal operand subtracted from zero produces the same operand, negated			; normal operand subtracted from zero produces the same operand, negated
	%result = fsub double 0x0008000000000000, 0x0010000000000000			%result = fsub double 0x0008000000000000, 0x0010000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fsub_f32_ieee() #5 {			define double @test_double_fsub_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_double_fsub_f32_ieee(			; CHECK-LABEL: @test_double_fsub_f16_pzero_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	;			;
	; f32 only attribute should not flush doubles			; f16 only attribute should not flush double output
				; default ieee mode leaves result as a denormal
				%result = fsub double 0x0008000000000000, 0x0010000000000000
				ret double %result
				}

				define double @test_double_fsub_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_double_fsub_f32_pzero_out(
				; CHECK-NEXT: ret double 0x8008000000000000
				;
				; f32 only attribute should not flush double output
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fsub double 0x0008000000000000, 0x0010000000000000			%result = fsub double 0x0008000000000000, 0x0010000000000000
	ret double %result			ret double %result
	}			}

	; ============================================================================ ;			; ============================================================================ ;
	; fmul tests			; fmul tests
	; Output modes are tested by multiplying the smallest normal number by 0.5,			; Output modes are tested by multiplying the smallest normal number by 0.5,
	; producing a denormal result. If denormal outputs should be flushed to zero,			; producing a denormal result. If denormal outputs should be flushed to zero,
	; the result should be zero.			; the result should be zero.
	; Input modes are tested by the reverse operation: taking the denormal and			; Input modes are tested by the reverse operation: taking the denormal and
	; multiplying by 2 to produce a normal number. If denormal inputs should be			; multiplying by 2 to produce a normal number. If denormal inputs should be
	; treated as zero, the result should also be zero.			; treated as zero, the result should also be zero.
	; ============================================================================ ;			; ============================================================================ ;

				define half @test_half_fmul_ieee() #0 {
				; CHECK-LABEL: @test_half_fmul_ieee(
				; CHECK-NEXT: ret half 0xH8200
				;
				; default ieee mode leaves result as a denormal
				%result = fmul half 0xH0400, 0xHB800
				ret half %result
				}

				define half @test_half_fmul_pzero_out() #1 {
				; CHECK-LABEL: @test_half_fmul_pzero_out(
				; CHECK-NEXT: ret half 0xH0000
				;
				; denormal result is flushed to positive zero
				%result = fmul half 0xH0400, 0xHB800
				ret half %result
				}

				define half @test_half_fmul_psign_out() #2 {
				; CHECK-LABEL: @test_half_fmul_psign_out(
				; CHECK-NEXT: ret half 0xH8000
				;
				; denormal result is flushed to sign preserved zero
				%result = fmul half 0xH0400, 0xHB800
				ret half %result
				}

				define half @test_half_fmul_pzero_in() #3 {
				; CHECK-LABEL: @test_half_fmul_pzero_in(
				; CHECK-NEXT: ret half 0xH0000
				;
				; denormal operand is treated as positive zero
				; anything multiplied by zero gives a zero result
				%result = fmul half 0xH8200, 0xH4000
				ret half %result
				}

				define half @test_half_fmul_psign_in() #4 {
				; CHECK-LABEL: @test_half_fmul_psign_in(
				; CHECK-NEXT: ret half 0xH8000
				;
				; denormal operand is treated as signed zero
				; anything multiplied by zero gives a zero result
				%result = fmul half 0xH8200, 0xH4000
				ret half %result
				}

				define half @test_half_fmul_pzero_f16_pzero_out() #9 {
				; CHECK-LABEL: @test_half_fmul_pzero_f16_pzero_out(
				; CHECK-NEXT: ret half 0xH0000
				;
				; f16 only attribute should flush half float output
				; same as pzero_out above
				%result = fmul half 0xH0400, 0xHB800
				ret half %result
				}

				define half @test_half_fmul_pzero_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_half_fmul_pzero_f32_pzero_out(
				; CHECK-NEXT: ret half 0xH8200
				;
				; f32 only attribute should not flush half float output
				; default ieee mode leaves result as a denormal
				%result = fmul half 0xH0400, 0xHB800
				ret half %result
				}

	define float @test_float_fmul_ieee() #0 {			define float @test_float_fmul_ieee() #0 {
	; CHECK-LABEL: @test_float_fmul_ieee(			; CHECK-LABEL: @test_float_fmul_ieee(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	;			;
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fmul float 0x3810000000000000, -5.000000e-01			%result = fmul float 0x3810000000000000, -5.000000e-01
	ret float %result			ret float %result
	}			}
	Show All 31 Lines
	; CHECK-NEXT: ret float -0.000000e+00			; CHECK-NEXT: ret float -0.000000e+00
	;			;
	; denormal operand is treated as signed zero			; denormal operand is treated as signed zero
	; anything multiplied by zero gives a zero result			; anything multiplied by zero gives a zero result
	%result = fmul float 0xB800000000000000, 2.000000e-00			%result = fmul float 0xB800000000000000, 2.000000e-00
	ret float %result			ret float %result
	}			}

	define float @test_float_fmul_pzero_f32_out() #1 {			define float @test_float_fmul_pzero_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_float_fmul_pzero_f32_out(			; CHECK-LABEL: @test_float_fmul_pzero_f16_pzero_out(
				; CHECK-NEXT: ret float 0xB800000000000000
				;
				; f16 only attribute should not flush float output
				; default ieee mode leaves result as a denormal
				%result = fmul float 0x3810000000000000, -5.000000e-01
				ret float %result
				}

				define float @test_float_fmul_pzero_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_float_fmul_pzero_f32_pzero_out(
	; CHECK-NEXT: ret float 0.000000e+00			; CHECK-NEXT: ret float 0.000000e+00
	;			;
	; f32 only attribute should flush float output			; f32 only attribute should flush float output
	; same as pzero_out above			; same as pzero_out above
	%result = fmul float 0x3810000000000000, -5.000000e-01			%result = fmul float 0x3810000000000000, -5.000000e-01
	ret float %result			ret float %result
	}			}

	Show All 39 Lines
	; CHECK-NEXT: ret double -0.000000e+00			; CHECK-NEXT: ret double -0.000000e+00
	;			;
	; denormal operand is treated as signed zero			; denormal operand is treated as signed zero
	; anything multiplied by zero gives a zero result			; anything multiplied by zero gives a zero result
	%result = fmul double 0x8008000000000000, 2.000000e-00			%result = fmul double 0x8008000000000000, 2.000000e-00
	ret double %result			ret double %result
	}			}

	define double @test_double_fmul_f32_ieee() #5 {			define double @test_double_fmul_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_double_fmul_f32_ieee(			; CHECK-LABEL: @test_double_fmul_f16_pzero_out(
				; CHECK-NEXT: ret double 0x8008000000000000
				;
				; f16 only attribute should not flush double output
				; default ieee mode leaves result as a denormal
				%result = fmul double 0x0010000000000000, -5.000000e-01
				ret double %result
				}

				define double @test_double_fmul_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_double_fmul_f32_pzero_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	;			;
	; f32 only attribute should not flush doubles			; f32 only attribute should not flush double output
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fmul double 0x0010000000000000, -5.000000e-01			%result = fmul double 0x0010000000000000, -5.000000e-01
	ret double %result			ret double %result
	}			}

	; ============================================================================ ;			; ============================================================================ ;
	; fdiv tests			; fdiv tests
	; Output modes are tested by dividing the smallest normal number by 2,			; Output modes are tested by dividing the smallest normal number by 2,
	; producing a denormal result. If denormal outputs should be flushed to zero,			; producing a denormal result. If denormal outputs should be flushed to zero,
	; the result should be zero.			; the result should be zero.
	; Input modes are tested by the reverse operation: taking the denormal and			; Input modes are tested by the reverse operation: taking the denormal and
	; dividing by 0.5 to produce a normal number. If denormal inputs should be			; dividing by 0.5 to produce a normal number. If denormal inputs should be
	; treated as zero, the result should also be zero.			; treated as zero, the result should also be zero.
	; ============================================================================ ;			; ============================================================================ ;

				define half @test_half_fdiv_ieee() #0 {
				; CHECK-LABEL: @test_half_fdiv_ieee(
				; CHECK-NEXT: ret half 0xH8200
				;
				; default ieee mode leaves result as a denormal
				%result = fdiv half 0xH0400, 0xHC000
				ret half %result
				}

				define half @test_half_fdiv_pzero_out() #1 {
				; CHECK-LABEL: @test_half_fdiv_pzero_out(
				; CHECK-NEXT: ret half 0xH0000
				;
				; denormal result is flushed to positive zero
				%result = fdiv half 0xH0400, 0xHC000
				ret half %result
				}

				define half @test_half_fdiv_psign_out() #2 {
				; CHECK-LABEL: @test_half_fdiv_psign_out(
				; CHECK-NEXT: ret half 0xH8000
				;
				; denormal result is flushed to sign preserved zero
				%result = fdiv half 0xH0400, 0xHC000
				ret half %result
				}

				define half @test_half_fdiv_pzero_in() #3 {
				; CHECK-LABEL: @test_half_fdiv_pzero_in(
				; CHECK-NEXT: ret half 0xH0000
				;
				; denormal operand is treated as zero
				; zero divided by anything gives a zero result
				%result = fdiv half 0xH8200, 0xH3800
				ret half %result
				}

				define half @test_half_fdiv_psign_in() #4 {
				; CHECK-LABEL: @test_half_fdiv_psign_in(
				; CHECK-NEXT: ret half 0xH8000
				;
				; denormal operand is treated as zero
				; zero divided by anything gives a zero result
				%result = fmul half 0xH8200, 0xH3800
				ret half %result
				}

				define half @test_half_fdiv_pzero_f16_pzero_out() #9 {
				; CHECK-LABEL: @test_half_fdiv_pzero_f16_pzero_out(
				; CHECK-NEXT: ret half 0xH0000
				;
				; f16 only attribute should flush half float output
				; same as pzero_out above
				%result = fdiv half 0xH0400, 0xHC000
				ret half %result
				}

				define half @test_half_fdiv_pzero_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_half_fdiv_pzero_f32_pzero_out(
				; CHECK-NEXT: ret half 0xH8200
				;
				; f32 only attribute should not flush half float output
				; default ieee mode leaves result as a denormal
				%result = fdiv half 0xH0400, 0xHC000
				ret half %result
				}

	define float @test_float_fdiv_ieee() #0 {			define float @test_float_fdiv_ieee() #0 {
	; CHECK-LABEL: @test_float_fdiv_ieee(			; CHECK-LABEL: @test_float_fdiv_ieee(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	;			;
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fdiv float 0x3810000000000000, -2.000000e-00			%result = fdiv float 0x3810000000000000, -2.000000e-00
	ret float %result			ret float %result
	}			}
	Show All 31 Lines
	; CHECK-NEXT: ret float -0.000000e+00			; CHECK-NEXT: ret float -0.000000e+00
	;			;
	; denormal operand is treated as zero			; denormal operand is treated as zero
	; zero divided by anything gives a zero result			; zero divided by anything gives a zero result
	%result = fmul float 0xB800000000000000, 5.000000e-01			%result = fmul float 0xB800000000000000, 5.000000e-01
	ret float %result			ret float %result
	}			}

	define float @test_float_fdiv_pzero_f32_out() #1 {			define float @test_float_fdiv_pzero_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_float_fdiv_pzero_f32_out(			; CHECK-LABEL: @test_float_fdiv_pzero_f16_pzero_out(
				; CHECK-NEXT: ret float 0xB800000000000000
				;
				; f32 only attribute should not flush float output
				; default ieee mode leaves result as a denormal
				%result = fdiv float 0x3810000000000000, -2.000000e-00
				ret float %result
				}

				define float @test_float_fdiv_pzero_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_float_fdiv_pzero_f32_pzero_out(
	; CHECK-NEXT: ret float 0.000000e+00			; CHECK-NEXT: ret float 0.000000e+00
	;			;
	; f32 only attribute should flush float output			; f32 only attribute should flush float output
	; same as pzero_out above			; same as pzero_out above
	%result = fdiv float 0x3810000000000000, -2.000000e-00			%result = fdiv float 0x3810000000000000, -2.000000e-00
	ret float %result			ret float %result
	}			}

	Show All 39 Lines
	; CHECK-NEXT: ret double -0.000000e+00			; CHECK-NEXT: ret double -0.000000e+00
	;			;
	; denormal operand is treated as zero			; denormal operand is treated as zero
	; zero divided by anything gives a zero result			; zero divided by anything gives a zero result
	%result = fdiv double 0x8008000000000000, 5.000000e-01			%result = fdiv double 0x8008000000000000, 5.000000e-01
	ret double %result			ret double %result
	}			}

	define double @test_double_fdiv_f32_ieee() #5 {			define double @test_double_fdiv_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_double_fdiv_f32_ieee(			; CHECK-LABEL: @test_double_fdiv_f16_pzero_out(
				; CHECK-NEXT: ret double 0x8008000000000000
				;
				; f16 only attribute should not flush double output
				; default ieee mode leaves result as a denormal
				%result = fdiv double 0x0010000000000000, -2.000000e-00
				ret double %result
				}

				define double @test_double_fdiv_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_double_fdiv_f32_pzero_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	;			;
	; f32 only attribute should not flush doubles			; f32 only attribute should not flush double output
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fdiv double 0x0010000000000000, -2.000000e-00			%result = fdiv double 0x0010000000000000, -2.000000e-00
	ret double %result			ret double %result
	}			}

	; ============================================================================ ;			; ============================================================================ ;
	; frem tests			; frem tests
	; Output modes are tested by using two small normal numbers to produce a			; Output modes are tested by using two small normal numbers to produce a
	; denormal result. If denormal outputs should be flushed to zero, the result			; denormal result. If denormal outputs should be flushed to zero, the result
	; should be zero.			; should be zero.
	; Input modes are tested by calculating the remainder of a denormal number			; Input modes are tested by calculating the remainder of a denormal number
	; and a larger normal number. If denormal inputs should be treated as zero			; and a larger normal number. If denormal inputs should be treated as zero
	; the result also becomes zero.			; the result also becomes zero.
	; ============================================================================ ;			; ============================================================================ ;

				define half @test_half_frem_ieee_out() #0 {
				; CHECK-LABEL: @test_half_frem_ieee_out(
				; CHECK-NEXT: ret half 0xH8200
				;
				; default ieee mode leaves result as a denormal
				%result = frem half 0xH8600, 0xH0400
				ret half %result
				}

				define half @test_half_frem_pzero_out() #1 {
				; CHECK-LABEL: @test_half_frem_pzero_out(
				; CHECK-NEXT: ret half 0xH0000
				;
				; denormal result is flushed to positive zero
				%result = frem half 0xH8600, 0xH0400
				ret half %result
				}

				define half @test_half_frem_psign_out() #2 {
				; CHECK-LABEL: @test_half_frem_psign_out(
				; CHECK-NEXT: ret half 0xH8000
				;
				; denormal result is flushed to sign preserved zero
				%result = frem half 0xH8600, 0xH0400
				ret half %result
				}

				define half @test_half_frem_ieee_in() #0 {
				; CHECK-LABEL: @test_half_frem_ieee_in(
				; CHECK-NEXT: ret half 0xH0200
				;
				; default ieee mode leaves result same as input
				%result = frem half 0xH0200, 0xH4000
				ret half %result
				}

				define half @test_half_frem_pzero_in() #3 {
				; CHECK-LABEL: @test_half_frem_pzero_in(
				; CHECK-NEXT: ret half 0xH0000
				;
				; denormal operand is treated as zero
				; remainder is now zero
				%result = frem half 0xH0200, 0xH4000
				ret half %result
				}

				define half @test_half_frem_psign_in() #4 {
				; CHECK-LABEL: @test_half_frem_psign_in(
				; CHECK-NEXT: ret half 0xH0000
				;
				; denormal operand is treated as zero
				; remainder is now zero
				%result = frem half 0xH0200, 0xH4000
				ret half %result
				}

				define half @test_half_frem_pzero_f16_pzero_out() #9 {
				; CHECK-LABEL: @test_half_frem_pzero_f16_pzero_out(
				; CHECK-NEXT: ret half 0xH0000
				;
				; f16 only attribute should flush half float output
				; same as pzero_out above
				%result = frem half 0xH8600, 0xH0400
				ret half %result
				}

				define half @test_half_frem_pzero_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_half_frem_pzero_f32_pzero_out(
				; CHECK-NEXT: ret half 0xH8200
				;
				; f32 only attribute should not flush half float output
				; default ieee mode leaves result as a denormal
				%result = frem half 0xH8600, 0xH0400
				ret half %result
				}

	define float @test_float_frem_ieee_out() #0 {			define float @test_float_frem_ieee_out() #0 {
	; CHECK-LABEL: @test_float_frem_ieee_out(			; CHECK-LABEL: @test_float_frem_ieee_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	;			;
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = frem float 0xB818000000000000, 0x3810000000000000			%result = frem float 0xB818000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}
	Show All 40 Lines
	; CHECK-NEXT: ret float 0.000000e+00			; CHECK-NEXT: ret float 0.000000e+00
	;			;
	; denormal operand is treated as zero			; denormal operand is treated as zero
	; remainder is now zero			; remainder is now zero
	%result = frem float 0x3800000000000000, 2.000000e+00			%result = frem float 0x3800000000000000, 2.000000e+00
	ret float %result			ret float %result
	}			}

	define float @test_float_frem_pzero_f32_out() #1 {			define float @test_float_frem_pzero_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_float_frem_pzero_f32_out(			; CHECK-LABEL: @test_float_frem_pzero_f16_pzero_out(
				; CHECK-NEXT: ret float 0xB800000000000000
				;
				; f16 only attribute should not flush float output
				; default ieee mode leaves result as a denormal
				%result = frem float 0xB818000000000000, 0x3810000000000000
				ret float %result
				}

				define float @test_float_frem_pzero_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_float_frem_pzero_f32_pzero_out(
	; CHECK-NEXT: ret float 0.000000e+00			; CHECK-NEXT: ret float 0.000000e+00
	;			;
	; f32 only attribute should flush float output			; f32 only attribute should flush float output
	; same as pzero_out above			; same as pzero_out above
	%result = frem float 0xB818000000000000, 0x3810000000000000			%result = frem float 0xB818000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: ret double 0.000000e+00			; CHECK-NEXT: ret double 0.000000e+00
	;			;
	; denormal operand is treated as zero			; denormal operand is treated as zero
	; remainder is now zero			; remainder is now zero
	%result = frem double 0x0008000000000000, 2.000000e+00			%result = frem double 0x0008000000000000, 2.000000e+00
	ret double %result			ret double %result
	}			}

	define double @test_double_frem_f32_ieee() #5 {			define double @test_double_frem_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_double_frem_f32_ieee(			; CHECK-LABEL: @test_double_frem_f16_pzero_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	;			;
	; f32 only attribute should not flush doubles			; f16 only attribute should not flush double output
				; default ieee mode leaves result as a denormal
				%result = frem double 0x8018000000000000, 0x0010000000000000
				ret double %result
				}

				define double @test_double_frem_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_double_frem_f32_pzero_out(
				; CHECK-NEXT: ret double 0x8008000000000000
				;
				; f32 only attribute should not flush double output
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = frem double 0x8018000000000000, 0x0010000000000000			%result = frem double 0x8018000000000000, 0x0010000000000000
	ret double %result			ret double %result
	}			}

	; ============================================================================ ;			; ============================================================================ ;
	; fneg tests			; fneg tests
	; fneg should NOT be affected by denormal handling mode			; fneg should NOT be affected by denormal handling mode
	; these tests confirm fneg results are unchanged			; these tests confirm fneg results are unchanged
	; ============================================================================ ;			; ============================================================================ ;

				define half @test_half_fneg_ieee() #0 {
				; CHECK-LABEL: @test_half_fneg_ieee(
				; CHECK-NEXT: ret half 0xH8200
				;
				%result = fneg half 0xH0200
				ret half %result
				}

				define half @test_half_fneg_pzero_out() #1 {
				; CHECK-LABEL: @test_half_fneg_pzero_out(
				; CHECK-NEXT: ret half 0xH8200
				;
				%result = fneg half 0xH0200
				ret half %result
				}

				define half @test_half_fneg_psign_out() #2 {
				; CHECK-LABEL: @test_half_fneg_psign_out(
				; CHECK-NEXT: ret half 0xH8200
				;
				%result = fneg half 0xH0200
				ret half %result
				}

				define half @test_half_fneg_pzero_in() #3 {
				; CHECK-LABEL: @test_half_fneg_pzero_in(
				; CHECK-NEXT: ret half 0xH8200
				;
				%result = fneg half 0xH0200
				ret half %result
				}

				define half @test_half_fneg_psign_in() #4 {
				; CHECK-LABEL: @test_half_fneg_psign_in(
				; CHECK-NEXT: ret half 0xH8200
				;
				%result = fneg half 0xH0200
				ret half %result
				}

				define half @test_half_fneg_f16_pzero_out() #9 {
				; CHECK-LABEL: @test_half_fneg_f16_pzero_out(
				; CHECK-NEXT: ret half 0xH8200
				;
				%result = fneg half 0xH0200
				ret half %result
				}

				define half @test_half_fneg_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_half_fneg_f32_pzero_out(
				; CHECK-NEXT: ret half 0xH8200
				;
				%result = fneg half 0xH0200
				ret half %result
				}

	define float @test_float_fneg_ieee() #0 {			define float @test_float_fneg_ieee() #0 {
	; CHECK-LABEL: @test_float_fneg_ieee(			; CHECK-LABEL: @test_float_fneg_ieee(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	;			;
	%result = fneg float 0x3800000000000000			%result = fneg float 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fneg_pzero_out() #0 {			define float @test_float_fneg_pzero_out() #1 {
	; CHECK-LABEL: @test_float_fneg_pzero_out(			; CHECK-LABEL: @test_float_fneg_pzero_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	;			;
	%result = fneg float 0x3800000000000000			%result = fneg float 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fneg_psign_out() #0 {			define float @test_float_fneg_psign_out() #2 {
	; CHECK-LABEL: @test_float_fneg_psign_out(			; CHECK-LABEL: @test_float_fneg_psign_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	;			;
	%result = fneg float 0x3800000000000000			%result = fneg float 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fneg_pzero_in() #0 {			define float @test_float_fneg_pzero_in() #3 {
	; CHECK-LABEL: @test_float_fneg_pzero_in(			; CHECK-LABEL: @test_float_fneg_pzero_in(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	;			;
	%result = fneg float 0x3800000000000000			%result = fneg float 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fneg_psign_in() #0 {			define float @test_float_fneg_psign_in() #4 {
	; CHECK-LABEL: @test_float_fneg_psign_in(			; CHECK-LABEL: @test_float_fneg_psign_in(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	;			;
	%result = fneg float 0x3800000000000000			%result = fneg float 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fneg_pzero_f32_out() #5 {			define float @test_float_fneg_pzero_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_float_fneg_pzero_f32_out(			; CHECK-LABEL: @test_float_fneg_pzero_f16_pzero_out(
				; CHECK-NEXT: ret float 0xB800000000000000
				;
				%result = fneg float 0x3800000000000000
				ret float %result
				}

				define float @test_float_fneg_pzero_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_float_fneg_pzero_f32_pzero_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	;			;
	%result = fneg float 0x3800000000000000			%result = fneg float 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define double @test_double_fneg_ieee() #0 {			define double @test_double_fneg_ieee() #0 {
	; CHECK-LABEL: @test_double_fneg_ieee(			; CHECK-LABEL: @test_double_fneg_ieee(
	Show All 30 Lines
	define double @test_double_fneg_psign_in() #4 {			define double @test_double_fneg_psign_in() #4 {
	; CHECK-LABEL: @test_double_fneg_psign_in(			; CHECK-LABEL: @test_double_fneg_psign_in(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	;			;
	%result = fneg double 0x0008000000000000			%result = fneg double 0x0008000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fneg_f32_ieee() #5 {			define double @test_double_fneg_f16_pzero_out() #9 {
	; CHECK-LABEL: @test_double_fneg_f32_ieee(			; CHECK-LABEL: @test_double_fneg_f16_pzero_out(
				; CHECK-NEXT: ret double 0x8008000000000000
				;
				%result = fneg double 0x0008000000000000
				ret double %result
				}

				define double @test_double_fneg_f32_pzero_out() #5 {
				; CHECK-LABEL: @test_double_fneg_f32_pzero_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	;			;
	%result = fneg double 0x0008000000000000			%result = fneg double 0x0008000000000000
	ret double %result			ret double %result
	}			}

				; ============================================================================ ;
				; fcmp tests
				; ============================================================================ ;

	define i1 @fcmp_double_ieee_in_ieee_out() #0 {			define i1 @fcmp_double_ieee_in_ieee_out() #0 {
	; CHECK-LABEL: @fcmp_double_ieee_in_ieee_out(			; CHECK-LABEL: @fcmp_double_ieee_in_ieee_out(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: ret i1 true			; CHECK-NEXT: ret i1 true
	;			;
	entry:			entry:
	%cmp = fcmp une double 0x0008000000000000, 0x0			%cmp = fcmp une double 0x0008000000000000, 0x0
	ret i1 %cmp			ret i1 %cmp
	▲ Show 20 Lines • Show All 344 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: ret i1 false			; CHECK-NEXT: ret i1 false
	;			;
	entry:			entry:
	%cmp = fcmp uno double 0x0008000000000000, 0x1ff1000000000000			%cmp = fcmp uno double 0x0008000000000000, 0x1ff1000000000000
	ret i1 %cmp			ret i1 %cmp
	}			}

				; check all types don't flush when set to ieee
				define i1 @fcmp_half_ieee_in() #0 {
				; CHECK-LABEL: @fcmp_half_ieee_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 true
				;
				entry:
				%cmp = fcmp une half 0xH0200, 0xH000
				ret i1 %cmp
				}

				define i1 @fcmp_float_ieee_in() #0 {
				; CHECK-LABEL: @fcmp_float_ieee_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 true
				;
				entry:
				%cmp = fcmp une float 0x3800000000000000, 0x0
				ret i1 %cmp
				}

				define i1 @fcmp_double_ieee_in() #0 {
				; CHECK-LABEL: @fcmp_double_ieee_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 true
				;
				entry:
				%cmp = fcmp une double 0x0008000000000000, 0x0
				ret i1 %cmp
				}

				; check all types do flush inputs when set to positive zero
				define i1 @fcmp_half_pzero_in() #3 {
				; CHECK-LABEL: @fcmp_half_pzero_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 false
				;
				entry:
				%cmp = fcmp une half 0xH0200, 0xH000
				ret i1 %cmp
				}

				define i1 @fcmp_float_pzero_in() #3 {
				; CHECK-LABEL: @fcmp_float_pzero_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 false
				;
				entry:
				%cmp = fcmp une float 0x3800000000000000, 0x0
				ret i1 %cmp
				}

				define i1 @fcmp_double_pzero_in() #3 {
				; CHECK-LABEL: @fcmp_double_pzero_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 false
				;
				entry:
				%cmp = fcmp une double 0x0008000000000000, 0x0
				ret i1 %cmp
				}

				; check only f32 flushes when f32 attribute is set
				define i1 @fcmp_half_f32_pzero_in() #10 {
				; CHECK-LABEL: @fcmp_half_f32_pzero_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 true
				;
				entry:
				%cmp = fcmp une half 0xH0200, 0xH000
				ret i1 %cmp
				}

				define i1 @fcmp_float_f32_pzero_in() #10 {
				; CHECK-LABEL: @fcmp_float_f32_pzero_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 false
				;
				entry:
				%cmp = fcmp une float 0x3800000000000000, 0x0
				ret i1 %cmp
				}

				define i1 @fcmp_double_f32_pzero_in() #10 {
				; CHECK-LABEL: @fcmp_double_f32_pzero_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 true
				;
				entry:
				%cmp = fcmp une double 0x0008000000000000, 0x0
				ret i1 %cmp
				}

				; check only f16 flushes when f16 attribute is set
				define i1 @fcmp_half_f16_pzero_in() #11 {
				; CHECK-LABEL: @fcmp_half_f16_pzero_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 false
				;
				entry:
				%cmp = fcmp une half 0xH0200, 0xH000
				ret i1 %cmp
				}

				define i1 @fcmp_float_f16_pzero_in() #11 {
				; CHECK-LABEL: @fcmp_float_f16_pzero_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 true
				;
				entry:
				%cmp = fcmp une float 0x3800000000000000, 0x0
				ret i1 %cmp
				}

				define i1 @fcmp_double_f16_pzero_in() #11 {
				; CHECK-LABEL: @fcmp_double_f16_pzero_in(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: ret i1 true
				;
				entry:
				%cmp = fcmp une double 0x0008000000000000, 0x0
				ret i1 %cmp
				}

	attributes #0 = { nounwind "denormal-fp-math"="ieee,ieee" }			attributes #0 = { nounwind "denormal-fp-math"="ieee,ieee" }
	attributes #1 = { nounwind "denormal-fp-math"="positive-zero,ieee" }			attributes #1 = { nounwind "denormal-fp-math"="positive-zero,ieee" }
	attributes #2 = { nounwind "denormal-fp-math"="preserve-sign,ieee" }			attributes #2 = { nounwind "denormal-fp-math"="preserve-sign,ieee" }
	attributes #3 = { nounwind "denormal-fp-math"="ieee,positive-zero" }			attributes #3 = { nounwind "denormal-fp-math"="ieee,positive-zero" }
	attributes #4 = { nounwind "denormal-fp-math"="ieee,preserve-sign" }			attributes #4 = { nounwind "denormal-fp-math"="ieee,preserve-sign" }
	attributes #5 = { nounwind "denormal-fp-math"="ieee,ieee" "denormal-fp-math-f32"="positive-zero,ieee" }			attributes #5 = { nounwind "denormal-fp-math"="ieee,ieee" "denormal-fp-math-f32"="positive-zero,ieee" }
	attributes #6 = { nounwind "denormal-fp-math"="positive-zero,positive-zero" }			attributes #6 = { nounwind "denormal-fp-math"="positive-zero,positive-zero" }
	attributes #7 = { nounwind "denormal-fp-math"="preserve-sign,preserve-sign" }			attributes #7 = { nounwind "denormal-fp-math"="preserve-sign,preserve-sign" }
	attributes #8 = { nounwind "denormal-fp-math"="ieee,ieee" "denormal-fp-math-f32"="positive-zero,positive-zero" }			attributes #8 = { nounwind "denormal-fp-math"="ieee,ieee" "denormal-fp-math-f32"="positive-zero,positive-zero" }
				attributes #9 = { nounwind "denormal-fp-math"="ieee,ieee" "denormal-fp-math-f16"="positive-zero,ieee" }
				attributes #10 = { nounwind "denormal-fp-math"="ieee,ieee" "denormal-fp-math-f32"="ieee,positive-zero" }
				attributes #11 = { nounwind "denormal-fp-math"="ieee,ieee" "denormal-fp-math-f16"="ieee,positive-zero" }

This is an archive of the discontinued LLVM Phabricator instance.

Add denormal-fp-math attribute for f16AbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 442919

clang/docs/UsersManual.rst

clang/include/clang/Basic/CodeGenOptions.h

clang/include/clang/Driver/Options.td

clang/lib/CodeGen/CGCall.cpp

clang/lib/Driver/ToolChains/Clang.cpp

clang/lib/Frontend/CompilerInvocation.cpp

llvm/docs/LangRef.rst

llvm/include/llvm/CodeGen/CommandFlags.h

llvm/include/llvm/Target/TargetOptions.h

llvm/lib/CodeGen/CommandFlags.cpp

llvm/lib/IR/Function.cpp

llvm/test/Transforms/InstSimplify/constant-fold-fp-denormal.ll

Add denormal-fp-math attribute for f16
AbandonedPublic