This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
docs/
3
UsersManual.rst
-
include/clang/
-
clang/
-
Driver/
-
Options.td
-
Frontend/
-
CodeGenOptions.def
-
lib/
-
CodeGen/
-
CGCall.cpp
-
Driver/ToolChains/
-
ToolChains/
-
Clang.cpp
-
Frontend/
-
CompilerInvocation.cpp
-
test/
-
CodeGen/
3
no-junk-ftrunc.c
-
Driver/
-
fast-math.c

Differential D46135

[Driver, CodeGen] add options to enable/disable an FP cast optimization
ClosedPublic

Authored by spatel on Apr 26 2018, 10:44 AM.

Download Raw Diff

Details

Reviewers

jgorbe
chandlerc
scanon
hans
echristo

Commits

rGd17547656620: [Driver, CodeGen] add options to enable/disable an FP cast optimization
rL331041: [Driver, CodeGen] add options to enable/disable an FP cast optimization
rC331041: [Driver, CodeGen] add options to enable/disable an FP cast optimization

Summary

As discussed in the post-commit thread for:
rL330437 ( http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20180423/545906.html )

We need a way to opt-out of a float-to-int-to-float cast optimization because too much existing code relies on the platform-specific undefined result of those casts when the float-to-int overflows.

I speculatively committed the LLVM changes associated with adding this function attribute, but I'll change the name/implementation if there's a better alternative:
rL330947
rL330950
rL330951

Also as suggested, I changed the LLVM doc to mention the specific sanitizer flag that catches this problem:
rL330958

I tested the end-to-end results on x86 and see the expected outcome: 'roundss' is no longer produced in place of cvttss2si + cvtsi2ss with default optimization.

Diff Detail

Event Timeline

spatel created this revision.Apr 26 2018, 10:44 AM

Herald added a subscriber: mcrosier. · View Herald TranscriptApr 26 2018, 10:44 AM

lebedev.ri added a subscriber: lebedev.ri.Apr 26 2018, 10:49 AM

lebedev.ri added inline comments.

test/CodeGen/no-junk-ftrunc.c
2	For a good measure, i'd add one more `RUN` line to test that it is currently the default. (Yes, i noticed that it is already tested in `test/Driver/fast-math.c`)

spatel added inline comments.Apr 26 2018, 11:12 AM

test/CodeGen/no-junk-ftrunc.c
2	The driver alone is handling the default setting, so it passes this flag to the front-end only when we're going to disable the transform. Ie, the driver eats "-fno-fp-cast-overflow-workaround" and sends nothing in that case to the front-end. So there's not currently any case where the function attribute will be "fp-cast-overflow-workaround=false", but I left that as a possibility in case we decide to lift the limit at a finer granularity (scalar vs. vector etc). I may be misunderstanding the question/suggestion - do we want the front-end to independently have a default setting?

Can't comment much on the patch itself (I'm still not very familiar with the codebase, I'll leave that to the other reviewers), but thanks a lot for responding so quickly! :)

chandlerc added inline comments.Apr 26 2018, 1:30 PM

docs/UsersManual.rst
1260–1265	I would phrase this the other way around (and I think the flag name is already phrased the other way around?): """ Enable a workaround for incorrect code that casts floating point values to integers where the floating point value is not representable in the integer type. This code is incorrect according to the language standard, but this flag will attempt to generate code to cause <insert expected behavior with the flag enabled>. """ Essentially, this should be more like '-fwrapv'. Also, I think the default should be what the specification says. People can explicitly pass this flag if their code is broken in this way.

spatel added inline comments.Apr 26 2018, 1:40 PM

docs/UsersManual.rst
1260–1265	Ah - did I misinterpret the earlier comments? I thought we need to have the work-around 'on' by default as the immediate fix for broken programs?

chandlerc added inline comments.Apr 26 2018, 1:47 PM

docs/UsersManual.rst
1260–1265	Maybe others feel strongly about the default. I'm happy to explicitly pass a flag for our code until we get it fixed here. I would suggest starting by adding the flag to toggle the behavior but not changing the default (which as of now is 'optimize, no workaround') and then we can invert the default if we get enough feedback that this is causing users problems. Either way, I'd word this as suggested above.

I like Chandler's wording. Something like:

"... this flag will attempt to cause <clang to generate code as though the result of such conversions were defined to be an unspecified integer value.>"

Patch updated:

Improve the documentation language - more suggestions welcome!
Change the default setting so the work-around is 'off' (ie, by default assume source is compliant and optimize accordingly).
Remove the 'no' version of the flag. Given the change in the default, this seems more natural to me, and it simplifies the patch/tests...but I might have been too pessimistic before and this is too optimistic? Let me know...

In D46135#1080108, @spatel wrote:

Remove the 'no' version of the flag. Given the change in the default, this seems more natural to me, and it simplifies the patch/tests...but I might have been too pessimistic before and this is too optimistic? Let me know...

Please keep the no- version so that the flag can be toggled by appending to the command line.

test/CodeGen/no-junk-ftrunc.c
2	How about a test that checks the attribute is absent in the default mode then? I think it's useful to have the both sides of the test, however a particular side is spelled.

Patch updated:

Restore the 'no' option to allow toggling.
Add a RUN to the codegen test to show that the function attribute is not appended by default.

LGTM, thanks so much!

This revision is now accepted and ready to land.Apr 26 2018, 3:54 PM

Closed by commit rC331041: [Driver, CodeGen] add options to enable/disable an FP cast optimization (authored by spatel). · Explain WhyApr 27 2018, 7:26 AM

This revision was automatically updated to reflect the committed changes.

Should I add a bullet for this new flag/attribute to the clang release notes, the LLVM release, both? Or do we view this as temporary and not making it to the release?

In D46135#1081165, @spatel wrote:

Should I add a bullet for this new flag/attribute to the clang release notes, the LLVM release, both? Or do we view this as temporary and not making it to the release?

(Not everyone is always using trunk snapshots)
I'd guess it would be present for at least one release, so I would expect to see that in both the clang and llvm release notes.

spatel mentioned this in rL331056: [docs] add -ffp-cast-overflow-workaround to the release notes.Apr 27 2018, 9:25 AM

spatel mentioned this in rC331056: [docs] add -ffp-cast-overflow-workaround to the release notes.

In D46135#1081193, @lebedev.ri wrote:

(Not everyone is always using trunk snapshots)
I'd guess it would be present for at least one release, so I would expect to see that in both the clang and llvm release notes.

Thanks - sounds right to me:
rL331056
rL331059

spatel mentioned this in D46236: [Driver, CodeGen] rename options to disable an FP cast optimization.Apr 29 2018, 8:28 AM

spatel mentioned this in rC331209: [Driver, CodeGen] rename options to disable an FP cast optimization.Apr 30 2018, 11:22 AM

spatel mentioned this in rL331209: [Driver, CodeGen] rename options to disable an FP cast optimization.

spatel mentioned this in D47807: Make uitofp and sitofp defined on overflow..Jun 6 2018, 10:31 AM

Revision Contents

Path

Size

docs/

UsersManual.rst

10 lines

include/

clang/

Driver/

Options.td

3 lines

Frontend/

CodeGenOptions.def

6 lines

lib/

CodeGen/

CGCall.cpp

3 lines

Driver/

ToolChains/

Clang.cpp

4 lines

Frontend/

CompilerInvocation.cpp

2 lines

test/

CodeGen/

no-junk-ftrunc.c

9 lines

Driver/

fast-math.c

14 lines

Diff 144200

docs/UsersManual.rst

Show First 20 Lines • Show All 1,249 Lines • ▼ Show 20 Lines	.. option:: -fdenormal-fp-math=[values]

Select which denormal numbers the code is permitted to require.		Select which denormal numbers the code is permitted to require.

Valid values are: ``ieee``, ``preserve-sign``, and ``positive-zero``,		Valid values are: ``ieee``, ``preserve-sign``, and ``positive-zero``,
which correspond to IEEE 754 denormal numbers, the sign of a		which correspond to IEEE 754 denormal numbers, the sign of a
flushed-to-zero number is preserved in the sign of 0, denormals are		flushed-to-zero number is preserved in the sign of 0, denormals are
flushed to positive zero, respectively.		flushed to positive zero, respectively.

		.. option:: -ffp-cast-overflow-workaround

		Enable a workaround for code that casts floating-point values to
		integers and back to floating-point. If the floating-point value
		is not representable in the intermediate integer type, the code is
		incorrect according to the language standard. This flag will attempt
		to generate code as if the result of an overflowing conversion matches
		the overflowing behavior of a target's native float-to-int conversion
		chandlercUnsubmitted Not Done Reply Inline Actions I would phrase this the other way around (and I think the flag name is already phrased the other way around?): """ Enable a workaround for incorrect code that casts floating point values to integers where the floating point value is not representable in the integer type. This code is incorrect according to the language standard, but this flag will attempt to generate code to cause <insert expected behavior with the flag enabled>. """ Essentially, this should be more like '-fwrapv'. Also, I think the default should be what the specification says. People can explicitly pass this flag if their code is broken in this way. chandlerc: I would phrase this the other way around (and I think the flag name is already phrased the…
		spatelAuthorUnsubmitted Not Done Reply Inline Actions Ah - did I misinterpret the earlier comments? I thought we need to have the work-around 'on' by default as the immediate fix for broken programs? spatel: Ah - did I misinterpret the earlier comments? I thought we need to have the work-around 'on'…
		chandlercUnsubmitted Not Done Reply Inline Actions Maybe others feel strongly about the default. I'm happy to explicitly pass a flag for our code until we get it fixed here. I would suggest starting by adding the flag to toggle the behavior but not changing the default (which as of now is 'optimize, no workaround') and then we can invert the default if we get enough feedback that this is causing users problems. Either way, I'd word this as suggested above. chandlerc: Maybe others feel strongly about the default. I'm happy to explicitly pass a flag for our code…
		instructions.

.. option:: -fwhole-program-vtables		.. option:: -fwhole-program-vtables

Enable whole-program vtable optimizations, such as single-implementation		Enable whole-program vtable optimizations, such as single-implementation
devirtualization and virtual constant propagation, for classes with		devirtualization and virtual constant propagation, for classes with
:doc:`hidden LTO visibility <LTOVisibility>`. Requires ``-flto``.		:doc:`hidden LTO visibility <LTOVisibility>`. Requires ``-flto``.

.. option:: -fno-assume-sane-operator-new		.. option:: -fno-assume-sane-operator-new

▲ Show 20 Lines • Show All 1,715 Lines • Show Last 20 Lines

include/clang/Driver/Options.td

	Show First 20 Lines • Show All 1,023 Lines • ▼ Show 20 Lines
	def : Flag<["-"], "fhonor-infinites">, Alias<fhonor_infinities>;			def : Flag<["-"], "fhonor-infinites">, Alias<fhonor_infinities>;
	def : Flag<["-"], "fno-honor-infinites">, Alias<fno_honor_infinities>;			def : Flag<["-"], "fno-honor-infinites">, Alias<fno_honor_infinities>;
	def ftrapping_math : Flag<["-"], "ftrapping-math">, Group<f_Group>, Flags<[CC1Option]>;			def ftrapping_math : Flag<["-"], "ftrapping-math">, Group<f_Group>, Flags<[CC1Option]>;
	def fno_trapping_math : Flag<["-"], "fno-trapping-math">, Group<f_Group>, Flags<[CC1Option]>;			def fno_trapping_math : Flag<["-"], "fno-trapping-math">, Group<f_Group>, Flags<[CC1Option]>;
	def ffp_contract : Joined<["-"], "ffp-contract=">, Group<f_Group>,			def ffp_contract : Joined<["-"], "ffp-contract=">, Group<f_Group>,
	Flags<[CC1Option]>, HelpText<"Form fused FP ops (e.g. FMAs): fast (everywhere)"			Flags<[CC1Option]>, HelpText<"Form fused FP ops (e.g. FMAs): fast (everywhere)"
	" \| on (according to FP_CONTRACT pragma, default) \| off (never fuse)">, Values<"fast,on,off">;			" \| on (according to FP_CONTRACT pragma, default) \| off (never fuse)">, Values<"fast,on,off">;

				def ffp_cast_overflow_workaround : Flag<["-"],
				"ffp-cast-overflow-workaround">, Group<f_Group>, Flags<[CC1Option]>;

	def ffor_scope : Flag<["-"], "ffor-scope">, Group<f_Group>;			def ffor_scope : Flag<["-"], "ffor-scope">, Group<f_Group>;
	def fno_for_scope : Flag<["-"], "fno-for-scope">, Group<f_Group>;			def fno_for_scope : Flag<["-"], "fno-for-scope">, Group<f_Group>;

	def frewrite_includes : Flag<["-"], "frewrite-includes">, Group<f_Group>,			def frewrite_includes : Flag<["-"], "frewrite-includes">, Group<f_Group>,
	Flags<[CC1Option]>;			Flags<[CC1Option]>;
	def fno_rewrite_includes : Flag<["-"], "fno-rewrite-includes">, Group<f_Group>;			def fno_rewrite_includes : Flag<["-"], "fno-rewrite-includes">, Group<f_Group>;

	def frewrite_imports : Flag<["-"], "frewrite-imports">, Group<f_Group>,			def frewrite_imports : Flag<["-"], "frewrite-imports">, Group<f_Group>,
	▲ Show 20 Lines • Show All 1,870 Lines • Show Last 20 Lines

include/clang/Frontend/CodeGenOptions.def

	Show First 20 Lines • Show All 130 Lines • ▼ Show 20 Lines
	CODEGENOPT(NoInfsFPMath , 1, 0) ///< Assume FP arguments, results not +-Inf.			CODEGENOPT(NoInfsFPMath , 1, 0) ///< Assume FP arguments, results not +-Inf.
	CODEGENOPT(NoSignedZeros , 1, 0) ///< Allow ignoring the signedness of FP zero			CODEGENOPT(NoSignedZeros , 1, 0) ///< Allow ignoring the signedness of FP zero
	CODEGENOPT(Reassociate , 1, 0) ///< Allow reassociation of FP math ops			CODEGENOPT(Reassociate , 1, 0) ///< Allow reassociation of FP math ops
	CODEGENOPT(ReciprocalMath , 1, 0) ///< Allow FP divisions to be reassociated.			CODEGENOPT(ReciprocalMath , 1, 0) ///< Allow FP divisions to be reassociated.
	CODEGENOPT(NoTrappingMath , 1, 0) ///< Set when -fno-trapping-math is enabled.			CODEGENOPT(NoTrappingMath , 1, 0) ///< Set when -fno-trapping-math is enabled.
	CODEGENOPT(NoNaNsFPMath , 1, 0) ///< Assume FP arguments, results not NaN.			CODEGENOPT(NoNaNsFPMath , 1, 0) ///< Assume FP arguments, results not NaN.
	CODEGENOPT(FlushDenorm , 1, 0) ///< Allow FP denorm numbers to be flushed to zero			CODEGENOPT(FlushDenorm , 1, 0) ///< Allow FP denorm numbers to be flushed to zero
	CODEGENOPT(CorrectlyRoundedDivSqrt, 1, 0) ///< -cl-fp32-correctly-rounded-divide-sqrt			CODEGENOPT(CorrectlyRoundedDivSqrt, 1, 0) ///< -cl-fp32-correctly-rounded-divide-sqrt

				/// Disable a float-to-int-to-float cast optimization. This attempts to generate
				/// code as if the result of an overflowing conversion matches the overflowing
				/// behavior of a target's native float-to-int conversion instructions.
				CODEGENOPT(FPCastOverflowWorkaround, 1, 0)

	CODEGENOPT(UniformWGSize , 1, 0) ///< -cl-uniform-work-group-size			CODEGENOPT(UniformWGSize , 1, 0) ///< -cl-uniform-work-group-size
	CODEGENOPT(NoZeroInitializedInBSS , 1, 0) ///< -fno-zero-initialized-in-bss.			CODEGENOPT(NoZeroInitializedInBSS , 1, 0) ///< -fno-zero-initialized-in-bss.
	/// \brief Method of Objective-C dispatch to use.			/// \brief Method of Objective-C dispatch to use.
	ENUM_CODEGENOPT(ObjCDispatchMethod, ObjCDispatchMethodKind, 2, Legacy)			ENUM_CODEGENOPT(ObjCDispatchMethod, ObjCDispatchMethodKind, 2, Legacy)
	CODEGENOPT(OmitLeafFramePointer , 1, 0) ///< Set when -momit-leaf-frame-pointer is			CODEGENOPT(OmitLeafFramePointer , 1, 0) ///< Set when -momit-leaf-frame-pointer is
	///< enabled.			///< enabled.

	VALUE_CODEGENOPT(OptimizationLevel, 2, 0) ///< The -O[0-3] option specified.			VALUE_CODEGENOPT(OptimizationLevel, 2, 0) ///< The -O[0-3] option specified.
	▲ Show 20 Lines • Show All 186 Lines • Show Last 20 Lines

lib/CodeGen/CGCall.cpp

Show First 20 Lines • Show All 1,721 Lines • ▼ Show 20 Lines	FuncAttrs.addAttribute("less-precise-fpmad",
llvm::toStringRef(CodeGenOpts.LessPreciseFPMAD));		llvm::toStringRef(CodeGenOpts.LessPreciseFPMAD));

if (!CodeGenOpts.FPDenormalMode.empty())		if (!CodeGenOpts.FPDenormalMode.empty())
FuncAttrs.addAttribute("denormal-fp-math", CodeGenOpts.FPDenormalMode);		FuncAttrs.addAttribute("denormal-fp-math", CodeGenOpts.FPDenormalMode);

FuncAttrs.addAttribute("no-trapping-math",		FuncAttrs.addAttribute("no-trapping-math",
llvm::toStringRef(CodeGenOpts.NoTrappingMath));		llvm::toStringRef(CodeGenOpts.NoTrappingMath));

		if (CodeGenOpts.FPCastOverflowWorkaround)
		FuncAttrs.addAttribute("fp-cast-overflow-workaround", "true");

// TODO: Are these all needed?		// TODO: Are these all needed?
// unsafe/inf/nan/nsz are handled by instruction-level FastMathFlags.		// unsafe/inf/nan/nsz are handled by instruction-level FastMathFlags.
FuncAttrs.addAttribute("no-infs-fp-math",		FuncAttrs.addAttribute("no-infs-fp-math",
llvm::toStringRef(CodeGenOpts.NoInfsFPMath));		llvm::toStringRef(CodeGenOpts.NoInfsFPMath));
FuncAttrs.addAttribute("no-nans-fp-math",		FuncAttrs.addAttribute("no-nans-fp-math",
llvm::toStringRef(CodeGenOpts.NoNaNsFPMath));		llvm::toStringRef(CodeGenOpts.NoNaNsFPMath));
FuncAttrs.addAttribute("unsafe-fp-math",		FuncAttrs.addAttribute("unsafe-fp-math",
llvm::toStringRef(CodeGenOpts.UnsafeFPMath));		llvm::toStringRef(CodeGenOpts.UnsafeFPMath));
▲ Show 20 Lines • Show All 2,740 Lines • Show Last 20 Lines

lib/Driver/ToolChains/Clang.cpp

Show First 20 Lines • Show All 2,235 Lines • ▼ Show 20 Lines	static void RenderFloatingPointOptions(const ToolChain &TC, const Driver &D,
// Handle __FINITE_MATH_ONLY__ similarly.		// Handle __FINITE_MATH_ONLY__ similarly.
if (!HonorINFs && !HonorNaNs)		if (!HonorINFs && !HonorNaNs)
CmdArgs.push_back("-ffinite-math-only");		CmdArgs.push_back("-ffinite-math-only");

if (const Arg *A = Args.getLastArg(options::OPT_mfpmath_EQ)) {		if (const Arg *A = Args.getLastArg(options::OPT_mfpmath_EQ)) {
CmdArgs.push_back("-mfpmath");		CmdArgs.push_back("-mfpmath");
CmdArgs.push_back(A->getValue());		CmdArgs.push_back(A->getValue());
}		}

		// Disable a codegen optimization for floating-point casts.
		if (Args.hasArg(options::OPT_ffp_cast_overflow_workaround))
		CmdArgs.push_back("-ffp-cast-overflow-workaround");
}		}

static void RenderAnalyzerOptions(const ArgList &Args, ArgStringList &CmdArgs,		static void RenderAnalyzerOptions(const ArgList &Args, ArgStringList &CmdArgs,
const llvm::Triple &Triple,		const llvm::Triple &Triple,
const InputInfo &Input) {		const InputInfo &Input) {
// Enable region store model by default.		// Enable region store model by default.
CmdArgs.push_back("-analyzer-store=region");		CmdArgs.push_back("-analyzer-store=region");

▲ Show 20 Lines • Show All 3,367 Lines • Show Last 20 Lines

lib/Frontend/CompilerInvocation.cpp

Show First 20 Lines • Show All 693 Lines • ▼ Show 20 Lines	static bool ParseCodeGenArgs(CodeGenOptions &Opts, ArgList &Args, InputKind IK,
Opts.FlushDenorm = Args.hasArg(OPT_cl_denorms_are_zero);		Opts.FlushDenorm = Args.hasArg(OPT_cl_denorms_are_zero);
Opts.CorrectlyRoundedDivSqrt =		Opts.CorrectlyRoundedDivSqrt =
Args.hasArg(OPT_cl_fp32_correctly_rounded_divide_sqrt);		Args.hasArg(OPT_cl_fp32_correctly_rounded_divide_sqrt);
Opts.UniformWGSize =		Opts.UniformWGSize =
Args.hasArg(OPT_cl_uniform_work_group_size);		Args.hasArg(OPT_cl_uniform_work_group_size);
Opts.Reciprocals = Args.getAllArgValues(OPT_mrecip_EQ);		Opts.Reciprocals = Args.getAllArgValues(OPT_mrecip_EQ);
Opts.ReciprocalMath = Args.hasArg(OPT_freciprocal_math);		Opts.ReciprocalMath = Args.hasArg(OPT_freciprocal_math);
Opts.NoTrappingMath = Args.hasArg(OPT_fno_trapping_math);		Opts.NoTrappingMath = Args.hasArg(OPT_fno_trapping_math);
		Opts.FPCastOverflowWorkaround = Args.hasArg(OPT_ffp_cast_overflow_workaround);

Opts.NoZeroInitializedInBSS = Args.hasArg(OPT_mno_zero_initialized_in_bss);		Opts.NoZeroInitializedInBSS = Args.hasArg(OPT_mno_zero_initialized_in_bss);
Opts.NumRegisterParameters = getLastArgIntValue(Args, OPT_mregparm, 0, Diags);		Opts.NumRegisterParameters = getLastArgIntValue(Args, OPT_mregparm, 0, Diags);
Opts.NoExecStack = Args.hasArg(OPT_mno_exec_stack);		Opts.NoExecStack = Args.hasArg(OPT_mno_exec_stack);
Opts.FatalWarnings = Args.hasArg(OPT_massembler_fatal_warnings);		Opts.FatalWarnings = Args.hasArg(OPT_massembler_fatal_warnings);
Opts.EnableSegmentedStacks = Args.hasArg(OPT_split_stacks);		Opts.EnableSegmentedStacks = Args.hasArg(OPT_split_stacks);
Opts.RelaxAll = Args.hasArg(OPT_mrelax_all);		Opts.RelaxAll = Args.hasArg(OPT_mrelax_all);
Opts.IncrementalLinkerCompatible =		Opts.IncrementalLinkerCompatible =
Args.hasArg(OPT_mincremental_linker_compatible);		Args.hasArg(OPT_mincremental_linker_compatible);
▲ Show 20 Lines • Show All 2,475 Lines • Show Last 20 Lines

test/CodeGen/no-junk-ftrunc.c

				// RUN: %clang_cc1 -S -ffp-cast-overflow-workaround %s -emit-llvm -o - \| FileCheck %s

				lebedev.riUnsubmitted Not Done Reply Inline Actions For a good measure, i'd add one more `RUN` line to test that it is currently the default. (Yes, i noticed that it is already tested in `test/Driver/fast-math.c`) lebedev.ri: For a good measure, i'd add one more `RUN` line to test that it is currently the default. (Yes…
				spatelAuthorUnsubmitted Not Done Reply Inline Actions The driver alone is handling the default setting, so it passes this flag to the front-end only when we're going to disable the transform. Ie, the driver eats "-fno-fp-cast-overflow-workaround" and sends nothing in that case to the front-end. So there's not currently any case where the function attribute will be "fp-cast-overflow-workaround=false", but I left that as a possibility in case we decide to lift the limit at a finer granularity (scalar vs. vector etc). I may be misunderstanding the question/suggestion - do we want the front-end to independently have a default setting? spatel: The driver alone is handling the default setting, so it passes this flag to the front-end only…
				chandlercUnsubmitted Not Done Reply Inline Actions How about a test that checks the attribute is absent in the default mode then? I think it's useful to have the both sides of the test, however a particular side is spelled. chandlerc: How about a test that checks the attribute is absent in the default mode then? I think it's…
				// CHECK-LABEL: main
				// CHECK: attributes #0 = {{.}}"fp-cast-overflow-workaround"="true"{{.}}

				int main() {
				return 0;
				}

test/Driver/fast-math.c

	Show First 20 Lines • Show All 281 Lines • ▼ Show 20 Lines
	// CHECK-NO-REASSOC-NO_UNSAFE-MATH-NOT: "-mreassociate"			// CHECK-NO-REASSOC-NO_UNSAFE-MATH-NOT: "-mreassociate"
	// CHECK-NO-REASSOC-NO-UNSAFE-MATH-NOT: "-menable-unsafe-fp-math"			// CHECK-NO-REASSOC-NO-UNSAFE-MATH-NOT: "-menable-unsafe-fp-math"
	// CHECK-NO-REASSOC-NO-UNSAFE-MATH: "-o"			// CHECK-NO-REASSOC-NO-UNSAFE-MATH: "-o"


	// RUN: %clang -### -ftrapping-math -fno-trapping-math -c %s 2>&1 \			// RUN: %clang -### -ftrapping-math -fno-trapping-math -c %s 2>&1 \
	// RUN: \| FileCheck --check-prefix=CHECK-NO-TRAPPING-MATH %s			// RUN: \| FileCheck --check-prefix=CHECK-NO-TRAPPING-MATH %s
	// CHECK-NO-TRAPPING-MATH: "-fno-trapping-math"			// CHECK-NO-TRAPPING-MATH: "-fno-trapping-math"

				// This isn't fast-math, but the option is handled in the same place as other FP params.
				// The flag is not passed by default.

				// RUN: %clang -### -ffp-cast-overflow-workaround -c %s 2>&1 \
				// RUN: \| FileCheck --check-prefix=CHECK-FPOV-WORKAROUND %s
				// CHECK-FPOV-WORKAROUND: "-cc1"
				// CHECK-FPOV-WORKAROUND: "-ffp-cast-overflow-workaround"

				// RUN: %clang -### -c %s 2>&1 \
				// RUN: \| FileCheck --check-prefix=CHECK-FPOV-WORKAROUND-DEFAULT %s
				// CHECK-FPOV-WORKAROUND-DEFAULT: "-cc1"
				// CHECK-FPOV-WORKAROUND-DEFAULT-NOT: "-ffp-cast-overflow-workaround"

This is an archive of the discontinued LLVM Phabricator instance.

[Driver, CodeGen] add options to enable/disable an FP cast optimizationClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 144200

docs/UsersManual.rst

include/clang/Driver/Options.td

include/clang/Frontend/CodeGenOptions.def

lib/CodeGen/CGCall.cpp

lib/Driver/ToolChains/Clang.cpp

lib/Frontend/CompilerInvocation.cpp

test/CodeGen/no-junk-ftrunc.c

test/Driver/fast-math.c

[Driver, CodeGen] add options to enable/disable an FP cast optimization
ClosedPublic