This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
docs/
13/23
UsersManual.rst
-
include/clang/
-
clang/
-
Basic/
2/3
CodeGenOptions.def
3/6
LangOptions.h
-
Driver/
2/3
Options.td
-
lib/
-
CodeGen/
-
BackendUtil.cpp
1/1
CodeGenFunction.h
2/11
CodeGenFunction.cpp
-
Driver/ToolChains/
-
ToolChains/
8/14
Clang.cpp
-
Frontend/
1/3
CompilerInvocation.cpp
-
test/
-
CodeGen/
2
fpconstrained.c
-
Driver/
1
clang_f_opts.c
-
fast-math.c
-
llvm/include/llvm/Target/
-
include/
-
llvm/
-
Target/
-
TargetOptions.h

Differential D62731

Add support for options -frounding-math, -ftrapping-math, -ffp-model=, and -ffp-exception-behavior=, : Specify floating point behavior
ClosedPublic

Authored by mibintc on May 31 2019, 7:01 AM.

Download Raw Diff

Details

Reviewers

chandlerc
rsmith
rjmccall
kpn
erichkeane

Summary

Intel would like to contribute a patch to implement support for these Intel- and Microsoft -fp options. This message is to describe the options and request feedback from the community.
-frounding-math (supported by gcc and ICC)
-fp-model=[precise|strict|fast] and -fp-exception-behavior=[ignore|maytrap|strict]

This contribution dovetails with the llvm patch "Teach the IRBuilder about constrained fadd and friends". The motivation for providing these is that having umbrella options such as -fp-model= to control most basic FP options is better and easier to understand for users.

The option settings -fp-model=[precise|strict|fast] are supported by both ICC and CL. The CL and ICC -fp-model option is documented on these pages:

https://docs.microsoft.com/en-us/cpp/build/reference/fp-specify-floating-point-behavior?view=vs-2019
https://software.intel.com/en-us/cpp-compiler-developer-guide-and-reference-fp-model-fp

Currently, clang's default behavior corresponds to -fp-model=precise. Clang/llvm support for -fp-model=strict and
-fp-exception-behavior= was developed in the D53157 patch, and there is current llvm support for the fast settings by using the fast math flags llvm::FastMathFlags. Note: the clang-cl wrapper to support Microsoft options has simplified support for these options by mapping /fp-model=except to ftrapping-math, fp-mdel=fast to ffast-math, fp-model=precise and fp-model=strict to fno-fast-math (see clang/Driver/CLCompatOptions.td).

These are the settings for -fp-model=

precise - Disables optimizations that are not value-safe on floating-point data, although FP contraction is enabled.
strict - Enables precise and except, disables contractions (FMA), and enables pragma stdc fenv_access.   [Note: fenv_access not currently supported in clang]
fast - Equivalent to -ffast-math

What follows here is Microsoft /fp documentation from the msdn site. It's copied here solely for the purposes of saving
information for the future in case the information is removed from the msdn site:

/fp (Specify floating-point behavior)

Specifies how the compiler treats floating-point expressions, optimizations, and exceptions. The /fp options specify whether the generated code allows floating-point environment changes to the rounding mode, exception masks, and subnormal behavior, and whether floating-point status checks return current, accurate results. It controls whether the compiler generates code that maintains source operation and expression ordering and conforms to the standard for NaN propagation, or if it instead generates more efficient code that may reorder or combine operations and use simplifying algebraic transformations that are not allowed by the standard.

Syntax

/fp:[precise | strict | fast | except[-]]

Arguments

precise

By default, the compiler uses /fp:precise behavior.

Under /fp:precise the compiler preserves the source expression ordering and rounding properties of floating-point code when it generates and optimizes object code for the target machine. The compiler rounds to source code precision at four specific points during expression evaluation: at assignments, at typecasts, when a floating-point argument is passed to a function call, and when a floating-point value is returned from a function call. Intermediate computations may be performed at machine precision. Typecasts can be used to explicitly round intermediate computations.

The compiler does not perform algebraic transformations on floating-point expressions, such as reassociation or distribution, unless the transformation is guaranteed to produce a bitwise identical result.
Expressions that involve special values (NaN, +infinity, -infinity, -0.0) are processed according to IEEE-754 specifications. For example, x != x evaluates to true if x is NaN. Floating-point *contractions*, that is, machine instructions that combine floating-point operations, may be generated under /fp:precise.

The compiler generates code intended to run in the [default floating-point environment](#the-default-floating-point-environment) and assumes that the floating-point environment is not accessed or modified at runtime. That is, it assumes that the code does not unmask floating-point exceptions, read or write floating-point status registers, or change rounding modes.

If your floating-point code does not depend on the order of operations and expressions in your floating-point statements (for example, if you don't care whether a * b + a * c is computed as (b + c) * a or 2 * a as a + a), consider the [/fp:fast](#fast) option, which can produce faster, more efficient code. If your code both depends on the order of operations and expressions, and accesses or alters the floating-point environment (for example, to change rounding modes or to trap floating-point exceptions), use [/fp:strict](#strict).

strict

/fp:strict has behavior similar to /fp:precise, that is, the compiler preserves the source ordering and rounding properties of floating-point code when it generates and optimizes object code for the target machine, and observes the standard when handling special values. In addition, the program may safely access or modify the floating-point environment at runtime.

Under /fp:strict, the compiler generates code that allows the program to safely unmask floating-point exceptions, read or write floating-point status registers, or change rounding modes. It rounds to source code precision at four specific points during expression evaluation: at assignments, at typecasts, when a floating-point argument is passed to a function call, and when a floating-point value is returned from a function call. Intermediate computations may be performed at machine precision. Typecasts can be used to explicitly round intermediate computations. The compiler does not perform algebraic transformations on floating-point expressions, such as reassociation or distribution, unless the transformation is guaranteed to produce a bitwise identical result. Expressions that involve special values (NaN, +infinity, -infinity, -0.0) are processed according to IEEE-754 specifications. For example, x != x evaluates to true if x is NaN. Floating-point contractions are not generated under /fp:strict.

/fp:strict is computationally more expensive than /fp:precise because the compiler must insert additional instructions to trap exceptions and allow programs to access or modify the floating-point environment at runtime. If your code doesn’t use this capability, but requires source code ordering and rounding, or relies on special values, use /fp:precise. Otherwise, consider using /fp:fast, which can produce faster and smaller code.

fast

The /fp:fast option allows the compiler to reorder, combine, or simplify floating-point operations to optimize floating-point code for speed and space. The compiler may omit rounding at assignment statements, typecasts, or function calls. It may reorder operations or perform algebraic transforms, for example, by use of associative and distributive laws, even if such transformations result in observably different rounding behavior. Because of this enhanced optimization, the result of some floating-point computations may differ from those produced by other /fp options. Special values (NaN, +infinity, -infinity, -0.0) may not be propagated or behave strictly according to the IEEE-754 standard. Floating-point contractions may be generated under /fp:fast. The compiler is still bound by the underlying architecture under /fp:fast, and additional optimizations may be available through use of the [/arch](arch-minimum-cpu-architecture.md) option.

Under /fp:fast, the compiler generates code intended to run in the default floating-point environment and assumes that the floating-point environment isn’t accessed or modified at runtime. That is, it assumes that the code does not unmask floating-point exceptions, read or write floating-point status registers, or change rounding modes.

/fp:fast is intended for programs that do not require strict source code ordering and rounding of floating-point expressions, and do not rely on the standard rules for handling special values such as NaN. If your floating-point code requires preservation of source code ordering and rounding, or relies on standard behavior of special values, use [/fp:precise](#precise). If your code accesses or modifies the floating-point environment to change rounding modes, unmask floating-point exceptions, or check floating-point status, use [/fp:strict](#strict).

except

The /fp:except option generates code to ensures that any unmasked floating-point exceptions are raised at the exact point at which they occur, and that no additional floating-point exceptions are raised. By default, the /fp:strict option enables /fp:except, and /fp:precise does not. The /fp:except option is not compatible with /fp:fast. The option can be explicitly disabled by us of /fp:except-.

Note that /fp:except does not enable any floating-point exceptions by itself, but it is required for programs to enable floating-point exceptions. See [_controlfp](../../c-runtime-library/reference/control87-controlfp-control87-2.md) for information on how to enable floating-point exceptions.

Remarks

Multiple /fp options can be specified in the same compiler command line. Only one of /fp:strict, /fp:fast, and /fp:precise options can be in effect at a time. If more than one of these options is specified on the command line, the later option takes precedence and the compiler generates a warning.

Diff Detail

Repository: rL LLVM

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

I think this is a step in the right direction, thank you. I'd like @scanon to weigh in on the evolving design here.

clang/docs/UsersManual.rst
1510	What you should document here are the semantics and how the option interacts with other options, not how code gets translated into LLVM. I'm not sure what the FIXME question here is; are you asking whether providing `-frounding-math` should imply an FP model? The notes about each of the options should probably be structured into a bullet list.
1532	This is basically incomprehensible. :) I don't know if the problem is the behavior or just how it's being described, but I have no idea what "conflict" means — does it mean the option gets overridden, ignored, or causes an error? I think what you're trying to say is: Basic FP behavior can be broken down along two dimensions: the FP strictness model and the FP exceptions model. There are many existing options for controlling FP behavior. Some of these existing options are equivalent to setting one (or both?) of these dimensions. These options should generally be treated as synonyms for the purposes of deciding the ultimate setting; for example, `-ffp-model=fast -fno-fast-math` should basically leave the setting in its default state (right?). Other existing options only make sense in combination with certain basic models. For example, `-ffp-contract=fast` (note the spelling) is only allowed when using the fast FP model (right?). As a specific note, you break out the options into a list below; the entry for `fast` is the place to add things like "Equivalent to `-ffast-math`, including defining `__FAST_MATH__`)".

mibintc marked 2 inline comments as done.Sep 9 2019, 1:28 PM

mibintc added inline comments.

clang/docs/UsersManual.rst
1510	I'll remove the FIXME and assert that frounding-math uses dynamic-rounding and strict exception behavior. This will make frounding-math synonymous with fp-model=strict. I'll reformat to put notes into bullet lists.
1532	Conflict was a poor choice of words. I meant to say that the umbrella options like fp-model=strict overlap with some of the other floating-point settings, in that case the rightmost option takes precedence and overrides the setting. I want the new options to behave in the same way that other clang options: rightmost option has precedence.

Hmm, you know, there are enough different FP options that I think we should probably split them all out into their own section in the manual instead of just listing them under "code generation". That will also give us an obvious place to describe the basic model, i.e. all the stuff about it mostly coming down to different strictness and exception models. Could you prepare a patch that *just* does that reorganization without adding any new features, and then we can add the new options on top of that?

In D62731#1663748, @rjmccall wrote:

Hmm, you know, there are enough different FP options that I think we should probably split them all out into their own section in the manual instead of just listing them under "code generation". That will also give us an obvious place to describe the basic model, i.e. all the stuff about it mostly coming down to different strictness and exception models. Could you prepare a patch that *just* does that reorganization without adding any new features, and then we can add the new options on top of that?

Yes I'll do that

In D62731#1663748, @rjmccall wrote:

Hmm, you know, there are enough different FP options that I think we should probably split them all out into their own section in the manual instead of just listing them under "code generation". That will also give us an obvious place to describe the basic model, i.e. all the stuff about it mostly coming down to different strictness and exception models. Could you prepare a patch that *just* does that reorganization without adding any new features, and then we can add the new options on top of that?

I uploaded a patch to move floating point options to a new documentation section here, https://reviews.llvm.org/D67517

In the previous review, @rjmccall asked me to redo the existing floating point option documentation before submitting this patch. I got the floating point documentation update committed, and I've worked on this patch more to get the floating point "render options" checking implemented. This patch needs more test cases, and there might be a bug or 2 in the "render options" checking. I wanted to show you this work especially to get your reaction to the changes to the floating point options

This patch adds support for frounding-math and ftrapping-math and new options fp-model= and fp-exception-behavior=; fp-model is an "umbrella" option.

Thank you, this looks very clean now.

clang/docs/UsersManual.rst
1318	"represent the corresponding IEEE rounding rules"
1330	"provided by other, single-purpose floating point options."
1341	That's not typical driver behavior; why this choice?

simoll added a subscriber: simoll.Oct 8 2019, 10:52 AM

I made a couple wording changes suggested by @rjmccall

mibintc marked 2 inline comments as done.Oct 8 2019, 11:26 AM

mibintc added inline comments.

clang/docs/UsersManual.rst
1341	The rationale for the warnings is that the floating point options are sufficiently complicated that it makes sense to warn the uses that one of the later options supplied on the command line is undoing a choice made earlier. It's not obvious that e.g. the setting for fassociative-math is also controlled by -fp-model=strict

mibintc marked 2 inline comments as done.Oct 8 2019, 11:35 AM

mibintc added inline comments.

clang/include/clang/Driver/Options.td
927	The ffp-model= option is just a Driver option, it is rewritten into combinations of lower level options like ffp-exception-behavior and frounding-math: it's not a cc1 option.
clang/lib/Driver/ToolChains/Clang.cpp
2326	By default, floating point exceptions are masked. Previously this was set to true, but the value wasn't used. This patch implements support for trapping-math

clean up some dead code

I added a test case to show the warning diagnostics when options conflicting with fp-model are provided. I fixed a couple bugs in RenderFloatingPointOptions when issueing diagnostics. still owe a test case showing how the fp-model, rounding, and trapping options are rendered by the Driver for cc1

rjmccall added inline comments.Oct 8 2019, 8:28 PM

clang/docs/UsersManual.rst
1330	I don't know why you keep including "clang" as a modifier here; this is the clang documentation, and all of these options are clang options no matter where they might have been borrowed from.
1341	Okay. Well, it's a new option, so new behavior is alright, but if you're worried about the collisions having arbitrary effects that you'll have to maintain compatibility with, you should consider making it an error instead, because a warning still means it's permitted.

I added a new test case fp-model.c to test RenderFloatingPointOptions, I also fixed a few issues that I spotted while working through this test case. I responded to couple documentation comments from @rjmccall I still owe a more deluxe version of the test fp-constrained.c to be sure all the option values come through as expected

mibintc marked 5 inline comments as done.Oct 9 2019, 1:18 PM

mibintc added inline comments.

clang/docs/UsersManual.rst
1330	thanks for explicitly pointing out use of 'clang', i fixed it
1341	@andrew.w.kaylor What do you think about making the diagnostics error vs. warning?
clang/include/clang/Basic/LangOptions.h
187	Currently there's no way to get at any of these values besides ToNearest and Dynamic, but I put all the supported values here to support future work
203	-fno-trapping-math implemented by selecting -ffp-exception-behavior=ignore and -ftrapping-math is implemented by selecting -ffp-exception-behavior=strict. What do you think about making ftrapping-math a Driver only option, so that Driver converts the values like this. Otherwise let's make fp-exception-behavior take precedence, in llvm, over ftrapping-math (trapping math is t/f but exception behavior, in the llvm Constrained Floating Point Intrinsics, can take 3 values)

I looked over the codegen testcase fpconstrained.c and it looks pretty good, so i think this is ready for your review comments. I'll be off the grid for a couple days but looking forward to receiving your feedback.

I inserted a couple of inline remarks in the code review to highlight some areas and questions.

rjmccall added inline comments.Oct 16 2019, 3:22 PM

clang/include/clang/Basic/LangOptions.h
203	If your new option subsumes existing ones, I think making it the frontend option is sensible.

Ping. Hoping for code review so I can move this forward. Affirmative or negative, please let me know. Thank you! --Melanie

rjmccall added inline comments.Oct 18 2019, 11:26 AM

clang/docs/UsersManual.rst
1318	A few points about this documentation that occurred to me since the last time I looked at it: It's weird to talk about LLVM here, since this is the Clang documentation. Clang's behavior is not specified in terms of the IR it generates; it's specified in terms of the formal behavior of the source code. Therefore this documentation should talk about things using concepts from an appropriate language standard whenever possible; in this case, C99 works. It's weird to bring up all these different rounding modes when this option doesn't actually let you do anything with them. If you want to talk about rounding modes in general that's fine as a way of informing the programmer, but we shouldn't give them information they can't use. I don't think `-fno-rounding-math` is actually equivalent to forcing the use of the `tonearest` rounding mode; I think it assumes that the rounding mode is set to `tonearest`. (Or am I wrong and this is actually guaranteed by ABI?) I don't think we want to define `-frounding-math` as exactly equivalent to `-ffp-model=strict`. That might be a convenient implementation for now, but it seems to me that `-frounding-math` still allows some optimizations that `-ffp-model=strict` wouldn't. With that in mind, I'd suggest something like this: Force floating-point operations to honor the dynamically-set rounding mode by default. The result of a floating-point operation often cannot be exactly represented in the result type and therefore must be rounded. IEEE 754 describes different rounding modes that control how to perform this rounding, not all of which are supported by all implementations. C provides interfaces (`fesetround `and` `fesetenv``) for dynamically controlling the rounding mode, and while it also recommends certain conventions for changing the rounding mode, these conventions are not typically enforced in the ABI. Since the rounding mode changes the numerical result of operations, the compiler must understand something about it in order to optimize floating point operations. Note that floating-point operations performed as part of constant initialization are formally performed prior to the start of the program and are therefore not subject to the current rounding mode. This includes the initialization of global variables and local `static `variables. Floating-point operations in these contexts will be rounded using` `FE_TONEAREST``. The option `-fno-rounding-math `allows the compiler to assume that the rounding mode is set to` `FE_TONEAREST``. This is the default. The option `-frounding-math` forces the compiler to honor the dynamically-set rounding mode. This prevents optimizations which might affect results if the rounding mode changes or is different from the default; for example, it prevents floating-point operations from being reordered across most calls and prevents constant-folding when the result is not exactly representable.

mibintc added inline comments.Oct 20 2019, 6:29 PM

clang/docs/UsersManual.rst
1318	Thank you, I will work on another patch

I adopted the language that @rjmccall recommended for documenting frounding-math., also adding a sentence to describe effects on exception behavior control.

Is the exception-strictness of -frounding-math actually considered to be specified behavior, or is it just a consequence of the current implementation? There are definitely some optimizations that we can't do under strict FP exceptions that we can still do in principle while respecting a dynamic FP rounding mode; for example, the rounding mode can only be changed by a call (or inline assembly), so you can still reorder FP operations around "lesser" side effects, like stores. We can document it even if it's not required behavior, but we should be clear about what it is.

My suggested wording started with a sentence briefly summarizing what the option did that I think you accidentally dropped.

In response to comments from @rjmccall I inserted into the UsersManual the one-line summary of frounding-math that had been omitted and changed the semantics of frounding-math to not also set exception-behavior to strict

In D62731#1724290, @rjmccall wrote:

Is the exception-strictness of -frounding-math actually considered to be specified behavior, or is it just a consequence of the current implementation? There are definitely some optimizations that we can't do under strict FP exceptions that we can still do in principle while respecting a dynamic FP rounding mode; for example, the rounding mode can only be changed by a call (or inline assembly), so you can still reorder FP operations around "lesser" side effects, like stores. We can document it even if it's not required behavior, but we should be clear about what it is.

I had thought that it was intended behavior, but I re-checked my notes and realize I was wrong about that. So I've changed the document and the driver, and updated the test. Thanks again for your careful reading.

My suggested wording started with a sentence briefly summarizing what the option did that I think you accidentally dropped.

Yes, it's there now.

Thanks. A few things about the functionality parts of the patch now.

clang/include/clang/Basic/CodeGenOptions.def
237	Why do we need both a code-gen option and a language option?
clang/include/clang/Basic/LangOptions.h
366	Everything here is a "setting", and in the context of this type they're all FP. Please name these methods something like `getRoundingMode()`. Does this structure really need to exist as opposed to tracking the dimensions separately? Don't we already track some of this somewhere? We should subsume that state into these values rather than tracking them separately.
clang/lib/CodeGen/CodeGenFunction.cpp
108	Code style: please use `()` instead of `(void)`, and please place open-braces on the same line as the declaration.
clang/lib/CodeGen/CodeGenFunction.h
4152	Don't use `(void)`, please.

mibintc added inline comments.Oct 29 2019, 12:54 PM

clang/include/clang/Basic/CodeGenOptions.def
237	The main reason i added it to LangOptions.h is because I saw the FPContract support in there and I thought I'd get on that bandwagon. My ultimate goal, after committing the command line options, is to add support for controlling rounding mode and exception behavior with pragma's embedded in the functions, similar to https://reviews.llvm.org/D69272. There's a patch here that I like, to add rounding-mode and exception-behavior to FPOptions https://reviews.llvm.org/D65994, but it hasn't been committed yet.

I followed up on some code review remarks from @rjmccall. I dropped the CODEGEN option and fixed some code formatting. I changed the spelling of the enumeration values for RoundingMode and ExceptionMode to match those proposed in https://reviews.llvm.org/D65994

mibintc marked 5 inline comments as done.Oct 31 2019, 8:27 AM

mibintc added inline comments.

clang/include/clang/Basic/CodeGenOptions.def
237	I dropped the code-gen option.
clang/include/clang/Basic/LangOptions.h
366	I fixed the spelling, I also dropped the structure and used the ENUM_OPT macro instead of writing out the setter and getter. Look OK now?

Yes, thanks, looks a lot better. Just a few tweaks now.

clang/include/clang/Basic/LangOptions.h
345	Spurious change.
clang/include/clang/Driver/Options.td
1148	It looks like both of these can now be written with `BooleanFFlag`.
clang/lib/CodeGen/CodeGenFunction.cpp
148	Please make functions that do these translations, and please make them use exhaustive switches with `llvm_unreachable` at the end.
clang/test/Driver/clang_f_opts.c
323	Looks like the intent of this test is that you pull this to the lines above, to test that we don't emit an error on it. You should also test `-ffp-model`.

mibintc marked an inline comment as done.Oct 31 2019, 12:12 PM

mibintc added inline comments.

clang/include/clang/Driver/Options.td
1148	BooleanFFlag doesn't work, there's a FIXME message saying that prefixes don't work, currently they are only being used for unimplemented options. llvm/clang/lib/Driver/ToolChains/Clang.cpp:2301:17: error: ‘OPT_frounding_math’ is not a member of ‘clang::driver::options’ optID = options::OPT_frounding_math; ^

Respond to recent code review from @rjmccall ; I modified the test cases and added functions for translating between the LangOptions enumeration and llvm enumeration for rounding-mode and exception-behavior. I wasn't able to use BooleanFFlag because at the moment that is only usable for unsupported options.

mibintc added inline comments.Nov 1 2019, 9:52 AM

clang/lib/CodeGen/CodeGenFunction.cpp
133	I added these 2 functions, is this what you have in mind or do you want me to write them differently?

rjmccall added inline comments.Nov 1 2019, 1:15 PM

clang/lib/CodeGen/CodeGenFunction.cpp

133

Slightly differently, yes, please.

static llvm::ConstrainedFPIntrinsic::ExceptionBehavior getConstrainedExceptionBehavior(LangOptions;:FPExceptionModeKind kind) {
  switch (kind) {
  case LangOptions::FPE_Ignore:
    return llvm::ConstrainedFPIntrinsic::ebIgnore;
  // ...rest of cases here...
  // no default: should be exhaustive over the enum
  }
  llvm_unreachable("bad kind");
}

Recoded ToConstrainedRoundingMD and ToConstrainedExceptionMD as requested by @rjmccall

rjmccall added inline comments.Nov 1 2019, 1:50 PM

clang/lib/CodeGen/CodeGenFunction.cpp
137	Sorry for dragging this out, but is there a reason these need to be member functions on `CodeGenFunction` rather than just `static` functions in this file?

Made a couple functions static per @rjmccall request

mibintc marked an inline comment as done.Nov 1 2019, 2:04 PM

mibintc added inline comments.

clang/lib/CodeGen/CodeGenFunction.cpp
133	sorry i missed that detail (static) the first time around

LGTM; thanks for your patience during all the rounds of review.

This revision is now accepted and ready to land.Nov 4 2019, 9:40 AM

When doing final testing before commit, I used the Debug build with assertions enabled and found that a couple tests failed with assertion in Clang.cpp. I made some modifications there so the assert wouldn't be triggered. Also I fixed some spelling errors in the comments. (option name misspelled: missing the first f)

when "make check-all" with Debug build, I see lit test failure llvm/test/Object/macho-invalid.test; I think that fail is not related to my change.

@rjmccall thanks for all your help developing this patch

Looks okay to me.

Don't know why the commit id didn't get linked when I pushed the change. Here's the closure info:

commit af57dbf12e54f3a8ff48534bf1078f4de104c1cd
Author: Melanie Blower <melanie.blower@intel.com>
Date: Tue Nov 5 13:41:21 2019 -0800

Add support for options -frounding-math, ftrapping-math, -ffp-model=, and -ffp-exception-behavior=

Hi,

I found a clang crash that seems to be caused by this patch. Here's a reduced test case:

template <class>
class a {
 public:
  ~a();
  void b();
};

template <class c>
a<c>::~a() try {
  b();
} catch (...) {
}

class d {
 public:
  d(const char *, int);
  a<int> e;
}

d("", 1);

Building it with clang -c -frounding-math results in the following crash:

Stack dump:
0.      Program arguments: /usr/local/google/home/jgorbe/code/llvm-build/bin/clang-10 -cc1 -triple x86_64-unknown-linux-gnu -emit-obj -mrelax-all -disable-free -main-fi
le-name repro.cc -mrelocation-model static -mthread-model posix -mframe-pointer=all -fmath-errno -frounding-math -masm-verbose -mconstructor-aliases -munwind-tables -fu
se-init-array -target-cpu x86-64 -dwarf-column-info -debugger-tuning=gdb -resource-dir /usr/local/google/home/jgorbe/code/llvm-build/lib/clang/10.0.0 -internal-isystem 
/usr/lib/gcc/x86_64-linux-gnu/8/../../../../include/c++/8 -internal-isystem /usr/lib/gcc/x86_64-linux-gnu/8/../../../../include/x86_64-linux-gnu/c++/8 -internal-isystem
 /usr/lib/gcc/x86_64-linux-gnu/8/../../../../include/x86_64-linux-gnu/c++/8 -internal-isystem /usr/lib/gcc/x86_64-linux-gnu/8/../../../../include/c++/8/backward -intern
al-isystem /usr/local/include -internal-isystem /usr/local/google/home/jgorbe/code/llvm-build/lib/clang/10.0.0/include -internal-externc-isystem /usr/include/x86_64-lin
ux-gnu -internal-externc-isystem /include -internal-externc-isystem /usr/include -fdeprecated-macro -fdebug-compilation-dir /usr/local/google/home/jgorbe/repro4 -ferror
-limit 19 -fmessage-length 0 -fgnuc-version=4.2.1 -fobjc-runtime=gcc -fcxx-exceptions -fexceptions -fdiagnostics-show-option -fcolor-diagnostics -faddrsig -o /tmp/repro
-c9cae0.o -x c++ repro.cc 
1.      <eof> parser at end of file
2.      Per-file LLVM IR generation
3.      repro.cc:4:3: Generating code for declaration 'a<int>::~a'
 #0 0x0000000007fb2b27 llvm::sys::PrintStackTrace(llvm::raw_ostream&) /usr/local/google/home/jgorbe/code/llvm/llvm/lib/Support/Unix/Signals.inc:532:11 
 #1 0x0000000007fb2cc9 PrintStackTraceSignalHandler(void*) /usr/local/google/home/jgorbe/code/llvm/llvm/lib/Support/Unix/Signals.inc:593:1             
 #2 0x0000000007fb160b llvm::sys::RunSignalHandlers() /usr/local/google/home/jgorbe/code/llvm/llvm/lib/Support/Signals.cpp:67:5                        
 #3 0x0000000007fb3415 SignalHandler(int) /usr/local/google/home/jgorbe/code/llvm/llvm/lib/Support/Unix/Signals.inc:384:1                        
 #4 0x00007f6789ec03a0 __restore_rt (/lib/x86_64-linux-gnu/libpthread.so.0+0x123a0)                                                                                     
 #5 0x000000000739af45 llvm::AttributeList::hasFnAttribute(llvm::Attribute::AttrKind) const /usr/local/google/home/jgorbe/code/llvm/llvm/lib/IR/Attributes.cpp:1315:10
 #6 0x00000000051a1964 llvm::Function::hasFnAttribute(llvm::Attribute::AttrKind) const /usr/local/google/home/jgorbe/code/llvm/llvm/include/llvm/IR/Function.h:324:5    
 #7 0x00000000052421a1 llvm::IRBuilderBase::setConstrainedFPFunctionAttr() /usr/local/google/home/jgorbe/code/llvm/llvm/include/llvm/IR/IRBuilder.h:262:9
 #8 0x0000000005241b60 llvm::IRBuilderBase::setConstrainedFPCallAttr(llvm::CallInst*) /usr/local/google/home/jgorbe/code/llvm/llvm/include/llvm/IR/IRBuilder.h:271:3
 #9 0x00000000084a516b llvm::IRBuilder<llvm::ConstantFolder, clang::CodeGen::CGBuilderInserter>::CreateCall(llvm::FunctionType*, llvm::Value*, llvm::ArrayRef<llvm::Value*>, llvm::ArrayRef<llvm::OperandBundleDefT<llvm::Value*> >, llvm::Twine const&, llvm::MDNode*) /usr/local/google/home/jgorbe/code/llvm/llvm/include/llvm/IR/IRBuilder.h:2274:9                                                                                                                                                                 
#10 0x00000000084a4488 llvm::IRBuilder<llvm::ConstantFolder, clang::CodeGen::CGBuilderInserter>::CreateCall(llvm::FunctionCallee, llvm::ArrayRef<llvm::Value*>, llvm::ArrayRef<llvm::OperandBundleDefT<llvm::Value*> >, llvm::Twine const&, llvm::MDNode*) /usr/local/google/home/jgorbe/code/llvm/llvm/include/llvm/IR/IRBuilder.h:2288:5
#11 0x000000000849288c clang::CodeGen::CodeGenFunction::EmitRuntimeCall(llvm::FunctionCallee, llvm::ArrayRef<llvm::Value*>, llvm::Twine const&) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CGCall.cpp:3726:26
#12 0x000000000849278d clang::CodeGen::CodeGenFunction::EmitNounwindRuntimeCall(llvm::FunctionCallee, llvm::ArrayRef<llvm::Value*>, llvm::Twine const&) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CGCall.cpp:3691:19                                                                                                            
#13 0x000000000897b3e4 (anonymous namespace)::ItaniumCXXABI::emitTerminateForUnexpectedException(clang::CodeGen::CodeGenFunction&, llvm::Value*) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/ItaniumCXXABI.cpp:4364:5                       
#14 0x000000000865698d clang::CodeGen::CodeGenFunction::getTerminateHandler() /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CGException.cpp:1504:19
#15 0x00000000086561de clang::CodeGen::CodeGenFunction::getEHDispatchBlock(clang::CodeGen::EHScopeStack::stable_iterator) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CGException.cpp:630:21                                                                                                                                      
#16 0x000000000864e7d8 clang::CodeGen::CodeGenFunction::PopCleanupBlock(bool) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CGCleanup.cpp:970:23
#17 0x000000000864d053 clang::CodeGen::CodeGenFunction::PopCleanupBlocks(clang::CodeGen::EHScopeStack::stable_iterator, std::initializer_list<llvm::Value**>) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CGCleanup.cpp:423:3
#18 0x000000000864eae3 clang::CodeGen::CodeGenFunction::PopCleanupBlocks(clang::CodeGen::EHScopeStack::stable_iterator, unsigned long, std::initializer_list<llvm::Value**>) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CGCleanup.cpp:479:19                                                                                     
#19 0x000000000864404d clang::CodeGen::CodeGenFunction::RunCleanupsScope::ForceCleanup(std::initializer_list<llvm::Value**>) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CodeGenFunction.h:756:11
#20 0x0000000008655cd4 clang::CodeGen::CodeGenFunction::ExitCXXTryStmt(clang::CXXTryStmt const&, bool) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CGException.cpp:1234:16
#21 0x000000000868bf3c clang::CodeGen::CodeGenFunction::EmitDestructorBody(clang::CodeGen::FunctionArgList&) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/C
GClass.cpp:1531:1
#22 0x0000000008673ce5 clang::CodeGen::CodeGenFunction::GenerateCode(clang::GlobalDecl, llvm::Function*, clang::CodeGen::CGFunctionInfo const&) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CodeGenFunction.cpp:1256:5
#23 0x00000000087aea54 clang::CodeGen::CodeGenModule::codegenCXXStructor(clang::GlobalDecl) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CGCXX.cpp:214:3
#24 0x0000000008980a5e (anonymous namespace)::ItaniumCXXABI::emitCXXStructor(clang::GlobalDecl) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/ItaniumCXXABI.cpp:4003:19
#25 0x000000000853c297 clang::CodeGen::CodeGenModule::EmitGlobalDefinition(clang::GlobalDecl, llvm::GlobalValue*) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CodeGenModule.cpp:2816:9       
#26 0x0000000008532fb9 clang::CodeGen::CodeGenModule::EmitDeferred() /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CodeGenModule.cpp:2132:5                 
#27 0x0000000008533004 clang::CodeGen::CodeGenModule::EmitDeferred() /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CodeGenModule.cpp:2138:7
#28 0x0000000008533004 clang::CodeGen::CodeGenModule::EmitDeferred() /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CodeGenModule.cpp:2138:7
#29 0x0000000008533004 clang::CodeGen::CodeGenModule::EmitDeferred() /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CodeGenModule.cpp:2138:7
#30 0x0000000008531692 clang::CodeGen::CodeGenModule::Release() /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CodeGenModule.cpp:393:3
#31 0x0000000008e96d12 (anonymous namespace)::CodeGeneratorImpl::HandleTranslationUnit(clang::ASTContext&) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/ModuleBuilder.cpp:0:18                                                                                                                                                     
#32 0x0000000008e90d99 clang::BackendConsumer::HandleTranslationUnit(clang::ASTContext&) /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CodeGenAction.cpp:242:14                                                                                                                                                                     
#33 0x000000000af2c3ce clang::ParseAST(clang::Sema&, bool, bool) /usr/local/google/home/jgorbe/code/llvm/clang/lib/Parse/ParseAST.cpp:178:12                        
#34 0x0000000008cea5c2 clang::ASTFrontendAction::ExecuteAction() /usr/local/google/home/jgorbe/code/llvm/clang/lib/Frontend/FrontendAction.cpp:1044:1                   
#35 0x0000000008e8e72b clang::CodeGenAction::ExecuteAction() /usr/local/google/home/jgorbe/code/llvm/clang/lib/CodeGen/CodeGenAction.cpp:1089:1                         
#36 0x0000000008ce9f88 clang::FrontendAction::Execute() /usr/local/google/home/jgorbe/code/llvm/clang/lib/Frontend/FrontendAction.cpp:939:7                             
#37 0x0000000008c193b3 clang::CompilerInstance::ExecuteAction(clang::FrontendAction&) /usr/local/google/home/jgorbe/code/llvm/clang/lib/Frontend/CompilerInstance.cpp:964:23                                                                                                                                                                    
#38 0x0000000008e78e19 clang::ExecuteCompilerInvocation(clang::CompilerInstance*) /usr/local/google/home/jgorbe/code/llvm/clang/lib/FrontendTool/ExecuteCompilerInvocation.cpp:290:8                                                                        
#39 0x000000000517a8d2 cc1_main(llvm::ArrayRef<char const*>, char const*, void*) /usr/local/google/home/jgorbe/code/llvm/clang/tools/driver/cc1_main.cpp:250:13         
#40 0x000000000516e5ff ExecuteCC1Tool(llvm::ArrayRef<char const*>, llvm::StringRef) /usr/local/google/home/jgorbe/code/llvm/clang/tools/driver/driver.cpp:309:5         
#41 0x000000000516d9c9 main /usr/local/google/home/jgorbe/code/llvm/clang/tools/driver/driver.cpp:382:5                                                                 
#42 0x00007f678914e52b __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x2352b)
#43 0x000000000516d1ba _start (/usr/local/google/home/jgorbe/code/llvm-build/bin/clang-10+0x516d1ba)                                                           
clang-10: error: unable to execute command: Segmentation fault                                                                                                          
clang-10: error: clang frontend command failed due to signal (use -v to see invocation)                                                                                 
clang version 10.0.0 (https://github.com/llvm/llvm-project.git f2e65447b3cb6340883957e033e77095a025ebdc)

Thanks I see it, I'm working on a patch. Previously there was no support for frounding-math (unimplemented). This patch enables the option. In the IR builder, there's a call to a runtime function in the exception handler which is unexpectedly null. I start by adding a null pointer check.

In D62731#1749916, @mibintc wrote:

Thanks I see it, I'm working on a patch. Previously there was no support for frounding-math (unimplemented). This patch enables the option. In the IR builder, there's a call to a runtime function in the exception handler which is unexpectedly null. I start by adding a null pointer check.

Had a crash on valid here for days, let's revert and then get a fix when recommitting. I'll respond to the thread when reverting. Thanks :)

The incorrect code is actually in the IRBuilder which is part of a different patch...

In D62731#1750312, @echristo wrote:

In D62731#1749916, @mibintc wrote:

Thanks I see it, I'm working on a patch. Previously there was no support for frounding-math (unimplemented). This patch enables the option. In the IR builder, there's a call to a runtime function in the exception handler which is unexpectedly null. I start by adding a null pointer check.

Had a crash on valid here for days, let's revert and then get a fix when recommitting. I'll respond to the thread when reverting. Thanks :)

@echristo I just saw the bug was reported today, is the "crash on valid" visible on the bots? Can you provide url showing same? Thanks
I opened https://bugs.llvm.org/show_bug.cgi?id=44048 for @jgorbe

mibintc mentioned this in D69312: [FPEnv] Teach the IRBuilder about correct use of the strictfp attribute..Nov 18 2019, 11:11 AM

Reopening since patch was reverted

This revision is now accepted and ready to land.Nov 18 2019, 12:00 PM

mibintc updated this revision to Diff 229893.Nov 18 2019, 12:02 PM

mibintc added a reviewer: kpn.Nov 18 2019, 12:02 PM

Herald added a project: Restricted Project. · View Herald TranscriptNov 18 2019, 12:02 PM

I added a nullptr check in IRBuilder.h and a test case to cover the segfault reported in https://bugs.llvm.org/show_bug.cgi?id=44048

Does anyone think a warning is appropriate because the new flags are exercising experimental, incomplete code in both clang and llvm? The warning would be removed when we believe the feature is complete and ready to use.

llvm/include/llvm/IR/IRBuilder.h
262 ↗	(On Diff #229893)	This looks reasonable to me. It smells like there's a larger strictfp IRBuilder issue, but that's not an issue for this patch here. The larger issue won't be hit since the new options affect the entire compilation. It therefore shouldn't block this patch.

rjmccall added inline comments.Nov 18 2019, 12:23 PM

llvm/include/llvm/IR/IRBuilder.h
262 ↗	(On Diff #229893)	Does IRBuilder actually support inserting into an unparented basic block? I feel like this is exposing a much more serious mis-use of IRBuilder.

kpn added a child revision: D70256: [FPEnv] clang support for constrained FP builtins.Nov 18 2019, 12:26 PM

In D62731#1750412, @kpn wrote:

Does anyone think a warning is appropriate because the new flags are exercising experimental, incomplete code in both clang and llvm? The warning would be removed when we believe the feature is complete and ready to use.

@kpn Can you say more about "incomplete code in ... clang". I don't know what's missing from clang.

kpn mentioned this in D70256: [FPEnv] clang support for constrained FP builtins.Nov 18 2019, 12:27 PM

In D62731#1750427, @mibintc wrote:

In D62731#1750412, @kpn wrote:

Does anyone think a warning is appropriate because the new flags are exercising experimental, incomplete code in both clang and llvm? The warning would be removed when we believe the feature is complete and ready to use.

@kpn Can you say more about "incomplete code in ... clang". I don't know what's missing from clang.

See D70256. Calls to clang's math builtins that are llvm intrinsics need to be changed to be calls to the constrained intrinsics. I've been waiting to submit the patches because they weren't testable without these command line options.

There may be other issues as well. I'm not sure.

michele.scandale added a subscriber: michele.scandale.Nov 18 2019, 9:37 PM

michele.scandale added inline comments.

clang/lib/Driver/ToolChains/Clang.cpp
2448	Isn't the default `-fno-rounding-math`?
2456	Shouldn't this be set to `true` similarly to what you do for `TrappingMathPresent` to track whether there is an explicit option related to rounding math?
2605	Running `clang -### -ftrapping-math -ffp-exception-behavior=ignore` lead to this assertion to fail. As far as I can see `TrappingMath` is not changed in the case FPExceptionBehavior is "ignore" or "maytrap". Clearly in the "ignore" case it should be safe to just set `TrappingMath` to false, but I'm not sure about the "maytrap" case. It seems that `-ffp-exception-behavior` is more general than `-f{,no-}trapping-math`, so it seems natural to me to see `ftrapping-math` and `foo-trapping-math` as aliases for `ffp-exception-behavior=strict` and `ffp-exception-behavior=ignore` respectively. If we agree on this, then I would expect the reasoning inside the compiler only in terms of `FPExceptionBehavior`.
2607	With this change if I run `clang -### -ffast-math test.c` I don't see `-fno-trapping-math` passed to the CC1. This is changing the changes the value of the function level attribute "no-trapping-math" (see lib/CodeGen/CGCall.cpp : 1747). Is this an intended change? Moreover since with this patch the default value for trapping math changed, the "no-trapping-math" function level attribute is incorrect also for default case.

kpn added inline comments.Nov 19 2019, 6:06 AM

llvm/include/llvm/IR/IRBuilder.h
262 ↗	(On Diff #229893)	I suspect you are correct. If we let this "F && " change go in we'll have a situation where whether or not a block is currently in a function when a function call is emitted will affect whether or not the eventual function definition gets the strictfp attribute. That seems like an unfortunate inconsistency. I'm still looking into it. I hope to have an IRBuilder review up today or tomorrow.

kpn mentioned this in D70451: [FPEnv] IRBuilder should not put strictfp on function definitions automatically.Nov 19 2019, 10:02 AM

inline replies to Michele, will upload a new patch shortly

clang/lib/Driver/ToolChains/Clang.cpp
2448	Yes the default is no rounding math, I'll remove the comment. Thank you.
2456	There's a switch statement above this that interprets the command line option -fp-model=strict as though frounding had appeared on the command line by assigning a new value to optID so that's why there is a discrepancy. Also I'm using the *Present boolean variables to preserve the output from Driver so that pre-existing driver test cases don't need to be changed.
2605	Thanks for pointing out this assertion failure, I will upload a patch with fix. Yes we could entirely express ftrapping-math and fno-trapping-math via the ffp-exception-behavior= option. That would probably be better--currently the trapping option becomes effective via the exception behavior parameter to the llvm floating point constrained intrinsics, and it can take 3 values. I thought it would be too radical at the moment, so I didn't propose that in this patch. In the patch I'm about to add, I added a test case for the assertion that you saw.
2607	Before this patch, ftrapping-math was added to the Driver and also a bitfield, `NoTrappingFPMath `was created in the LLVM to describe the state of trapping-math, but otherwise that bit wasn't consulted and the option had no effect. Gcc describes ftrapping-math as the default, but in llvm by default floating point exceptions are masked and this corresponds to the floating point Constrained Intrinsics having exception behavior set to ignored. This patch changed the llvm constructor to set the trapping bit to "no trap". In fact I'd like to get rid of the` `NoTrappingFPMath`` bitfield since it's not being used, but I didn't make that change at this point. If I remember correctly, there are a bunch of driver tests that failed if fno-trapping-math is output to cc1. I'd have to reconstruct the details. Since fno-trapping-math is the default, it isn't passed through on the cc1 command line: the Clang.cpp driver doesn't pass through the positive and negative for each existing option. Thanks for pointing out the line in CGCall.cpp, it seems the CodeGenOpts aren't getting set up perfectly I'll fix that in CompilerInvocation.cpp; I don't see anything setting trapping-math as part of function level attribute, @michele.scandale did I overlook that/can you point out where that is?

andrew.w.kaylor added inline comments.Nov 19 2019, 12:20 PM

llvm/include/llvm/IR/IRBuilder.h
262 ↗	(On Diff #229893)	As I just commented on the related patch @kpn posted, it appears that IRBuilder doesn't entirely support inserting into an unparented block. I was surprised by this, but there are places that need to be able to get to the Module from the BasicBlock. So, I think something problematic may be happening in the failing case.

Here's an update in response to comments from @michele.scandale
I fixed the assertion error and added a test case
I fixed the setting of ftrapping-math in CodeGenOpts
I deleted an incorrect comment
I added a diagnostic when -fp-exception-behavior= is overridden on the command line by f[no-]trapping-math
I updated one of the test cases to work with some small modifications that will be made to IRBuilder.h

michele.scandale added inline comments.Nov 19 2019, 11:26 PM

clang/lib/Driver/ToolChains/Clang.cpp
2607	I guess you are referring to the code in `TargetMachine.cpp` where the function level attributes are used to reset the `TargetOptions` state whenever we initiate the backend codegen for a given function. Considering that the trapping math option as stated in the documentation did not have any effect, I'm not surprised to see not many uses. The only one I can see is in `llvm/lib/Target/ARM/ARMAsmPrinter.cpp : 687` where the function level attribute affects the emission of some ARM specific attributes. My only concern was that the change of the default value for trapping math was not propagated entirely causing this function level attribute to be initialized incorrectly. Fixing the logic in `CompilerInvocation.cpp` considering the change of default it is fine for me. Given that `ffp-exception-behavior={ignore,maytrap,strict}` supersedes `-f{,no-}trapping-math` I would expect long term to see the internal state of the compiler frontend to only care about the new state `FPExceptionBehavior` for both language and code generation options. And I guess the same would apply to the backend stage as well.

michele.scandale added inline comments.Nov 19 2019, 11:47 PM

clang/lib/Driver/ToolChains/Clang.cpp
2381	Here it seems you are changing `optID` to `OPT_ffast_math` to reuse the logic specified below for that case to reset the state of the floating point options.
2389	Here the state of the floating point options seems unchanged except for `FPContract`. If I run `clang -ffp-model=fast -ffp-model=precise`, I would expect the state of the floating point options to match the one of `-fno-fast-math` except for `FPContract` which you want to be set to "fast". I think you might need to replicate the reset for all the option here as well, so at this point I don't know how much worth is to use the optID reset trick for the "fast" case only.

This commit was reverted in 30e7ee3c4bac because a null deref error occurred in IRBuilder.h when setting strictfp attribute, see https://bugs.llvm.org/show_bug.cgi?id=44048 for information about that bug.
This patch moves setting strictfp from IRBuilder into clang/CodeGen. Also addresses some code review comments that were received after the revert. I'll add some inline comments next.

mibintc added a reviewer: erichkeane.Dec 2 2019, 12:21 PM

I added inline comments describing what I did in this version of the patch to address the bug https://bugs.llvm.org/show_bug.cgi?id=44048

clang/include/clang/AST/DeclBase.h
1539 ↗	(On Diff #231758)	This corresponds to "strictfp" LLVM attribute. I add this here because I want to collect the information during Sema and set the attribute during CodeGen. The next thing I want to do is to add support for modifying float_control via pragma within function bodies (enable floating point control at the block level). If I wasn't preparing to support floating_control via statement-level pragma then setting the bit could be accomplished entirely within CodeGen.
1560 ↗	(On Diff #231758)	Need to adjust the number of bits here, because it's at the threshold of overrunning 64 bits.
clang/include/clang/Basic/DiagnosticDriverKinds.td
444 ↗	(On Diff #231758)	@kpn thought it would be a good idea to add a Warning that the implementation of float control is experimental and partially implemented. That's what this is for.
clang/lib/Driver/ToolChains/Clang.cpp
2389	@michele.scandale Thanks for your helpful review, I think I fixed the things that you remarked on. I also added a test case for the assertion fail that you saw.
clang/test/CodeGen/fpconstrained.cpp
10 ↗	(On Diff #231758)	This is the test case from the bug report (null deref/segfault/in IRBuilder)
llvm/include/llvm/IR/IRBuilder.h
268 ↗	(On Diff #231758)	@kpn I got rid of this line because the function attribute is being set in CodeGen
llvm/unittests/IR/IRBuilderTest.cpp
186 ↗	(On Diff #231758)	@kpn I changed the test to create the function attribute a priori since it will be set in CodeGen before passing to IRBuilder

kpn added inline comments.Dec 2 2019, 12:52 PM

llvm/include/llvm/IR/IRBuilder.h
268 ↗	(On Diff #231758)	Makes sense.
llvm/unittests/IR/IRBuilderTest.cpp
186 ↗	(On Diff #231758)	Right, of course. I'm not going to quibble over the use of auto. It's fine I think.

michele.scandale added inline comments.Dec 2 2019, 7:16 PM

clang/lib/Driver/ToolChains/Clang.cpp
2389	Thanks!

I've pushed the updated patch,
commit cdbed2dd856c14687efd741c2d8321686102acb8

The tests fail on Windows, http://45.33.8.238/win/3405/step_6.txt

Please take a look, and if it takes a while to fix please revert while you investigte.

I reverted it again because build break on windows

In D62731#1769532, @thakis wrote:

The tests fail on Windows, http://45.33.8.238/win/3405/step_6.txt

Please take a look, and if it takes a while to fix please revert while you investigte.

It appears that only the 1st failure there is the fault of this patch. The 2nd seems to have come from some openmp patch (that didn't consider dso_local on windows).

The first (fpconstrained.cpp) likely just needs the check-lines to NOT explicitly say the %4/%5 and capture the loads of those with a wildcard instead.

It appears that only the 1st failure there is the fault of this patch. The 2nd seems to have come from some openmp patch (that didn't consider dso_local on windows).

The first (fpconstrained.cpp) likely just needs the check-lines to NOT explicitly say the %4/%5 and capture the loads of those with a wildcard instead.

Yes, sorry, I pasted the wrong link. http://45.33.8.238/win/3402/step_6.txt is the one for this commit, and it has just one of the two failures.

I fixed the lit test problem and pushed it again

commit 7f9b5138470db1dc58f3bc05631284c653c9ed7a
Author: Melanie Blower <melanie.blower@intel.com>

I've noticed you removed the change for CompilerInvocation.cpp about the initialization of the codegen option NoTrappingMath. Was that an accident?

In D62731#1773854, @michele.scandale wrote:

I've noticed you removed the change for CompilerInvocation.cpp about the initialization of the codegen option NoTrappingMath. Was that an accident?

I checked the old and new version of the patch and it seems like initialization of NoTrappingMath is unchanged, the definition of the option has it default to 0, and CompilerInvocation.cpp sets it like this in ParseLangArgs, was it something else you were looking at?
Opts.NoTrappingMath = Args.hasArg(OPT_fno_trapping_math);

In D62731#1775046, @mibintc wrote:

In D62731#1773854, @michele.scandale wrote:

I've noticed you removed the change for CompilerInvocation.cpp about the initialization of the codegen option NoTrappingMath. Was that an accident?

I checked the old and new version of the patch and it seems like initialization of NoTrappingMath is unchanged, the definition of the option has it default to 0, and CompilerInvocation.cpp sets it like this in ParseLangArgs, was it something else you were looking at?
Opts.NoTrappingMath = Args.hasArg(OPT_fno_trapping_math);

In the driver code you don't always pass -fno-trapping-math, therefore when when the compiler setup the CodeGen options in ParseCodeGenArgs you will end up executing Opts.NoTrappingMath = Args.hasArg(OPT_fno_trapping_math); hence you will have that Opt.NoTrappingMath = false. This is inconsistent with the state of the compiler driver where no-trapping-math is enabled default.

If you want to keep the default of the CC1 different than the default of the compiler driver, that's fine to me, but in that case the compiler driver needs to pass -fno-trapping-math to the CC1.
If we want the same new default, then the logic in ParseCodeGenArgs must be updated.

In D62731#1778597, @michele.scandale wrote:

I've noticed you removed the change for CompilerInvocation.cpp about the initialization of the codegen option NoTrappingMath. Was that an accident?

Thanks again Michele. I'd like to get rid of Opts.NoTrappingMath, but I haven't been bold enough yet. NoTrappingMath is not expressive enough because it can hold only 2 values, whereas the Exception behavior can be ignore, strict or maytrap. So I'd get rid of that Opts field, and the only place where I see it actually being used is in llvm/lib/Target/ARM/ARMAsmPrinter.cpp and the change in this patch doesn't seem to affect the ARM logic so I think if I got rid of it, it would be OK. All the other instances of the string are in llvm test cases.

In D62731#1779872, @mibintc wrote:

In D62731#1778597, @michele.scandale wrote:

I've noticed you removed the change for CompilerInvocation.cpp about the initialization of the codegen option NoTrappingMath. Was that an accident?

Thanks again Michele. I'd like to get rid of Opts.NoTrappingMath, but I haven't been bold enough yet.

I don't see a problem with this, but it would be nice to make the -f[no-]trapping-math command line option work. GNU compatibility is good.

rupprecht added a subscriber: rupprecht.Dec 11 2019, 4:53 PM

rupprecht added inline comments.

clang/lib/Sema/SemaExpr.cpp
13047 ↗	(On Diff #231758)	Looks like this is leftover debugging? I'm seeing log spam compiling some files -- this message repeated hundreds of times. I'll go ahead and create a patch that nukes this.

rupprecht marked an inline comment as done.Dec 11 2019, 4:55 PM

rupprecht added inline comments.

clang/lib/Sema/SemaExpr.cpp
13047 ↗	(On Diff #231758)	Sorry for the noise, looks like f4a7d5659df7cb56c1baa34a39e9fe2639472741 already did this.

In D62731#1780597, @cameron.mcinally wrote:

I don't see a problem with this, but it would be nice to make the -f[no-]trapping-math command line option work. GNU compatibility is good.

Thanks Cameron, I'll go that way

rupprecht added inline comments.Dec 12 2019, 12:51 PM

clang/include/clang/Basic/DiagnosticDriverKinds.td
444 ↗	(On Diff #231758)	Instead of adding a warning, I'd like to propose `-frounding-math` not be enabled unless an additional flag (e.g. `-fexperimental-float-control`) is passed. Or maybe this feature should be called `-f[no-]experimental-rounding-math` instead. There are plenty of builds that are already specifying `-frounding-math` (e.g. they also support building w/ a compiler such as gcc that implements this), and adding this experimental/incomplete implementation is a surprise to those builds. If I'm wrong and it's completely safe to ignore the warning (maybe it's incomplete but not in any unsafe way), we should just not have it at all.

andrew.w.kaylor added inline comments.Dec 12 2019, 1:31 PM

clang/include/clang/Basic/DiagnosticDriverKinds.td
444 ↗	(On Diff #231758)	You raise an interesting point about people who might be using -frounding-math already. However, if they are using this flag because they also sometimes build with a compiler such as gcc that supports the flag, they are currently getting incorrect behavior from clang. Without this patch, clang silently ignores the option and the optimizer silently ignores the fact that the program may be changing the rounding mode dynamically. The user may or may not be aware of this. With this patch such a user is likely to observe two effects: (1) their code will suddenly get slower, and (2) it will probably start behaving correctly with regard to rounding mode changes. The rounding behavior will definitely not get worse. I think the warning is useful as an indication that something has changed. I don't think requiring an additional option should be necessary.

rupprecht added inline comments.Dec 12 2019, 4:19 PM

clang/include/clang/Basic/DiagnosticDriverKinds.td
444 ↗	(On Diff #231758)	However, if they are using this flag because they also sometimes build with a compiler such as gcc that supports the flag, they are currently getting incorrect behavior from clang Incorrect, yes, but also likely valid behavior. "experimental" seems to imply a miscompile when using this option should not be unexpected. As I suggested before: if I'm wrong, and this behavior is only going to make the code more correct (as you suggest), can we remove the warning that this must be ack'd explicitly by adding `-Wno-experimental-float-control` to builds? I don't understand the motivation for the warning.

Currently we emit a warning if you use -frounding-math, saying that the option is ignored. That should have alerted users that they're not getting the correct behavior now. This patch removes the warning and (IIUC) implements the correct behavior but is over-conservative. If that's correct, I think that's acceptable and we don't need an "experimental" flag or a warning.

In D62731#1782762, @rjmccall wrote:

Currently we emit a warning if you use -frounding-math, saying that the option is ignored. That should have alerted users that they're not getting the correct behavior now. This patch removes the warning and (IIUC) implements the correct behavior but is over-conservative. If that's correct, I think that's acceptable and we don't need an "experimental" flag or a warning.

Oh, I didn't realize we were already warning about that. In theory, we should handle rounding math correctly with this change. It's possible we've missed some things, but I suppose that's always true. I think there are a few general intrinsics left that need constrained versions but don't have them, and we don't have any solution yet for target-specific intrinsics. If any of those have special handling that assumes the default rounding mode we will get it wrong. I don't think most users would be likely to encounter a problem.

In D62731#1782897, @andrew.w.kaylor wrote:

In D62731#1782762, @rjmccall wrote:

Currently we emit a warning if you use -frounding-math, saying that the option is ignored. That should have alerted users that they're not getting the correct behavior now. This patch removes the warning and (IIUC) implements the correct behavior but is over-conservative. If that's correct, I think that's acceptable and we don't need an "experimental" flag or a warning.

Oh, I didn't realize we were already warning about that. In theory, we should handle rounding math correctly with this change. It's possible we've missed some things, but I suppose that's always true. I think there are a few general intrinsics left that need constrained versions but don't have them, and we don't have any solution yet for target-specific intrinsics. If any of those have special handling that assumes the default rounding mode we will get it wrong. I don't think most users would be likely to encounter a problem.

Hmm. The target-specific intrinsics thing does concern me, since I assume many targets have a bunch of vector intrinsics that may be sensitive to rounding. Do we have an idea of how we'd fix that? If it's a short-term incorrectness that we have a plan to fix, I don't mind the risk, but if we don't know how we'd even start to address it...

In D62731#1782912, @rjmccall wrote:

Hmm. The target-specific intrinsics thing does concern me, since I assume many targets have a bunch of vector intrinsics that may be sensitive to rounding. Do we have an idea of how we'd fix that? If it's a short-term incorrectness that we have a plan to fix, I don't mind the risk, but if we don't know how we'd even start to address it...

I see two potential problem cases with the target-specific intrinsics:

Some optimization pass recognizes the intrinisic and uses its semantics to perform some optimization such as constant folding
Some optimization performs code motion that moves the intrinsic (or, in the backend, the instruction it represents) across an operation that changes the rounding mode

I don't know if there are any instances of the first case in the public repository. Downstream users could be doing it. Those will need special handling if they exist (checking for the the strictfp attribute).

The second case should be handled in IR by fesetround() or other such intrinsics being marked as having side effects. It's possible that there are target-specific intrinsics to change the rounding mode that aren't marked as having side effects, but if so that's simply a bug. The other part of this is that the intrinsic might be lowered to MC and the MC instructions in a way that neglects rounding mode. Many targets have instructions with forms that take an explicit rounding mode argument and the backends may be using that with the default rounding mode. I am not aware of any such case, but it's definitely possible.

Finally, our design for handling strict fp mode in the backends is that rounding mode control will be handled by explicitly modeling the dependency between the relevant control registers and instructions that implicitly use the rounding mode controled by those registers. X86 only recently started doing this. There may be other backends that have not implemented it. Some may never do so.

I don't have a strong preference about what to do with the warning. I have a slight preference for replacing the existing warning with a more specific warning saying that floating math support is a work in progress. Eventually we need a way for backends to indicate that they believe their support is complete.

I believe your analysis on the second point is unfortunately missing half the problem. Functions like fesetround will be treated as having arbitrary side-effects by default, which mean arbitrary code can't be reordered with calls to them. It'd be good to model that more precisely — to flag that the *only* side-effect they have is changing the FP state — but that's not really the big problem; that's just enabling better optimization by e.g. allowing them to be reordered with non-FP code. The big problem is that intrinsic calls are not arbitrary code: the vast majority of intrinsics (e.g. all the ARM vector intrinsics, many of which can be floating-point) are marked IntrNoMem (which I believe corresponds to readnone), which means calls to those intrinsics can be reordered with other code whether or not that code has arbitrary side-effects. It's good that people are looking at achieving better modeling for the x86 backend, but we need to have a plan that doesn't require heroic effort just to get basic correctness.

I would suggest that we need a function/call attribute roughly on the level of readonly / readnone, maybe readfponly, that says that a function has no side-effects and no dependencies on anything *except* the FP state. Basic queries like Instruction::mayReadMemory() that are supposed to be used generically in code-motion transforms would then return true for calls marked that way only if they're FP-constrained functions. So outside of an FP-constrained function we'd preserve optimizability, and inside one we'd be appropriately conservative. The generic backend could similarly just default to treating those intrinsic calls as having side-effects in FP-constrained functions, and there'd just be some way for a backend to opt out of that when it provides precise modeling of FP state. It'd then be a fairly straightforward change to go through the target intrinsic tables and mark which ones have dependencies on FP state.

In D62731#1784225, @rjmccall wrote:

... The big problem is that intrinsic calls are not arbitrary code: the vast majority of intrinsics (e.g. all the ARM vector intrinsics, many of which can be floating-point) are marked IntrNoMem (which I believe corresponds to readnone), which means calls to those intrinsics can be reordered with other code whether or not that code has arbitrary side-effects.

Oh, you're right. With the constrained intrinsics we are currently handling that by using IntrInaccessibleMemOnly as a proxy for access to the FP environment, but that's stronger than we'd want for architecture-specific intrinsics in the default case. We have talked about an fpenv-specific attribute, but nothing has been done. So, I guess that does leave us in the situation where rounding controls might not be correctly respected if target-specific intrinsics are used.

It's good that people are looking at achieving better modeling for the x86 backend, but we need to have a plan that doesn't require heroic effort just to get basic correctness.

Do you mean in the backend? If so, I don't think that's possible. The backends just don't have any sort of feature that could be used to get conservatively correct behavior for cheap the way intrinsics give it to us in the middle end. Once you go into instruction selection things get very low level in a hurry.

andrew.w.kaylor added inline comments.Dec 13 2019, 4:12 PM

clang/include/clang/Basic/DiagnosticDriverKinds.td
444 ↗	(On Diff #231758)	The "experimental" code won't be incorrect in any way that the code generated when we ignore the option is. The things that have been implemented will work correctly. The things that are not implemented will have the potential to disregard runtime changes to the rounding mode. Currently, dynamic changes to the rounding mode always have the potential of being ignored.

It's good that people are looking at achieving better modeling for the x86 backend, but we need to have a plan that doesn't require heroic effort just to get basic correctness.

Do you mean in the backend? If so, I don't think that's possible. The backends just don't have any sort of feature that could be used to get conservatively correct behavior for cheap the way intrinsics give it to us in the middle end. Once you go into instruction selection things get very low level in a hurry.

I'm looking for simple ways to modeling X86 intrinsics, but haven't find better one than modeling it one by one.

I would suggest that we need a function/call attribute roughly on the level of readonly / readnone, maybe readfponly, that says that a function has no side-effects and no dependencies on anything *except* the FP state.

Do you mean mark it at the declaration of intrinsics? Is it reasonable to mark except on dependent intrinsics?

Basic queries like Instruction::mayReadMemory() that are supposed to be used generically in code-motion transforms would then return true for calls marked that way only if they're FP-constrained functions.

Middle end or back end? I think in middle end you may need to change all releated passes to get such information to prevent optimization. And in back end, I think we can simply chain intrinsics marked except with other FP nodes like what common code doing.

In D62731#1784491, @pengfei wrote:

It's good that people are looking at achieving better modeling for the x86 backend, but we need to have a plan that doesn't require heroic effort just to get basic correctness.

Do you mean in the backend? If so, I don't think that's possible. The backends just don't have any sort of feature that could be used to get conservatively correct behavior for cheap the way intrinsics give it to us in the middle end. Once you go into instruction selection things get very low level in a hurry.

I'm looking for simple ways to modeling X86 intrinsics, but haven't find better one than modeling it one by one.

I would suggest that we need a function/call attribute roughly on the level of readonly / readnone, maybe readfponly, that says that a function has no side-effects and no dependencies on anything *except* the FP state.

Do you mean mark it at the declaration of intrinsics? Is it reasonable to mark except on dependent intrinsics?

Basic queries like Instruction::mayReadMemory() that are supposed to be used generically in code-motion transforms would then return true for calls marked that way only if they're FP-constrained functions.

Middle end or back end? I think in middle end you may need to change all releated passes to get such information to prevent optimization. And in back end, I think we can simply chain intrinsics marked except with other FP nodes like what common code doing.

We don't want to do this all the time. So we need a new property for the intrinsics that means we should prevent code motion in the middle end when the calling function has the strictfp attribute. Similarly SelectionDAGBuilder should use INTRINSIC_W_CHAIN instead of INTRINSIC_WO_CHAIN for any of these intrinsics when strictp is enabled.

rupprecht mentioned this in D71635: [clang] Rename -frounding-math to -fexperimental-rounding-math and add -frounding-math back as a gcc-compat arg..Dec 17 2019, 3:54 PM

It seems the discussion of whether or not this is incomplete died out -- I'd prefer to assume it is incomplete if there is no consensus. Mailed D71635 to rename -frounding-math to -fexperimental-rounding-math.

Alternatively we could remove the warning. I still don't see a good argument for the middle ground of having it called -frounding-math but also generate a warning.

In D62731#1788838, @rupprecht wrote:

It seems the discussion of whether or not this is incomplete died out -- I'd prefer to assume it is incomplete if there is no consensus. Mailed D71635 to rename -frounding-math to -fexperimental-rounding-math.

Alternatively we could remove the warning. I still don't see a good argument for the middle ground of having it called -frounding-math but also generate a warning.

It's definitely incomplete but the results will not be any worse than you get when -frounding-math is ignored.

My preference would be to change the text of the warning that is issued but allow -frounding-math to be enabled by this commit without requiring an additional option.

I would also very much like to see this patch re-committed. It's currently in the "approved" state. If anyone objects to this being committed, please use the "request changes" action to indicate this.

In D62731#1788902, @andrew.w.kaylor wrote:

In D62731#1788838, @rupprecht wrote:

It seems the discussion of whether or not this is incomplete died out -- I'd prefer to assume it is incomplete if there is no consensus. Mailed D71635 to rename -frounding-math to -fexperimental-rounding-math.

Alternatively we could remove the warning. I still don't see a good argument for the middle ground of having it called -frounding-math but also generate a warning.

It's definitely incomplete but the results will not be any worse than you get when -frounding-math is ignored.

My preference would be to change the text of the warning that is issued but allow -frounding-math to be enabled by this commit without requiring an additional option.

If other reviewers agree, then let's just remove the warning. I can send a patch tomorrow unless someone else wants to do that.

I would also very much like to see this patch re-committed. It's currently in the "approved" state. If anyone objects to this being committed, please use the "request changes" action to indicate this.

It is already re-committed. 7f9b5138470db1dc58f3bc05631284c653c9ed7a reapplied it, but IIUC it was not closed in phabricator due to leading whitespace in the commit message:

Reapply af57dbf12e54 "Add support for options -frounding-math, ftrapping-math, -ffp-model=, and -ffp-exception-behavior="
...
        Differential Revision: https://reviews.llvm.org/D62731

The "Differential" needs to be the first thing, whitespace cannot come before it.

rupprecht mentioned this in D71671: [clang] Remove -Wexperimental-float-control..Dec 18 2019, 12:13 PM

In D62731#1789122, @rupprecht wrote:

In D62731#1788902, @andrew.w.kaylor wrote:

In D62731#1788838, @rupprecht wrote:

It seems the discussion of whether or not this is incomplete died out -- I'd prefer to assume it is incomplete if there is no consensus. Mailed D71635 to rename -frounding-math to -fexperimental-rounding-math.

Alternatively we could remove the warning. I still don't see a good argument for the middle ground of having it called -frounding-math but also generate a warning.

It's definitely incomplete but the results will not be any worse than you get when -frounding-math is ignored.

My preference would be to change the text of the warning that is issued but allow -frounding-math to be enabled by this commit without requiring an additional option.

If other reviewers agree, then let's just remove the warning. I can send a patch tomorrow unless someone else wants to do that.

Mailed D71671 to do this.

If neither patch is acceptable, then I would like to revert this commit instead, as we are having issues with this patch.

In D62731#1790211, @rupprecht wrote:

In D62731#1789122, @rupprecht wrote:

In D62731#1788902, @andrew.w.kaylor wrote:

In D62731#1788838, @rupprecht wrote:

It seems the discussion of whether or not this is incomplete died out -- I'd prefer to assume it is incomplete if there is no consensus. Mailed D71635 to rename -frounding-math to -fexperimental-rounding-math.

Alternatively we could remove the warning. I still don't see a good argument for the middle ground of having it called -frounding-math but also generate a warning.

It's definitely incomplete but the results will not be any worse than you get when -frounding-math is ignored.

My preference would be to change the text of the warning that is issued but allow -frounding-math to be enabled by this commit without requiring an additional option.

If other reviewers agree, then let's just remove the warning. I can send a patch tomorrow unless someone else wants to do that.

Mailed D71671 to do this.

If neither patch is acceptable, then I would like to revert this commit instead, as we are having issues with this patch.

I think we should stick with this patch and remove the warning like you proposed in D71671

rupprecht mentioned this in rG553a727f5f64: [clang] Remove -Wexperimental-float-control..Dec 18 2019, 5:03 PM

I have found bug in clang-cl (win32 clang), related to recent inroduction of ffp-exception-behavior.
Unfortunately, I don't have a working patch yet, and since LLVM bugtracker registration is closed, I can not even submit a bug.

So, if it is not a trouble for you, I will email the bug description here.

Please let me know if it isn't appropriate. Bug description:

Windows: clang-cl is generating call to non-existing lib function for win32 with /fp:except option.
With recent ffp-exception-behavior=maytrap/strict, fp:except in clang-cl became generate FPE aware code.

But in case of floorf and ceilf it generates call to non-existing library function.

clang-cl.exe -m32 /Ox /fp:except testFloor.cpp /FA
testFloor.cpp:

#include <math.h>
float ret(float v) {  return floorf(v); }

resulting assember:

push    eax
movss    xmm0, dword ptr [esp + 8]
movss    dword ptr [esp], xmm0
call    _floorf #no such function!!!
pop    eax
ret

Expected behaviour:

there is no floorf lib function. Like with cosf and other math functions, floorf in MSVC is implemented as inline function.

So, it should be call to _floor (with apropriate conversion first).

In D62731#1796962, @AntonYudintsev wrote:
I have found bug in clang-cl (win32 clang), related to recent inroduction of ffp-exception-behavior.
Unfortunately, I don't have a working patch yet, and since LLVM bugtracker registration is closed, I can not even submit a bug.

So, if it is not a trouble for you, I will email the bug description here.

Please let me know if it isn't appropriate. Bug description:

Windows: clang-cl is generating call to non-existing lib function for win32 with /fp:except option.
With recent ffp-exception-behavior=maytrap/strict, fp:except in clang-cl became generate FPE aware code.

But in case of floorf and ceilf it generates call to non-existing library function.

clang-cl.exe -m32 /Ox /fp:except testFloor.cpp /FA
testFloor.cpp:
#include <math.h>
float ret(float v) {  return floorf(v); }
resulting assember:
push    eax
movss    xmm0, dword ptr [esp + 8]
movss    dword ptr [esp], xmm0
call    _floorf #no such function!!!
pop    eax
ret
Expected behaviour:

there is no floorf lib function. Like with cosf and other math functions, floorf in MSVC is implemented as inline function.

So, it should be call to _floor (with apropriate conversion first).

Hopefully fixed by 53ee806d93e8d2371726ec5ce59b0e68b309c258

Found some issue when looking at this code: -ftrapping_math and -fno_trapping_math will never have effect

clang/lib/Frontend/CompilerInvocation.cpp
3100	Calling 'Opts.setFPExceptionMode(xx)' here has no effect, as it will be overruled later on, on line 3174 ! Same is true on line 3159

Herald added a subscriber: dang. · View Herald TranscriptJul 24 2020, 5:03 AM

SjoerdMeijer mentioned this in D93395: [clang][cli] Remove -f[no-]trapping-math from -cc1 command line.Dec 21 2020, 3:02 AM

Revision Contents

Path

Size

clang/

docs/

UsersManual.rst

55 lines

include/

clang/

Basic/

CodeGenOptions.def

1 line

LangOptions.h

57 lines

Driver/

Options.td

7 lines

lib/

CodeGen/

BackendUtil.cpp

1 line

CodeGenFunction.h

3 lines

CodeGenFunction.cpp

58 lines

Driver/

ToolChains/

Clang.cpp

187 lines

Frontend/

CompilerInvocation.cpp

44 lines

test/

CodeGen/

fpconstrained.c

23 lines

Driver/

clang_f_opts.c

2 lines

fast-math.c

4 lines

llvm/

include/

llvm/

Target/

TargetOptions.h

7 lines

Diff 223908

clang/docs/UsersManual.rst

Show First 20 Lines • Show All 1,213 Lines • ▼ Show 20 Lines	-f[no-]math-errno
On some targets, math library functions never set ``errno``, and so		On some targets, math library functions never set ``errno``, and so
``-fno-math-errno`` is the default. This includes most BSD-derived		``-fno-math-errno`` is the default. This includes most BSD-derived
systems, including Darwin.		systems, including Darwin.

.. _opt_ftrapping-math:		.. _opt_ftrapping-math:

-f[no-]trapping-math		-f[no-]trapping-math

``-fno-trapping-math`` allows optimizations that assume that		Control floating point exception behavior. ``-fno-trapping-math`` allows optimizations that assume that floating point operations cannot generate traps such as divide-by-zero, overflow and underflow.
floating point operations cannot generate traps such as divide-by-zero,
overflow and underflow. Defaults to ``-ftrapping-math``.		- The option ``-ftrapping-math`` behaves identically to ``-ffp-exception-behavior=strict``.
Currently this option has no effect.		- The option ``-fno-trapping-math`` behaves identically to ``-ffp-exception-behavior=ignore``. This is the default.

.. option:: -ffp-contract=<value>		.. option:: -ffp-contract=<value>

Specify when the compiler is permitted to form fused floating-point		Specify when the compiler is permitted to form fused floating-point
operations, such as fused multiply-add (FMA). Fused operations are		operations, such as fused multiply-add (FMA). Fused operations are
permitted to produce more precise results than performing the same		permitted to produce more precise results than performing the same
operations separately.		operations separately.

▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines	-f[no-]finite-math
not NaNs or +-Inf. This defines the ``__FINITE_MATH_ONLY__`` preprocessor macro.		not NaNs or +-Inf. This defines the ``__FINITE_MATH_ONLY__`` preprocessor macro.
Also implies:		Also implies:

* ``-fno-honor-infinities``		* ``-fno-honor-infinities``
* ``-fno-honor-nans``		* ``-fno-honor-nans``

Defaults to ``-fno-finite-math``.		Defaults to ``-fno-finite-math``.

		.. _opt_frounding-math:

		-f[no-]rounding-math

		LLVM constrained floating point supports five rounding modes: ``tonearest``,
		``downward``, ``upward``, ``towardzero`` and ``dynamic``. The first four
		values represent the corresponding IEEE rounding rules, and the ``dynamic``
		mode informs the compiler that it must not assume any particular
		rounding mode.
		rjmccallUnsubmitted Done Reply Inline Actions "represent the corresponding IEEE rounding rules" rjmccall: "represent the corresponding IEEE rounding rules"
		rjmccallUnsubmitted Not Done Reply Inline Actions A few points about this documentation that occurred to me since the last time I looked at it: It's weird to talk about LLVM here, since this is the Clang documentation. Clang's behavior is not specified in terms of the IR it generates; it's specified in terms of the formal behavior of the source code. Therefore this documentation should talk about things using concepts from an appropriate language standard whenever possible; in this case, C99 works. It's weird to bring up all these different rounding modes when this option doesn't actually let you do anything with them. If you want to talk about rounding modes in general that's fine as a way of informing the programmer, but we shouldn't give them information they can't use. I don't think `-fno-rounding-math` is actually equivalent to forcing the use of the `tonearest` rounding mode; I think it assumes that the rounding mode is set to `tonearest`. (Or am I wrong and this is actually guaranteed by ABI?) I don't think we want to define `-frounding-math` as exactly equivalent to `-ffp-model=strict`. That might be a convenient implementation for now, but it seems to me that `-frounding-math` still allows some optimizations that `-ffp-model=strict` wouldn't. With that in mind, I'd suggest something like this: Force floating-point operations to honor the dynamically-set rounding mode by default. The result of a floating-point operation often cannot be exactly represented in the result type and therefore must be rounded. IEEE 754 describes different rounding modes that control how to perform this rounding, not all of which are supported by all implementations. C provides interfaces (`fesetround `and` `fesetenv``) for dynamically controlling the rounding mode, and while it also recommends certain conventions for changing the rounding mode, these conventions are not typically enforced in the ABI. Since the rounding mode changes the numerical result of operations, the compiler must understand something about it in order to optimize floating point operations. Note that floating-point operations performed as part of constant initialization are formally performed prior to the start of the program and are therefore not subject to the current rounding mode. This includes the initialization of global variables and local `static `variables. Floating-point operations in these contexts will be rounded using` `FE_TONEAREST``. The option `-fno-rounding-math `allows the compiler to assume that the rounding mode is set to` `FE_TONEAREST``. This is the default. The option `-frounding-math` forces the compiler to honor the dynamically-set rounding mode. This prevents optimizations which might affect results if the rounding mode changes or is different from the default; for example, it prevents floating-point operations from being reordered across most calls and prevents constant-folding when the result is not exactly representable. rjmccall: A few points about this documentation that occurred to me since the last time I looked at it…
		mibintcAuthorUnsubmitted Not Done Reply Inline Actions Thank you, I will work on another patch mibintc: Thank you, I will work on another patch

		- The option ``-fno-rounding-math`` specifies ``tonearest`` rounding mode. This is the default.
		- The option ``-frounding-math`` specifies ``dynamic`` rounding mode.
		- The option ``-frounding-math`` behaves identically to ``-ffp-model=strict``. Consequently, this option also sets ``-ffp-exception-behavior=strict``.

		.. option:: -ffp-model=<value>

		Specify floating point behavior. ``-ffp-model`` is an umbrella
		option that encompasses functionality provided by other, single
		purpose, clang floating point options. Valid values are: ``precise``, ``strict``,
		and ``fast``.
		Details:
		rjmccallUnsubmitted Done Reply Inline Actions "provided by other, single-purpose floating point options." rjmccall: "provided by other, single-purpose floating point options."
		rjmccallUnsubmitted Done Reply Inline Actions I don't know why you keep including "clang" as a modifier here; this is the clang documentation, and all of these options are clang options no matter where they might have been borrowed from. rjmccall: I don't know why you keep including "clang" as a modifier here; this is the clang documentation…
		mibintcAuthorUnsubmitted Done Reply Inline Actions thanks for explicitly pointing out use of 'clang', i fixed it mibintc: thanks for explicitly pointing out use of 'clang', i fixed it

		* ``precise`` Disables optimizations that are not value-safe on floating-point data, although FP contraction (FMA) is enabled (``-ffp-contract=fast``). This is clang's default behavior.
		* ``strict`` Enables ``-frounding-math`` and ``-ffp-exception-behavior=strict``, and disables contractions (FMA). All of the ``-ffast-math`` enablements are disabled.
		* ``fast`` Behaves identically to specifying both ``-ffast-math`` and ``ffp-contract=fast``

		Note: If your command line specifies multiple instances
		of the ``-ffp-model`` option, or if your command line option specifies
		``-ffp-model`` and later on the command line selects a floating point
		option that has the effect of negating part of the ``ffp-model`` that
		has been selected, then the compiler will issue a diagnostic warning
		that the override has occurred.
		rjmccallUnsubmitted Not Done Reply Inline Actions That's not typical driver behavior; why this choice? rjmccall: That's not typical driver behavior; why this choice?
		mibintcAuthorUnsubmitted Not Done Reply Inline Actions The rationale for the warnings is that the floating point options are sufficiently complicated that it makes sense to warn the uses that one of the later options supplied on the command line is undoing a choice made earlier. It's not obvious that e.g. the setting for fassociative-math is also controlled by -fp-model=strict mibintc: The rationale for the warnings is that the floating point options are sufficiently complicated…
		rjmccallUnsubmitted Not Done Reply Inline Actions Okay. Well, it's a new option, so new behavior is alright, but if you're worried about the collisions having arbitrary effects that you'll have to maintain compatibility with, you should consider making it an error instead, because a warning still means it's permitted. rjmccall: Okay. Well, it's a new option, so new behavior is alright, but if you're worried about the…
		mibintcAuthorUnsubmitted Done Reply Inline Actions @andrew.w.kaylor What do you think about making the diagnostics error vs. warning? mibintc: @andrew.w.kaylor What do you think about making the diagnostics error vs. warning?

		.. option:: -ffp-exception-behavior=<value>

		Specify the floating-point exception behavior.

		Valid values are: ``ignore``, ``maytrap``, and ``strict``.
		The default value is ``ignore``. Details:

		* ``ignore`` The compiler assumes that the exception status flags will not be read and that floating point exceptions will be masked.
		* ``maytrap`` The compiler avoids transformations that may raise exceptions that would not have been raised by the original code. Constant folding performed by the compiler is exempt from this option.
		* ``strict`` The compiler ensures that all transformations strictly preserve the floating point exception semantics of the original code.




.. _controlling-code-generation:		.. _controlling-code-generation:

Controlling Code Generation		Controlling Code Generation
---------------------------		---------------------------

Clang provides a number of ways to control code generation. The options		Clang provides a number of ways to control code generation. The options
are listed below.		are listed below.

▲ Show 20 Lines • Show All 122 Lines • ▼ Show 20 Lines

.. option:: -fstrict-vtable-pointers		.. option:: -fstrict-vtable-pointers

Enable optimizations based on the strict rules for overwriting polymorphic		Enable optimizations based on the strict rules for overwriting polymorphic
C++ objects, i.e. the vptr is invariant during an object's lifetime.		C++ objects, i.e. the vptr is invariant during an object's lifetime.
This enables better devirtualization. Turned off by default, because it is		This enables better devirtualization. Turned off by default, because it is
still experimental.		still experimental.

.. option:: -fwhole-program-vtables		.. option:: -fwhole-program-vtables
		rjmccallUnsubmitted Done Reply Inline Actions This should be something like `-fp-model=<value>`. Square brackets mean optional elements in these docs. rjmccall: This should be something like `-fp-model=<value>`. Square brackets mean optional elements in…

Enable whole-program vtable optimizations, such as single-implementation		Enable whole-program vtable optimizations, such as single-implementation
devirtualization and virtual constant propagation, for classes with		devirtualization and virtual constant propagation, for classes with
:doc:`hidden LTO visibility <LTOVisibility>`. Requires ``-flto``.		:doc:`hidden LTO visibility <LTOVisibility>`. Requires ``-flto``.

.. option:: -fforce-emit-vtables		.. option:: -fforce-emit-vtables
		rjmccallUnsubmitted Not Done Reply Inline Actions Combined how? With a comma? This option seems to have two independent dimensions. Is that necessary for command-line compatibility with ICC, or can we separate it into two options? The documentation should mention the default behavior along both dimensions. Is it possible to override a prior instance of this option to get this default behavior back? You mention that this `-fp-model=fast` is equivalent to `-ffast-math`. How does this option interact with that one if both are given on a command line? Please put option text in backticks wherever it appears. Most of these comments apply to `-fp-speculation` as well. rjmccall: Combined how? With a comma? This option seems to have two independent dimensions. Is that…
		mibintcAuthorUnsubmitted Done Reply Inline Actions Combined how? With a comma? This option seems to have two independent dimensions. Is that necessary for command-line compatibility with ICC, or can we separate it into two options? Yes that's right, there are 2 dimensions. I wrote it like this for identical compatibility with icc, and cl.exe also defines the option this way, to specify multiple values simultaneously. However I think it would be reasonable and good to split them into separate options. I will discuss this with the folks back home. The documentation should mention the default behavior along both dimensions. I added this info into the doc Is it possible to override a prior instance of this option to get this default behavior back? The 3 values along one dimension, precise, strict, fast if they appear multiple times in the command line, the last value will be the setting along that dimension. Ditto with the other dimension, the rightmost occurrence of except or noexcept will be the setting. You mention that this -fp-model=fast is equivalent to -ffast-math. How does this option interact with that one if both are given on a command line? The idea is that they are synonyms so if either or both appeared on the command line, the effect would be identical. I'll upload another patch with a few documentation updates and get back to you about splitting the fp-model option into multiple options. (Longer term, there are 2 other dimensions to fp-model) And thanks for the review mibintc: > Combined how? With a comma? > This option seems to have two independent dimensions. Is that…
		rjmccallUnsubmitted Not Done Reply Inline Actions Yes that's right, there are 2 dimensions. I wrote it like this for identical compatibility with icc, and cl.exe also defines the option this way, to specify multiple values simultaneously. However I think it would be reasonable and good to split them into separate options. I will discuss this with the folks back home. Okay. There's certainly some value in imitating existing compilers, but it sounds like a lot has been forced into one option, so maybe we should take the opportunity to split it up. If we do split it, though, I think the different dimensions should have different base spellings, rather than being repeated uses of `-fp-model`. The 3 values along one dimension, precise, strict, fast if they appear multiple times in the command line, the last value will be the setting along that dimension. Okay. This wasn't clear to me from the code, since the code also has an "off" option. The idea is that they are synonyms so if either or both appeared on the command line, the effect would be identical. Right, but compiler options are allowed to conflict with each other, with the general rule being that the last option "wins". So what I'm asking is if that works correctly with this option and `-ffast-math`, so that e.g. `-ffast-math -fp-model=strict` leaves you with strict FP but `-fp-model=strict -ffast-math` leaves you with fast FP. (That is another reason why it's best to have one aspect settled in each option: because you don't have to merge information from different uses of the option.) At any rate, the documentation should be clear about how this interacts with `-ffast-math`. You might even consider merging this into the documentation for `-ffast-math`, or at least revising that option's documentation. Does `-fp-model=fast` cause `__FAST_MATH__` to be defined? Also, strictly speaking, this should be `-ffp-model`, right? rjmccall: > Yes that's right, there are 2 dimensions. I wrote it like this for identical compatibility…
		mibintcAuthorUnsubmitted Done Reply Inline Actions Thanks for the review. I'm going to upload anotoher patch which drops -fp-model=[no-]except. This will clean up the command line for the fp-model setting because now it cannot have 2 settings simultaneously. The new patch will drop the fp-speculation option, and add a new option fp-exception-behavior. The fp-exception-behavior option allows access to the "eb" exception behavior setting of the LLVM constrained floating point intrinsics. The patch is pseudo code at this point because I want to get @rjmccall response to this proposal before finalizing. Since fp-model is an umbrella option, there are conflicts between it and existing options. I added pseudo code into RenderFloatingPointOptions to detect and report the conflicts, and rewrote the part that detects inter-option conflicts. mibintc: Thanks for the review. I'm going to upload anotoher patch which drops -fp-model=[no-]except.
		andrew.w.kaylorUnsubmitted Not Done Reply Inline Actions I think the ICC interface includes the exception option for compatibility/consistency with Microsoft's /fp option. We can handle that in clang-cl. So, I agree that it makes sense to split that out in clang. ICC's implementation of this actually has four dimensions, only two of which are being taken on here. Frankly, I think it's a bit of a mess. The core concept which I think we should bring into clang with this option is to have a single option that manages all the various settings to control floating point behavior to produce the primary expected modes of operation so users don't have to find all the flags and remember the default settings for each one. The way I'd suggest this should work is that we provide just the primary "models" and allow other options to modify the base behavior, regardless of the order in which the options appear. So, for example, -fp-model=precise -fp-speculation=safe and -fp-speculation=stafe -fp-model=precise would both mean the same thing, disable value-unsafe optimizations and prevent speculative execution of floating point operations. I don't know how painful that is from a driver perspective or how obvious it would be to "most users" but to me it seems to be the logical result of fp-model being an umbrella setting and other options being able to modify it. andrew.w.kaylor: I think the ICC interface includes the exception option for compatibility/consistency with…

In order to improve devirtualization, forces emitting of vtables even in		In order to improve devirtualization, forces emitting of vtables even in
		kpnUnsubmitted Done Reply Inline Actions Extra spaces? kpn: Extra spaces?
modules where it isn't necessary. It causes more inline virtual functions		modules where it isn't necessary. It causes more inline virtual functions
to be emitted.		to be emitted.
		andrew.w.kaylorUnsubmitted Done Reply Inline Actions There's a bit of ambiguity here because FP contraction isn't an on/off switch in LLVM. It has three settings: on, off, and fast. What you've done in this patch sets it to 'on' for precise, 'off' for strict, and 'fast' for fast. That sounds reasonable, but it's not what ICC and MSVC do. ICC and MSVC both have a behavior equivalent to -ffp-contract=fast in the precise model. The idea behind this is that FMA operations are actually more precise than the non-contracted operations. They don't always give the same result, but they give a more precise result. The problem with this is that if we adopt this approach it leaves us with no fp model that corresponds to the default compiler behavior if you don't specify an -fp-model at all. andrew.w.kaylor: There's a bit of ambiguity here because FP contraction isn't an on/off switch in LLVM. It has…

.. option:: -fno-assume-sane-operator-new		.. option:: -fno-assume-sane-operator-new

Don't assume that the C++'s new operator is sane.		Don't assume that the C++'s new operator is sane.

		rjmccallUnsubmitted Not Done Reply Inline Actions What you should document here are the semantics and how the option interacts with other options, not how code gets translated into LLVM. I'm not sure what the FIXME question here is; are you asking whether providing `-frounding-math` should imply an FP model? The notes about each of the options should probably be structured into a bullet list. rjmccall: What you should document here are the semantics and how the option interacts with other options…
		mibintcAuthorUnsubmitted Done Reply Inline Actions I'll remove the FIXME and assert that frounding-math uses dynamic-rounding and strict exception behavior. This will make frounding-math synonymous with fp-model=strict. I'll reformat to put notes into bullet lists. mibintc: I'll remove the FIXME and assert that frounding-math uses dynamic-rounding and strict exception…
This option tells the compiler to do not assume that C++'s global		This option tells the compiler to do not assume that C++'s global
new operator will always return a pointer that does not alias any		new operator will always return a pointer that does not alias any
other pointer when the function returns.		other pointer when the function returns.

.. option:: -ftrap-function=[name]		.. option:: -ftrap-function=[name]

Instruct code generator to emit a function call to the specified		Instruct code generator to emit a function call to the specified
function name for ``__builtin_trap()``.		function name for ``__builtin_trap()``.

LLVM code generator translates ``__builtin_trap()`` to a trap		LLVM code generator translates ``__builtin_trap()`` to a trap
		rjmccallUnsubmitted Done Reply Inline Actions These are exclusive, right? So the documentation should be `<value>`, not `<values>`. rjmccall: These are exclusive, right? So the documentation should be `<value>`, not `<values>`.
instruction if it is supported by the target ISA. Otherwise, the		instruction if it is supported by the target ISA. Otherwise, the
builtin is translated into a call to ``abort``. If this option is		builtin is translated into a call to ``abort``. If this option is
set, then the code generator will always lower the builtin to a call		set, then the code generator will always lower the builtin to a call
to the specified function regardless of whether the target ISA has a		to the specified function regardless of whether the target ISA has a
trap instruction. This option is useful for environments (e.g.		trap instruction. This option is useful for environments (e.g.
deeply embedded) where a trap cannot be properly handled, or when		deeply embedded) where a trap cannot be properly handled, or when
some custom behavior is desired.		some custom behavior is desired.

.. option:: -ftls-model=[model]		.. option:: -ftls-model=[model]

Select which TLS model to use.		Select which TLS model to use.

		rjmccallUnsubmitted Not Done Reply Inline Actions This is basically incomprehensible. :) I don't know if the problem is the behavior or just how it's being described, but I have no idea what "conflict" means — does it mean the option gets overridden, ignored, or causes an error? I think what you're trying to say is: Basic FP behavior can be broken down along two dimensions: the FP strictness model and the FP exceptions model. There are many existing options for controlling FP behavior. Some of these existing options are equivalent to setting one (or both?) of these dimensions. These options should generally be treated as synonyms for the purposes of deciding the ultimate setting; for example, `-ffp-model=fast -fno-fast-math` should basically leave the setting in its default state (right?). Other existing options only make sense in combination with certain basic models. For example, `-ffp-contract=fast` (note the spelling) is only allowed when using the fast FP model (right?). As a specific note, you break out the options into a list below; the entry for `fast` is the place to add things like "Equivalent to `-ffast-math`, including defining `__FAST_MATH__`)". rjmccall: This is basically incomprehensible. :) I don't know if the problem is the behavior or just how…
		mibintcAuthorUnsubmitted Done Reply Inline Actions Conflict was a poor choice of words. I meant to say that the umbrella options like fp-model=strict overlap with some of the other floating-point settings, in that case the rightmost option takes precedence and overrides the setting. I want the new options to behave in the same way that other clang options: rightmost option has precedence. mibintc: Conflict was a poor choice of words. I meant to say that the umbrella options like fp…
Valid values are: ``global-dynamic``, ``local-dynamic``,		Valid values are: ``global-dynamic``, ``local-dynamic``,
``initial-exec`` and ``local-exec``. The default value is		``initial-exec`` and ``local-exec``. The default value is
``global-dynamic``. The compiler may use a different model if the		``global-dynamic``. The compiler may use a different model if the
selected model is not supported by the target, or if a more		selected model is not supported by the target, or if a more
efficient model can be used. The TLS model can be overridden per		efficient model can be used. The TLS model can be overridden per
variable using the ``tls_model`` attribute.		variable using the ``tls_model`` attribute.

.. option:: -femulated-tls		.. option:: -femulated-tls
▲ Show 20 Lines • Show All 2,039 Lines • Show Last 20 Lines

clang/include/clang/Basic/CodeGenOptions.def

	Show First 20 Lines • Show All 228 Lines • ▼ Show 20 Lines
	CODEGENOPT(TimePasses , 1, 0) ///< Set when -ftime-report is enabled.			CODEGENOPT(TimePasses , 1, 0) ///< Set when -ftime-report is enabled.
	CODEGENOPT(TimeTrace , 1, 0) ///< Set when -ftime-trace is enabled.			CODEGENOPT(TimeTrace , 1, 0) ///< Set when -ftime-trace is enabled.
	VALUE_CODEGENOPT(TimeTraceGranularity, 32, 500) ///< Minimum time granularity (in microseconds),			VALUE_CODEGENOPT(TimeTraceGranularity, 32, 500) ///< Minimum time granularity (in microseconds),
	///< traced by time profiler			///< traced by time profiler
	CODEGENOPT(UnrollLoops , 1, 0) ///< Control whether loops are unrolled.			CODEGENOPT(UnrollLoops , 1, 0) ///< Control whether loops are unrolled.
	CODEGENOPT(RerollLoops , 1, 0) ///< Control whether loops are rerolled.			CODEGENOPT(RerollLoops , 1, 0) ///< Control whether loops are rerolled.
	CODEGENOPT(NoUseJumpTables , 1, 0) ///< Set when -fno-jump-tables is enabled.			CODEGENOPT(NoUseJumpTables , 1, 0) ///< Set when -fno-jump-tables is enabled.
	CODEGENOPT(UnsafeFPMath , 1, 0) ///< Allow unsafe floating point optzns.			CODEGENOPT(UnsafeFPMath , 1, 0) ///< Allow unsafe floating point optzns.
				CODEGENOPT(RoundingFPMath , 1, 0) ///< Rounding floating point optzns.
				rjmccallUnsubmitted Done Reply Inline Actions Why do we need both a code-gen option and a language option? rjmccall: Why do we need both a code-gen option and a language option?
				mibintcAuthorUnsubmitted Not Done Reply Inline Actions The main reason i added it to LangOptions.h is because I saw the FPContract support in there and I thought I'd get on that bandwagon. My ultimate goal, after committing the command line options, is to add support for controlling rounding mode and exception behavior with pragma's embedded in the functions, similar to https://reviews.llvm.org/D69272. There's a patch here that I like, to add rounding-mode and exception-behavior to FPOptions https://reviews.llvm.org/D65994, but it hasn't been committed yet. mibintc: The main reason i added it to LangOptions.h is because I saw the FPContract support in there…
				mibintcAuthorUnsubmitted Done Reply Inline Actions I dropped the code-gen option. mibintc: I dropped the code-gen option.
	CODEGENOPT(UnwindTables , 1, 0) ///< Emit unwind tables.			CODEGENOPT(UnwindTables , 1, 0) ///< Emit unwind tables.
	CODEGENOPT(VectorizeLoop , 1, 0) ///< Run loop vectorizer.			CODEGENOPT(VectorizeLoop , 1, 0) ///< Run loop vectorizer.
	CODEGENOPT(VectorizeSLP , 1, 0) ///< Run SLP vectorizer.			CODEGENOPT(VectorizeSLP , 1, 0) ///< Run SLP vectorizer.
	CODEGENOPT(ProfileSampleAccurate, 1, 0) ///< Sample profile is accurate.			CODEGENOPT(ProfileSampleAccurate, 1, 0) ///< Sample profile is accurate.

	/// Attempt to use register sized accesses to bit-fields in structures, when			/// Attempt to use register sized accesses to bit-fields in structures, when
	/// possible.			/// possible.
	CODEGENOPT(UseRegisterSizedBitfieldAccess , 1, 0)			CODEGENOPT(UseRegisterSizedBitfieldAccess , 1, 0)
	▲ Show 20 Lines • Show All 129 Lines • Show Last 20 Lines

clang/include/clang/Basic/LangOptions.h

Show First 20 Lines • Show All 178 Lines • ▼ Show 20 Lines	public:

// TODO: merge FEnvAccessModeKind and FPContractModeKind		// TODO: merge FEnvAccessModeKind and FPContractModeKind
enum FEnvAccessModeKind {		enum FEnvAccessModeKind {
FEA_Off,		FEA_Off,

FEA_On		FEA_On
};		};

		enum FPRoundingModeKind {
		mibintcAuthorUnsubmitted Done Reply Inline Actions Currently there's no way to get at any of these values besides ToNearest and Dynamic, but I put all the supported values here to support future work mibintc: Currently there's no way to get at any of these values besides ToNearest and Dynamic, but I put…
		// Round to the nearest integer - IEEE rounding mode
		FPRM_ToNearest, // This is the default
		// Rounding mode is dynamic: optimizer assumes that rounding mode
		// is unknown.
		FPRM_Dynamic,
		// Round down - IEEE rounding mode
		FPRM_Downward,
		// Round up - IEEE rounding mode
		FPRM_Upward,
		// Round towards zero - IEEE rounding mode
		FPRM_ToZero
		};

		enum FPExceptionBehaviorKind {
		// Floating point exceptions are not handled: fp exceptions are masked.
		FPEB_Ignore, // This is the default
		mibintcAuthorUnsubmitted Done Reply Inline Actions -fno-trapping-math implemented by selecting -ffp-exception-behavior=ignore and -ftrapping-math is implemented by selecting -ffp-exception-behavior=strict. What do you think about making ftrapping-math a Driver only option, so that Driver converts the values like this. Otherwise let's make fp-exception-behavior take precedence, in llvm, over ftrapping-math (trapping math is t/f but exception behavior, in the llvm Constrained Floating Point Intrinsics, can take 3 values) mibintc: -fno-trapping-math implemented by selecting -ffp-exception-behavior=ignore and -ftrapping-math…
		rjmccallUnsubmitted Not Done Reply Inline Actions If your new option subsumes existing ones, I think making it the frontend option is sensible. rjmccall: If your new option subsumes existing ones, I think making it the frontend option is sensible.
		// Optimizer will avoid transformations that may raise exceptions that would
		// not have been raised by unoptimized code
		FPEB_MayTrap,
		// Optimizer will strictly preserve the fp exception semantics of the
		// unoptimized code
		FPEB_Strict
		};

enum class LaxVectorConversionKind {		enum class LaxVectorConversionKind {
/// Permit no implicit vector bitcasts.		/// Permit no implicit vector bitcasts.
None,		None,
/// Permit vector bitcasts between integer vectors with different numbers		/// Permit vector bitcasts between integer vectors with different numbers
/// of elements but the same total bit-width.		/// of elements but the same total bit-width.
Integer,		Integer,
/// Permit vector bitcasts between all vectors with the same total		/// Permit vector bitcasts between all vectors with the same total
/// bit-width.		/// bit-width.
▲ Show 20 Lines • Show All 117 Lines • ▼ Show 20 Lines	#include "clang/Basic/LangOptions.def"
}		}

bool assumeFunctionsAreConvergent() const {		bool assumeFunctionsAreConvergent() const {
return (CUDA && CUDAIsDevice) \|\| OpenCL;		return (CUDA && CUDAIsDevice) \|\| OpenCL;
}		}

/// Return the OpenCL C or C++ version as a VersionTuple.		/// Return the OpenCL C or C++ version as a VersionTuple.
VersionTuple getOpenCLVersionTuple() const;		VersionTuple getOpenCLVersionTuple() const;

		rjmccallUnsubmitted Not Done Reply Inline Actions Spurious change. rjmccall: Spurious change.
		/// Floating point model options
		class FPModelOptions {
		public:
		FPModelOptions() : FPRM(LangOptions::FPRM_ToNearest),
		FPEB(LangOptions::FPEB_Ignore) {}

		LangOptions::FPRoundingModeKind getFPRoundingModeSetting() const {
		return FPRM;
		}
		void setFPRoundingModeSetting(LangOptions::FPRoundingModeKind Value) {
		FPRM = Value;
		}

		LangOptions::FPExceptionBehaviorKind getFPExceptionBehaviorSetting() const
		{
		return FPEB;
		}
		void setFPExceptionBehaviorSetting(
		LangOptions::FPExceptionBehaviorKind Value) {
		FPEB = Value;
		}
		rjmccallUnsubmitted Not Done Reply Inline Actions Everything here is a "setting", and in the context of this type they're all FP. Please name these methods something like `getRoundingMode()`. Does this structure really need to exist as opposed to tracking the dimensions separately? Don't we already track some of this somewhere? We should subsume that state into these values rather than tracking them separately. rjmccall: Everything here is a "setting", and in the context of this type they're all FP. Please name…
		mibintcAuthorUnsubmitted Done Reply Inline Actions I fixed the spelling, I also dropped the structure and used the ENUM_OPT macro instead of writing out the setter and getter. Look OK now? mibintc: I fixed the spelling, I also dropped the structure and used the ENUM_OPT macro instead of…

		private:
		LangOptions::FPRoundingModeKind FPRM = LangOptions::FPRM_ToNearest;
		LangOptions::FPExceptionBehaviorKind FPEB = LangOptions::FPEB_Ignore;
		};

		FPModelOptions& getFPMOptions() { return fpm_options; }
		FPModelOptions getFPMOptions() const { return fpm_options; }
		private:
		FPModelOptions fpm_options;
};		};

/// Floating point control options		/// Floating point control options
class FPOptions {		class FPOptions {
public:		public:
FPOptions() : fp_contract(LangOptions::FPC_Off),		FPOptions() : fp_contract(LangOptions::FPC_Off),
fenv_access(LangOptions::FEA_Off) {}		fenv_access(LangOptions::FEA_Off) {}

▲ Show 20 Lines • Show All 65 Lines • Show Last 20 Lines

clang/include/clang/Driver/Options.td

Show First 20 Lines • Show All 918 Lines • ▼ Show 20 Lines
def : Flag<["-"], "fno-expensive-optimizations">, Group<clang_ignored_gcc_optimization_f_Group>;		def : Flag<["-"], "fno-expensive-optimizations">, Group<clang_ignored_gcc_optimization_f_Group>;
def fextdirs_EQ : Joined<["-"], "fextdirs=">, Group<f_Group>;		def fextdirs_EQ : Joined<["-"], "fextdirs=">, Group<f_Group>;
def : Flag<["-"], "fdefer-pop">, Group<clang_ignored_gcc_optimization_f_Group>;		def : Flag<["-"], "fdefer-pop">, Group<clang_ignored_gcc_optimization_f_Group>;
def : Flag<["-"], "fno-defer-pop">, Group<clang_ignored_gcc_optimization_f_Group>;		def : Flag<["-"], "fno-defer-pop">, Group<clang_ignored_gcc_optimization_f_Group>;
def : Flag<["-"], "fextended-identifiers">, Group<clang_ignored_f_Group>;		def : Flag<["-"], "fextended-identifiers">, Group<clang_ignored_f_Group>;
def : Flag<["-"], "fno-extended-identifiers">, Group<f_Group>, Flags<[Unsupported]>;		def : Flag<["-"], "fno-extended-identifiers">, Group<f_Group>, Flags<[Unsupported]>;
def fhosted : Flag<["-"], "fhosted">, Group<f_Group>;		def fhosted : Flag<["-"], "fhosted">, Group<f_Group>;
def fdenormal_fp_math_EQ : Joined<["-"], "fdenormal-fp-math=">, Group<f_Group>, Flags<[CC1Option]>;		def fdenormal_fp_math_EQ : Joined<["-"], "fdenormal-fp-math=">, Group<f_Group>, Flags<[CC1Option]>;
		def ffp_model_EQ : Joined<["-"], "ffp-model=">, Group<f_Group>, Flags<[DriverOption]>,
		mibintcAuthorUnsubmitted Done Reply Inline Actions The ffp-model= option is just a Driver option, it is rewritten into combinations of lower level options like ffp-exception-behavior and frounding-math: it's not a cc1 option. mibintc: The ffp-model= option is just a Driver option, it is rewritten into combinations of lower…
		HelpText<"Controls the semantics of floating-point calculations.">;
		def ffp_exception_behavior_EQ : Joined<["-"], "ffp-exception-behavior=">, Group<f_Group>, Flags<[CC1Option]>,
		HelpText<"Specifies the exception behavior of floating-point operations.">;
def ffast_math : Flag<["-"], "ffast-math">, Group<f_Group>, Flags<[CC1Option]>,		def ffast_math : Flag<["-"], "ffast-math">, Group<f_Group>, Flags<[CC1Option]>,
HelpText<"Allow aggressive, lossy floating-point optimizations">;		HelpText<"Allow aggressive, lossy floating-point optimizations">;
def fno_fast_math : Flag<["-"], "fno-fast-math">, Group<f_Group>;		def fno_fast_math : Flag<["-"], "fno-fast-math">, Group<f_Group>;
def fmath_errno : Flag<["-"], "fmath-errno">, Group<f_Group>, Flags<[CC1Option]>,		def fmath_errno : Flag<["-"], "fmath-errno">, Group<f_Group>, Flags<[CC1Option]>,
HelpText<"Require math functions to indicate errors by setting errno">;		HelpText<"Require math functions to indicate errors by setting errno">;
def fno_math_errno : Flag<["-"], "fno-math-errno">, Group<f_Group>;		def fno_math_errno : Flag<["-"], "fno-math-errno">, Group<f_Group>;
def fbracket_depth_EQ : Joined<["-"], "fbracket-depth=">, Group<f_Group>, Flags<[CoreOption]>;		def fbracket_depth_EQ : Joined<["-"], "fbracket-depth=">, Group<f_Group>, Flags<[CoreOption]>;
def fsignaling_math : Flag<["-"], "fsignaling-math">, Group<f_Group>;		def fsignaling_math : Flag<["-"], "fsignaling-math">, Group<f_Group>;
▲ Show 20 Lines • Show All 200 Lines • ▼ Show 20 Lines	def fno_signed_zeros :
HelpText<"Allow optimizations that ignore the sign of floating point zeros">;		HelpText<"Allow optimizations that ignore the sign of floating point zeros">;
def fhonor_nans : Flag<["-"], "fhonor-nans">, Group<f_Group>;		def fhonor_nans : Flag<["-"], "fhonor-nans">, Group<f_Group>;
def fno_honor_nans : Flag<["-"], "fno-honor-nans">, Group<f_Group>;		def fno_honor_nans : Flag<["-"], "fno-honor-nans">, Group<f_Group>;
def fhonor_infinities : Flag<["-"], "fhonor-infinities">, Group<f_Group>;		def fhonor_infinities : Flag<["-"], "fhonor-infinities">, Group<f_Group>;
def fno_honor_infinities : Flag<["-"], "fno-honor-infinities">, Group<f_Group>;		def fno_honor_infinities : Flag<["-"], "fno-honor-infinities">, Group<f_Group>;
// This option was originally misspelt "infinites" [sic].		// This option was originally misspelt "infinites" [sic].
def : Flag<["-"], "fhonor-infinites">, Alias<fhonor_infinities>;		def : Flag<["-"], "fhonor-infinites">, Alias<fhonor_infinities>;
def : Flag<["-"], "fno-honor-infinites">, Alias<fno_honor_infinities>;		def : Flag<["-"], "fno-honor-infinites">, Alias<fno_honor_infinities>;
		def frounding_math : Flag<["-"], "frounding-math">, Group<f_Group>, Flags<[CC1Option]>;
		def fno_rounding_math : Flag<["-"], "fno-rounding-math">, Group<f_Group>, Flags<[CC1Option]>;
		rjmccallUnsubmitted Not Done Reply Inline Actions It looks like both of these can now be written with `BooleanFFlag`. rjmccall: It looks like both of these can now be written with `BooleanFFlag`.
		mibintcAuthorUnsubmitted Done Reply Inline Actions BooleanFFlag doesn't work, there's a FIXME message saying that prefixes don't work, currently they are only being used for unimplemented options. llvm/clang/lib/Driver/ToolChains/Clang.cpp:2301:17: error: ‘OPT_frounding_math’ is not a member of ‘clang::driver::options’ optID = options::OPT_frounding_math; ^ mibintc: BooleanFFlag doesn't work, there's a FIXME message saying that prefixes don't work, currently…
def ftrapping_math : Flag<["-"], "ftrapping-math">, Group<f_Group>, Flags<[CC1Option]>;		def ftrapping_math : Flag<["-"], "ftrapping-math">, Group<f_Group>, Flags<[CC1Option]>;
def fno_trapping_math : Flag<["-"], "fno-trapping-math">, Group<f_Group>, Flags<[CC1Option]>;		def fno_trapping_math : Flag<["-"], "fno-trapping-math">, Group<f_Group>, Flags<[CC1Option]>;
def ffp_contract : Joined<["-"], "ffp-contract=">, Group<f_Group>,		def ffp_contract : Joined<["-"], "ffp-contract=">, Group<f_Group>,
Flags<[CC1Option]>, HelpText<"Form fused FP ops (e.g. FMAs): fast (everywhere)"		Flags<[CC1Option]>, HelpText<"Form fused FP ops (e.g. FMAs): fast (everywhere)"
" \| on (according to FP_CONTRACT pragma, default) \| off (never fuse)">, Values<"fast,on,off">;		" \| on (according to FP_CONTRACT pragma, default) \| off (never fuse)">, Values<"fast,on,off">;

def fstrict_float_cast_overflow : Flag<["-"],		def fstrict_float_cast_overflow : Flag<["-"],
"fstrict-float-cast-overflow">, Group<f_Group>, Flags<[CC1Option]>,		"fstrict-float-cast-overflow">, Group<f_Group>, Flags<[CC1Option]>,
▲ Show 20 Lines • Show All 2,012 Lines • ▼ Show 20 Lines
defm profile : BooleanFFlag<"profile">, Group<clang_ignored_f_Group>;		defm profile : BooleanFFlag<"profile">, Group<clang_ignored_f_Group>;
defm profile_correction : BooleanFFlag<"profile-correction">, Group<clang_ignored_gcc_optimization_f_Group>;		defm profile_correction : BooleanFFlag<"profile-correction">, Group<clang_ignored_gcc_optimization_f_Group>;
defm profile_generate_sampling : BooleanFFlag<"profile-generate-sampling">, Group<clang_ignored_f_Group>;		defm profile_generate_sampling : BooleanFFlag<"profile-generate-sampling">, Group<clang_ignored_f_Group>;
defm profile_reusedist : BooleanFFlag<"profile-reusedist">, Group<clang_ignored_f_Group>;		defm profile_reusedist : BooleanFFlag<"profile-reusedist">, Group<clang_ignored_f_Group>;
defm profile_values : BooleanFFlag<"profile-values">, Group<clang_ignored_gcc_optimization_f_Group>;		defm profile_values : BooleanFFlag<"profile-values">, Group<clang_ignored_gcc_optimization_f_Group>;
defm regs_graph : BooleanFFlag<"regs-graph">, Group<clang_ignored_f_Group>;		defm regs_graph : BooleanFFlag<"regs-graph">, Group<clang_ignored_f_Group>;
defm rename_registers : BooleanFFlag<"rename-registers">, Group<clang_ignored_gcc_optimization_f_Group>;		defm rename_registers : BooleanFFlag<"rename-registers">, Group<clang_ignored_gcc_optimization_f_Group>;
defm ripa : BooleanFFlag<"ripa">, Group<clang_ignored_f_Group>;		defm ripa : BooleanFFlag<"ripa">, Group<clang_ignored_f_Group>;
defm rounding_math : BooleanFFlag<"rounding-math">, Group<clang_ignored_gcc_optimization_f_Group>;
defm schedule_insns : BooleanFFlag<"schedule-insns">, Group<clang_ignored_gcc_optimization_f_Group>;		defm schedule_insns : BooleanFFlag<"schedule-insns">, Group<clang_ignored_gcc_optimization_f_Group>;
defm schedule_insns2 : BooleanFFlag<"schedule-insns2">, Group<clang_ignored_gcc_optimization_f_Group>;		defm schedule_insns2 : BooleanFFlag<"schedule-insns2">, Group<clang_ignored_gcc_optimization_f_Group>;
defm see : BooleanFFlag<"see">, Group<clang_ignored_f_Group>;		defm see : BooleanFFlag<"see">, Group<clang_ignored_f_Group>;
defm signaling_nans : BooleanFFlag<"signaling-nans">, Group<clang_ignored_gcc_optimization_f_Group>;		defm signaling_nans : BooleanFFlag<"signaling-nans">, Group<clang_ignored_gcc_optimization_f_Group>;
defm single_precision_constant : BooleanFFlag<"single-precision-constant">,		defm single_precision_constant : BooleanFFlag<"single-precision-constant">,
Group<clang_ignored_gcc_optimization_f_Group>;		Group<clang_ignored_gcc_optimization_f_Group>;
defm spec_constr_count : BooleanFFlag<"spec-constr-count">, Group<clang_ignored_f_Group>;		defm spec_constr_count : BooleanFFlag<"spec-constr-count">, Group<clang_ignored_f_Group>;
defm stack_check : BooleanFFlag<"stack-check">, Group<clang_ignored_f_Group>;		defm stack_check : BooleanFFlag<"stack-check">, Group<clang_ignored_f_Group>;
▲ Show 20 Lines • Show All 103 Lines • Show Last 20 Lines

clang/lib/CodeGen/BackendUtil.cpp

Show First 20 Lines • Show All 469 Lines • ▼ Show 20 Lines	if (LangOpts.DWARFExceptions)
Options.ExceptionModel = llvm::ExceptionHandling::DwarfCFI;		Options.ExceptionModel = llvm::ExceptionHandling::DwarfCFI;
if (LangOpts.WasmExceptions)		if (LangOpts.WasmExceptions)
Options.ExceptionModel = llvm::ExceptionHandling::Wasm;		Options.ExceptionModel = llvm::ExceptionHandling::Wasm;

Options.NoInfsFPMath = CodeGenOpts.NoInfsFPMath;		Options.NoInfsFPMath = CodeGenOpts.NoInfsFPMath;
Options.NoNaNsFPMath = CodeGenOpts.NoNaNsFPMath;		Options.NoNaNsFPMath = CodeGenOpts.NoNaNsFPMath;
Options.NoZerosInBSS = CodeGenOpts.NoZeroInitializedInBSS;		Options.NoZerosInBSS = CodeGenOpts.NoZeroInitializedInBSS;
Options.UnsafeFPMath = CodeGenOpts.UnsafeFPMath;		Options.UnsafeFPMath = CodeGenOpts.UnsafeFPMath;
		Options.RoundingFPMath = CodeGenOpts.RoundingFPMath;
Options.StackAlignmentOverride = CodeGenOpts.StackAlignment;		Options.StackAlignmentOverride = CodeGenOpts.StackAlignment;
Options.FunctionSections = CodeGenOpts.FunctionSections;		Options.FunctionSections = CodeGenOpts.FunctionSections;
Options.DataSections = CodeGenOpts.DataSections;		Options.DataSections = CodeGenOpts.DataSections;
Options.UniqueSectionNames = CodeGenOpts.UniqueSectionNames;		Options.UniqueSectionNames = CodeGenOpts.UniqueSectionNames;
Options.EmulatedTLS = CodeGenOpts.EmulatedTLS;		Options.EmulatedTLS = CodeGenOpts.EmulatedTLS;
Options.ExplicitEmulatedTLS = CodeGenOpts.ExplicitEmulatedTLS;		Options.ExplicitEmulatedTLS = CodeGenOpts.ExplicitEmulatedTLS;
Options.DebuggerTuning = CodeGenOpts.getDebuggerTuning();		Options.DebuggerTuning = CodeGenOpts.getDebuggerTuning();
Options.EmitStackSizeSection = CodeGenOpts.StackSizeSection;		Options.EmitStackSizeSection = CodeGenOpts.StackSizeSection;
▲ Show 20 Lines • Show All 1,191 Lines • Show Last 20 Lines

clang/lib/CodeGen/CodeGenFunction.h

Show First 20 Lines • Show All 4,142 Lines • ▼ Show 20 Lines	public:
/// a r-value suitable for passing the given parameter.		/// a r-value suitable for passing the given parameter.
void EmitDelegateCallArg(CallArgList &args, const VarDecl *param,		void EmitDelegateCallArg(CallArgList &args, const VarDecl *param,
SourceLocation loc);		SourceLocation loc);

/// SetFPAccuracy - Set the minimum required accuracy of the given floating		/// SetFPAccuracy - Set the minimum required accuracy of the given floating
/// point operation, expressed as the maximum relative error in ulp.		/// point operation, expressed as the maximum relative error in ulp.
void SetFPAccuracy(llvm::Value *Val, float Accuracy);		void SetFPAccuracy(llvm::Value *Val, float Accuracy);

		/// SetFPModel - Control floating point behavior via fp-model settings.
		void SetFPModel(void);
		rjmccallUnsubmitted Done Reply Inline Actions Don't use `(void)`, please. rjmccall: Don't use `(void)`, please.

private:		private:
llvm::MDNode *getRangeForLoadFromType(QualType Ty);		llvm::MDNode *getRangeForLoadFromType(QualType Ty);
void EmitReturnOfRValue(RValue RV, QualType Ty);		void EmitReturnOfRValue(RValue RV, QualType Ty);

void deferPlaceholderReplacement(llvm::Instruction Old, llvm::Value New);		void deferPlaceholderReplacement(llvm::Instruction Old, llvm::Value New);

llvm::SmallVector<std::pair<llvm::Instruction , llvm::Value >, 4>		llvm::SmallVector<std::pair<llvm::Instruction , llvm::Value >, 4>
DeferredReplacements;		DeferredReplacements;
▲ Show 20 Lines • Show All 235 Lines • Show Last 20 Lines

clang/lib/CodeGen/CodeGenFunction.cpp

Show All 27 Lines
#include "clang/AST/StmtObjC.h"		#include "clang/AST/StmtObjC.h"
#include "clang/Basic/Builtins.h"		#include "clang/Basic/Builtins.h"
#include "clang/Basic/CodeGenOptions.h"		#include "clang/Basic/CodeGenOptions.h"
#include "clang/Basic/TargetInfo.h"		#include "clang/Basic/TargetInfo.h"
#include "clang/CodeGen/CGFunctionInfo.h"		#include "clang/CodeGen/CGFunctionInfo.h"
#include "clang/Frontend/FrontendDiagnostic.h"		#include "clang/Frontend/FrontendDiagnostic.h"
#include "llvm/IR/DataLayout.h"		#include "llvm/IR/DataLayout.h"
#include "llvm/IR/Dominators.h"		#include "llvm/IR/Dominators.h"
		#include "llvm/IR/IntrinsicInst.h"
#include "llvm/IR/Intrinsics.h"		#include "llvm/IR/Intrinsics.h"
#include "llvm/IR/MDBuilder.h"		#include "llvm/IR/MDBuilder.h"
#include "llvm/IR/Operator.h"		#include "llvm/IR/Operator.h"
#include "llvm/Transforms/Utils/PromoteMemToReg.h"		#include "llvm/Transforms/Utils/PromoteMemToReg.h"
using namespace clang;		using namespace clang;
using namespace CodeGen;		using namespace CodeGen;

/// shouldEmitLifetimeMarkers - Decide whether we need emit the life-time		/// shouldEmitLifetimeMarkers - Decide whether we need emit the life-time
Show All 38 Lines	CodeGenFunction::CodeGenFunction(CodeGenModule &cgm, bool suppressNewContext)
}		}
if (CGM.getCodeGenOpts().ReciprocalMath) {		if (CGM.getCodeGenOpts().ReciprocalMath) {
FMF.setAllowReciprocal();		FMF.setAllowReciprocal();
}		}
if (CGM.getCodeGenOpts().Reassociate) {		if (CGM.getCodeGenOpts().Reassociate) {
FMF.setAllowReassoc();		FMF.setAllowReassoc();
}		}
Builder.setFastMathFlags(FMF);		Builder.setFastMathFlags(FMF);
		SetFPModel();
}		}

CodeGenFunction::~CodeGenFunction() {		CodeGenFunction::~CodeGenFunction() {
assert(LifetimeExtendedCleanupStack.empty() && "failed to emit a cleanup");		assert(LifetimeExtendedCleanupStack.empty() && "failed to emit a cleanup");

// If there are any unclaimed block infos, go ahead and destroy them		// If there are any unclaimed block infos, go ahead and destroy them
// now. This can happen if IR-gen gets clever and skips evaluating		// now. This can happen if IR-gen gets clever and skips evaluating
// something.		// something.
if (FirstBlockInfo)		if (FirstBlockInfo)
destroyBlockInfos(FirstBlockInfo);		destroyBlockInfos(FirstBlockInfo);

if (getLangOpts().OpenMP && CurFn)		if (getLangOpts().OpenMP && CurFn)
CGM.getOpenMPRuntime().functionFinished(*this);		CGM.getOpenMPRuntime().functionFinished(*this);
}		}

		void CodeGenFunction::SetFPModel(void)
		{
		rjmccallUnsubmitted Done Reply Inline Actions Code style: please use `()` instead of `(void)`, and please place open-braces on the same line as the declaration. rjmccall: Code style: please use `()` instead of `(void)`, and please place open-braces on the same line…
		auto fpRoundingMode = getLangOpts().getFPMOptions().getFPRoundingModeSetting();
		auto fpExceptionBehavior =
		getLangOpts().getFPMOptions().getFPExceptionBehaviorSetting();

		// Translate the compiler options into
		// the settings that are transmitted to the IR Builder
		llvm::ConstrainedFPIntrinsic::RoundingMode ConstrainedRoundingMD;
		llvm::ConstrainedFPIntrinsic::ExceptionBehavior ConstrainedExceptMD;

		switch (fpRoundingMode) {
		case LangOptions::FPRM_ToNearest:
		ConstrainedRoundingMD = llvm::ConstrainedFPIntrinsic::rmToNearest;
		break;
		case LangOptions::FPRM_Downward:
		ConstrainedRoundingMD = llvm::ConstrainedFPIntrinsic::rmDownward;
		kpnUnsubmitted Not Done Reply Inline Actions Wait, so "fast" and "precise" are the same thing? That doesn't sound like where the documentation you put in the ticket says "the compiler preserves the source expression ordering and rounding properties of floating-point". (Yes, I saw below where "fast" turns on the fast math flags but "precise" doesn't. That doesn't affect my point here.) kpn: Wait, so "fast" and "precise" are the same thing? That doesn't sound like where the…
		mibintcAuthorUnsubmitted Not Done Reply Inline Actions "precise" doesn't necessitate the use of Constrained Intrinsics, And likewise for "fast". The words "compiler preserves the source expression ordering" were copied from the msdn documentation for /fp:precise as you explained it would be useful to have the msdn documentation for the option in case it goes offline in, say, 30 years. The ICL Intel compiler also provides equivalent floating point options. The Intel documentation for precise is phrased differently "Disables optimizations that are not value-safe on floating-point data." fp-model=precise should enable contractions, if that's not true at default (I mean, clang -c) then this patch is missing that. fp-model=fast is the same as requesting ffast-math mibintc: "precise" doesn't necessitate the use of Constrained Intrinsics, And likewise for "fast". The…
		mibintcAuthorUnsubmitted Not Done Reply Inline Actions Well, we haven't heard from Andy yet, but he told me some time ago that /fp:precise corresponds more or less (there was wiggle room) to clang's default behavior. It sounds like you think the description in the msdn of /fp:precise isn't describing clang's default behavior, @kpn can you say more about that, and do you think that ConstrainedIntrinsics should be created to provide the semantics of /fp:precise? mibintc: Well, we haven't heard from Andy yet, but he told me some time ago that /fp:precise corresponds…
		andrew.w.kaylorUnsubmitted Not Done Reply Inline Actions "Precise" means that no value unsafe optimizations will be performed. That's what LLVM does by default. As long as no fast math flags are set, we will not perform optimizations that are not value safe. andrew.w.kaylor: "Precise" means that no value unsafe optimizations will be performed. That's what LLVM does by…
		kpnUnsubmitted Not Done Reply Inline Actions OK, I stand corrected. kpn: OK, I stand corrected.
		break;
		case LangOptions::FPRM_Upward:
		ConstrainedRoundingMD = llvm::ConstrainedFPIntrinsic::rmUpward;
		break;
		case LangOptions::FPRM_ToZero:
		ConstrainedRoundingMD = llvm::ConstrainedFPIntrinsic::rmTowardZero;
		break;
		case LangOptions::FPRM_Dynamic:
		ConstrainedRoundingMD = llvm::ConstrainedFPIntrinsic::rmDynamic;
		break;
		mibintcAuthorUnsubmitted Not Done Reply Inline Actions I added these 2 functions, is this what you have in mind or do you want me to write them differently? mibintc: I added these 2 functions, is this what you have in mind or do you want me to write them…
		rjmccallUnsubmitted Not Done Reply Inline Actions Slightly differently, yes, please. static llvm::ConstrainedFPIntrinsic::ExceptionBehavior getConstrainedExceptionBehavior(LangOptions;:FPExceptionModeKind kind) { switch (kind) { case LangOptions::FPE_Ignore: return llvm::ConstrainedFPIntrinsic::ebIgnore; // ...rest of cases here... // no default: should be exhaustive over the enum } llvm_unreachable("bad kind"); } rjmccall: Slightly differently, yes, please. ``` static llvm::ConstrainedFPIntrinsic::ExceptionBehavior…
		mibintcAuthorUnsubmitted Done Reply Inline Actions sorry i missed that detail (static) the first time around mibintc: sorry i missed that detail (static) the first time around
		default:
		llvm_unreachable("Unsupported FP RoundingMode");
		}

		rjmccallUnsubmitted Not Done Reply Inline Actions Sorry for dragging this out, but is there a reason these need to be member functions on `CodeGenFunction` rather than just `static` functions in this file? rjmccall: Sorry for dragging this out, but is there a reason these need to be member functions on…
		switch (fpExceptionBehavior) {
		case LangOptions::FPEB_Ignore:
		ConstrainedExceptMD = llvm::ConstrainedFPIntrinsic::ebIgnore;
		break;
		case LangOptions::FPEB_MayTrap:
		ConstrainedExceptMD = llvm::ConstrainedFPIntrinsic::ebMayTrap;
		break;
		case LangOptions::FPEB_Strict:
		ConstrainedExceptMD = llvm::ConstrainedFPIntrinsic::ebStrict;
		break;
		default:
		rjmccallUnsubmitted Not Done Reply Inline Actions Please make functions that do these translations, and please make them use exhaustive switches with `llvm_unreachable` at the end. rjmccall: Please make functions that do these translations, and please make them use exhaustive switches…
		llvm_unreachable("Unsupported FP Exception Behavior");
		}

		if (fpExceptionBehavior == LangOptions::FPEB_Ignore &&
		fpRoundingMode == LangOptions::FPRM_ToNearest)
		// Constrained intrinsics are not used.
		;
		else {
		Builder.setIsFPConstrained(true);
		Builder.setDefaultConstrainedRounding(ConstrainedRoundingMD);
		Builder.setDefaultConstrainedExcept(ConstrainedExceptMD);
		}
		}

CharUnits CodeGenFunction::getNaturalPointeeTypeAlignment(QualType T,		CharUnits CodeGenFunction::getNaturalPointeeTypeAlignment(QualType T,
LValueBaseInfo *BaseInfo,		LValueBaseInfo *BaseInfo,
TBAAAccessInfo *TBAAInfo) {		TBAAAccessInfo *TBAAInfo) {
return getNaturalTypeAlignment(T->getPointeeType(), BaseInfo, TBAAInfo,		return getNaturalTypeAlignment(T->getPointeeType(), BaseInfo, TBAAInfo,
/* forPointeeType= */ true);		/* forPointeeType= */ true);
}		}

CharUnits CodeGenFunction::getNaturalTypeAlignment(QualType T,		CharUnits CodeGenFunction::getNaturalTypeAlignment(QualType T,
▲ Show 20 Lines • Show All 2,295 Lines • Show Last 20 Lines

clang/lib/Driver/ToolChains/Clang.cpp

Show First 20 Lines • Show All 2,317 Lines • ▼ Show 20 Lines	static void RenderFloatingPointOptions(const ToolChain &TC, const Driver &D,
// LLVM flags based on the final state.		// LLVM flags based on the final state.
bool HonorINFs = true;		bool HonorINFs = true;
bool HonorNaNs = true;		bool HonorNaNs = true;
// -fmath-errno is the default on some platforms, e.g. BSD-derived OSes.		// -fmath-errno is the default on some platforms, e.g. BSD-derived OSes.
bool MathErrno = TC.IsMathErrnoDefault();		bool MathErrno = TC.IsMathErrnoDefault();
bool AssociativeMath = false;		bool AssociativeMath = false;
bool ReciprocalMath = false;		bool ReciprocalMath = false;
bool SignedZeros = true;		bool SignedZeros = true;
bool TrappingMath = true;		bool TrappingMath = false;
		mibintcAuthorUnsubmitted Done Reply Inline Actions By default, floating point exceptions are masked. Previously this was set to true, but the value wasn't used. This patch implements support for trapping-math mibintc: By default, floating point exceptions are masked. Previously this was set to true, but the…
		bool RoundingFPMath = false;
		bool RoundingMathPresent = false;
		// -fp-model options:
		StringRef FPModel = "";
		//bool FPM_Precise = false;
		//bool FPM_Strict = false;
		//bool FPM_Fast = false;
		// -fp-exception-behavior options:
		StringRef FPExceptionBehavior = "";
		//bool FPE_Ignore = false;
		//bool FPE_MayTrap = false;
		//bool FPE_Stric = false;
StringRef DenormalFPMath = "";		StringRef DenormalFPMath = "";
StringRef FPContract = "";		StringRef FPContract = "";
		bool StrictFPModel = false;

if (const Arg *A = Args.getLastArg(options::OPT_flimited_precision_EQ)) {		if (const Arg *A = Args.getLastArg(options::OPT_flimited_precision_EQ)) {
CmdArgs.push_back("-mlimit-float-precision");		CmdArgs.push_back("-mlimit-float-precision");
CmdArgs.push_back(A->getValue());		CmdArgs.push_back(A->getValue());
}		}

for (const Arg *A : Args) {		for (const Arg *A : Args) {
switch (A->getOption().getID()) {		auto optID = A->getOption().getID();
		bool PreciseFPModel = false;
		switch (optID) {
		default:
		break;
		case options::OPT_ffp_model_EQ: {
		StrictFPModel = false;
		PreciseFPModel = true;
		// ffp-model= is a Driver option, it is entirely rewritten into more
		// granular options before being passed into cc1.
		// Use the gcc option in the switch below.
		StringRef Val = A->getValue();
		if (!FPModel.empty() && !FPModel.equals(Val)) {
		D.Diag(clang::diag::warn_drv_overriding_flag_option)
		<< Args.MakeArgString("-ffp-model=" + FPModel)
		<< Args.MakeArgString("-ffp-model=" + Val);
		FPContract = "";
		}
		if (Val.equals("fast")) {
		if (!FPContract.empty() && !FPContract.equals("fast"))
		// FPContract has already been set to something else
		// so warn about the override.
		D.Diag(clang::diag::warn_drv_overriding_flag_option)
		<< Args.MakeArgString("-ffp-contract=" + FPContract)
		<< "-ffp-contract=fast";
		optID = options::OPT_ffast_math;
		FPModel = Val;
		FPContract = "fast";
		} else if (Val.equals("precise")) {
		if (!FPContract.empty() && !FPContract.equals("fast"))
		D.Diag(clang::diag::warn_drv_overriding_flag_option)
		<< Args.MakeArgString("-ffp-contract=" + FPContract)
		<< "-ffp-contract=fast";
		michele.scandaleUnsubmitted Done Reply Inline Actions Here it seems you are changing `optID` to `OPT_ffast_math` to reuse the logic specified below for that case to reset the state of the floating point options. michele.scandale: Here it seems you are changing `optID` to `OPT_ffast_math` to reuse the logic specified below…
		optID = options::OPT_ffp_contract;
		FPModel = Val;
		FPContract = "fast";
		PreciseFPModel = true;
		} else if (Val.equals("strict")) {
		StrictFPModel = true;
		if (!FPContract.empty() && !FPContract.equals("strict"))
		D.Diag(clang::diag::warn_drv_overriding_flag_option)
		michele.scandaleUnsubmitted Done Reply Inline Actions Here the state of the floating point options seems unchanged except for `FPContract`. If I run `clang -ffp-model=fast -ffp-model=precise`, I would expect the state of the floating point options to match the one of `-fno-fast-math` except for `FPContract` which you want to be set to "fast". I think you might need to replicate the reset for all the option here as well, so at this point I don't know how much worth is to use the optID reset trick for the "fast" case only. michele.scandale: Here the state of the floating point options seems unchanged except for `FPContract`. If I run…
		mibintcAuthorUnsubmitted Done Reply Inline Actions @michele.scandale Thanks for your helpful review, I think I fixed the things that you remarked on. I also added a test case for the assertion fail that you saw. mibintc: @michele.scandale Thanks for your helpful review, I think I fixed the things that you remarked…
		michele.scandaleUnsubmitted Not Done Reply Inline Actions Thanks! michele.scandale: Thanks!
		<< Args.MakeArgString("-ffp-contract=" + FPContract)
		<< "-ffp-contract=strict";
		optID = options::OPT_frounding_math;
		FPModel = Val;
		// fp-model=strict also enables fno-fast-math
		HonorINFs = true;
		HonorNaNs = true;
		// Turning on -ffast-math (with either flag) removes the need for
		// MathErrno. However, turning off -ffast-math merely restores the
		// toolchain default (which may be false).
		MathErrno = TC.IsMathErrnoDefault();
		AssociativeMath = false;
		ReciprocalMath = false;
		SignedZeros = true;
		TrappingMath = true;
		RoundingFPMath = true;
		// -fno_fast_math restores default denormal and fpcontract handling
		DenormalFPMath = "";
		FPContract = "";
		} else
		D.Diag(diag::err_drv_unsupported_option_argument)
		<< A->getOption().getName() << Val;
		break;
		}
		}

		switch (optID) {
// If this isn't an FP option skip the claim below		// If this isn't an FP option skip the claim below
default: continue;		default: continue;

// Options controlling individual features		// Options controlling individual features
case options::OPT_fhonor_infinities: HonorINFs = true; break;		case options::OPT_fhonor_infinities: HonorINFs = true; break;
case options::OPT_fno_honor_infinities: HonorINFs = false; break;		case options::OPT_fno_honor_infinities: HonorINFs = false; break;
case options::OPT_fhonor_nans: HonorNaNs = true; break;		case options::OPT_fhonor_nans: HonorNaNs = true; break;
case options::OPT_fno_honor_nans: HonorNaNs = false; break;		case options::OPT_fno_honor_nans: HonorNaNs = false; break;
case options::OPT_fmath_errno: MathErrno = true; break;		case options::OPT_fmath_errno: MathErrno = true; break;
case options::OPT_fno_math_errno: MathErrno = false; break;		case options::OPT_fno_math_errno: MathErrno = false; break;
case options::OPT_fassociative_math: AssociativeMath = true; break;		case options::OPT_fassociative_math: AssociativeMath = true; break;
case options::OPT_fno_associative_math: AssociativeMath = false; break;		case options::OPT_fno_associative_math: AssociativeMath = false; break;
case options::OPT_freciprocal_math: ReciprocalMath = true; break;		case options::OPT_freciprocal_math: ReciprocalMath = true; break;
case options::OPT_fno_reciprocal_math: ReciprocalMath = false; break;		case options::OPT_fno_reciprocal_math: ReciprocalMath = false; break;
case options::OPT_fsigned_zeros: SignedZeros = true; break;		case options::OPT_fsigned_zeros: SignedZeros = true; break;
case options::OPT_fno_signed_zeros: SignedZeros = false; break;		case options::OPT_fno_signed_zeros: SignedZeros = false; break;
case options::OPT_ftrapping_math: TrappingMath = true; break;		case options::OPT_ftrapping_math:
		TrappingMath = true;
		FPExceptionBehavior = "strict";
		break;
case options::OPT_fno_trapping_math: TrappingMath = false; break;		case options::OPT_fno_trapping_math: TrappingMath = false; break;
		case options::OPT_frounding_math:
		// The default setting for frounding-math is True and ffast-math
		// sets fno-rounding-math, but we only want to use constrained
		// floating point intrinsics if the option is specifically requested.
		RoundingFPMath = true;
		RoundingMathPresent = true;
		FPExceptionBehavior = "strict";
		break;
		case options::OPT_fno_rounding_math:
		RoundingFPMath = false;
		RoundingMathPresent = false;
		michele.scandaleUnsubmitted Not Done Reply Inline Actions Isn't the default `-fno-rounding-math`? michele.scandale: Isn't the default `-fno-rounding-math`?
		mibintcAuthorUnsubmitted Done Reply Inline Actions Yes the default is no rounding math, I'll remove the comment. Thank you. mibintc: Yes the default is no rounding math, I'll remove the comment. Thank you.
		FPExceptionBehavior = "";
		break;

case options::OPT_fdenormal_fp_math_EQ:		case options::OPT_fdenormal_fp_math_EQ:
DenormalFPMath = A->getValue();		DenormalFPMath = A->getValue();
break;		break;

// Validate and pass through -fp-contract option.		// Validate and pass through -fp-contract option.
		michele.scandaleUnsubmitted Not Done Reply Inline Actions Shouldn't this be set to `true` similarly to what you do for `TrappingMathPresent` to track whether there is an explicit option related to rounding math? michele.scandale: Shouldn't this be set to `true` similarly to what you do for `TrappingMathPresent ` to track…
		mibintcAuthorUnsubmitted Done Reply Inline Actions There's a switch statement above this that interprets the command line option -fp-model=strict as though frounding had appeared on the command line by assigning a new value to optID so that's why there is a discrepancy. Also I'm using the Present boolean variables to preserve the output from Driver so that pre-existing driver test cases don't need to be changed. mibintc:* There's a switch statement above this that interprets the command line option -fp-model=strict…
case options::OPT_ffp_contract: {		case options::OPT_ffp_contract: {
StringRef Val = A->getValue();		StringRef Val = A->getValue();
if (Val == "fast" \|\| Val == "on" \|\| Val == "off")		if (PreciseFPModel) {
		// -fp-model=precise enables fp-contract=fast as a side effect
		// the FPContract value has already been set to a string literal
		// and the Val string isn't a pertinent value.
		;
		} else if (Val.equals("fast") \|\| Val.equals("on") \|\| Val.equals("off"))
FPContract = Val;		FPContract = Val;
else		else
D.Diag(diag::err_drv_unsupported_option_argument)		D.Diag(diag::err_drv_unsupported_option_argument)
<< A->getOption().getName() << Val;		<< A->getOption().getName() << Val;
break;		break;
}		}

		// Validate and pass through -ffp-model option.
		case options::OPT_ffp_model_EQ:
		// This should only occur in the error case
		// since the optID has been replaced by a more granular
		// floating point option.
		break;

		// Validate and pass through -ffp-exception-behavior option.
		case options::OPT_ffp_exception_behavior_EQ: {
		StringRef Val = A->getValue();
		if (!FPExceptionBehavior.empty())
		// Warn that previous value of option is overridden.
		D.Diag(clang::diag::warn_drv_overriding_flag_option)
		<< Args.MakeArgString("-ffp-exception-behavior=" + FPExceptionBehavior)
		<< Args.MakeArgString("-ffp-exception-behavior=" + Val);
		if (Val.equals("ignore") \|\| Val.equals("maytrap") \|\| Val.equals("strict"))
		FPExceptionBehavior = Val;
		else
		D.Diag(diag::err_drv_unsupported_option_argument)
		<< A->getOption().getName() << Val;
		break;
		}

case options::OPT_ffinite_math_only:		case options::OPT_ffinite_math_only:
HonorINFs = false;		HonorINFs = false;
HonorNaNs = false;		HonorNaNs = false;
break;		break;
case options::OPT_fno_finite_math_only:		case options::OPT_fno_finite_math_only:
HonorINFs = true;		HonorINFs = true;
HonorNaNs = true;		HonorNaNs = true;
break;		break;
Show All 21 Lines	for (const Arg *A : Args) {
case options::OPT_ffast_math:		case options::OPT_ffast_math:
HonorINFs = false;		HonorINFs = false;
HonorNaNs = false;		HonorNaNs = false;
MathErrno = false;		MathErrno = false;
AssociativeMath = true;		AssociativeMath = true;
ReciprocalMath = true;		ReciprocalMath = true;
SignedZeros = false;		SignedZeros = false;
TrappingMath = false;		TrappingMath = false;
		RoundingFPMath = false;
// If fast-math is set then set the fp-contract mode to fast.		// If fast-math is set then set the fp-contract mode to fast.
FPContract = "fast";		FPContract = "fast";
break;		break;
case options::OPT_fno_fast_math:		case options::OPT_fno_fast_math:
HonorINFs = true;		HonorINFs = true;
HonorNaNs = true;		HonorNaNs = true;
// Turning on -ffast-math (with either flag) removes the need for		// Turning on -ffast-math (with either flag) removes the need for
// MathErrno. However, turning off -ffast-math merely restores the		// MathErrno. However, turning off -ffast-math merely restores the
// toolchain default (which may be false).		// toolchain default (which may be false).
MathErrno = TC.IsMathErrnoDefault();		MathErrno = TC.IsMathErrnoDefault();
AssociativeMath = false;		AssociativeMath = false;
ReciprocalMath = false;		ReciprocalMath = false;
SignedZeros = true;		SignedZeros = true;
TrappingMath = true;		TrappingMath = true;
		RoundingFPMath = true;
// -fno_fast_math restores default denormal and fpcontract handling		// -fno_fast_math restores default denormal and fpcontract handling
DenormalFPMath = "";		DenormalFPMath = "";
FPContract = "";		FPContract = "";
break;		break;
}		}
		if (StrictFPModel) {
		// If fp-model=strict has been specified on command line but
		// subsequent options conflict then emit warning diagnostic.
		if (HonorINFs && HonorNaNs &&
		!AssociativeMath && !ReciprocalMath &&
		SignedZeros && TrappingMath && RoundingFPMath &&
		DenormalFPMath.empty() && FPContract.empty())
		// OK: Current Arg doesn't conflict with fp-model=strict
		;
		else {
		StrictFPModel = false;
		FPModel = "";
		StringRef Val = A->getValue();
		if (Val.empty())
		D.Diag(clang::diag::warn_drv_overriding_flag_option)
		<< "-ffp-model=strict"
		<< A->getSpelling();
		else
		D.Diag(clang::diag::warn_drv_overriding_flag_option)
		<< "-ffp-model=strict"
		<< Args.MakeArgString(A->getSpelling() + Val);
		}
		}

// If we handled this option claim it		// If we handled this option claim it
A->claim();		A->claim();
}		}

if (!HonorINFs)		if (!HonorINFs)
CmdArgs.push_back("-menable-no-infs");		CmdArgs.push_back("-menable-no-infs");

Show All 11 Lines	if (!SignedZeros)
CmdArgs.push_back("-fno-signed-zeros");		CmdArgs.push_back("-fno-signed-zeros");

if (AssociativeMath && !SignedZeros && !TrappingMath)		if (AssociativeMath && !SignedZeros && !TrappingMath)
CmdArgs.push_back("-mreassociate");		CmdArgs.push_back("-mreassociate");

if (ReciprocalMath)		if (ReciprocalMath)
CmdArgs.push_back("-freciprocal-math");		CmdArgs.push_back("-freciprocal-math");

if (!TrappingMath)		if (TrappingMath)
		// Note: FP Exception Behavior is also set to strict
		CmdArgs.push_back("-ftrapping-math");
		michele.scandaleUnsubmitted Not Done Reply Inline Actions Running `clang -### -ftrapping-math -ffp-exception-behavior=ignore` lead to this assertion to fail. As far as I can see `TrappingMath` is not changed in the case FPExceptionBehavior is "ignore" or "maytrap". Clearly in the "ignore" case it should be safe to just set `TrappingMath` to false, but I'm not sure about the "maytrap" case. It seems that `-ffp-exception-behavior` is more general than `-f{,no-}trapping-math`, so it seems natural to me to see `ftrapping-math` and `foo-trapping-math` as aliases for `ffp-exception-behavior=strict` and `ffp-exception-behavior=ignore` respectively. If we agree on this, then I would expect the reasoning inside the compiler only in terms of `FPExceptionBehavior`. michele.scandale: Running `clang -### -ftrapping-math -ffp-exception-behavior=ignore` lead to this assertion to…
		mibintcAuthorUnsubmitted Done Reply Inline Actions Thanks for pointing out this assertion failure, I will upload a patch with fix. Yes we could entirely express ftrapping-math and fno-trapping-math via the ffp-exception-behavior= option. That would probably be better--currently the trapping option becomes effective via the exception behavior parameter to the llvm floating point constrained intrinsics, and it can take 3 values. I thought it would be too radical at the moment, so I didn't propose that in this patch. In the patch I'm about to add, I added a test case for the assertion that you saw. mibintc: Thanks for pointing out this assertion failure, I will upload a patch with fix. Yes we could…
		else
CmdArgs.push_back("-fno-trapping-math");		CmdArgs.push_back("-fno-trapping-math");
		michele.scandaleUnsubmitted Not Done Reply Inline Actions With this change if I run `clang -### -ffast-math test.c` I don't see `-fno-trapping-math` passed to the CC1. This is changing the changes the value of the function level attribute "no-trapping-math" (see lib/CodeGen/CGCall.cpp : 1747). Is this an intended change? Moreover since with this patch the default value for trapping math changed, the "no-trapping-math" function level attribute is incorrect also for default case. michele.scandale: With this change if I run `clang -### -ffast-math test.c` I don't see `-fno-trapping-math`…
		mibintcAuthorUnsubmitted Done Reply Inline Actions Before this patch, ftrapping-math was added to the Driver and also a bitfield, `NoTrappingFPMath `was created in the LLVM to describe the state of trapping-math, but otherwise that bit wasn't consulted and the option had no effect. Gcc describes ftrapping-math as the default, but in llvm by default floating point exceptions are masked and this corresponds to the floating point Constrained Intrinsics having exception behavior set to ignored. This patch changed the llvm constructor to set the trapping bit to "no trap". In fact I'd like to get rid of the` `NoTrappingFPMath`` bitfield since it's not being used, but I didn't make that change at this point. If I remember correctly, there are a bunch of driver tests that failed if fno-trapping-math is output to cc1. I'd have to reconstruct the details. Since fno-trapping-math is the default, it isn't passed through on the cc1 command line: the Clang.cpp driver doesn't pass through the positive and negative for each existing option. Thanks for pointing out the line in CGCall.cpp, it seems the CodeGenOpts aren't getting set up perfectly I'll fix that in CompilerInvocation.cpp; I don't see anything setting trapping-math as part of function level attribute, @michele.scandale did I overlook that/can you point out where that is? mibintc: Before this patch, ftrapping-math was added to the Driver and also a bitfield…
		michele.scandaleUnsubmitted Not Done Reply Inline Actions I guess you are referring to the code in `TargetMachine.cpp` where the function level attributes are used to reset the `TargetOptions` state whenever we initiate the backend codegen for a given function. Considering that the trapping math option as stated in the documentation did not have any effect, I'm not surprised to see not many uses. The only one I can see is in `llvm/lib/Target/ARM/ARMAsmPrinter.cpp : 687` where the function level attribute affects the emission of some ARM specific attributes. My only concern was that the change of the default value for trapping math was not propagated entirely causing this function level attribute to be initialized incorrectly. Fixing the logic in `CompilerInvocation.cpp` considering the change of default it is fine for me. Given that `ffp-exception-behavior={ignore,maytrap,strict}` supersedes `-f{,no-}trapping-math` I would expect long term to see the internal state of the compiler frontend to only care about the new state `FPExceptionBehavior` for both language and code generation options. And I guess the same would apply to the backend stage as well. michele.scandale: I guess you are referring to the code in `TargetMachine.cpp` where the function level…

if (!DenormalFPMath.empty())		if (!DenormalFPMath.empty())
CmdArgs.push_back(		CmdArgs.push_back(
Args.MakeArgString("-fdenormal-fp-math=" + DenormalFPMath));		Args.MakeArgString("-fdenormal-fp-math=" + DenormalFPMath));

if (!FPContract.empty())		if (!FPContract.empty())
CmdArgs.push_back(Args.MakeArgString("-ffp-contract=" + FPContract));		CmdArgs.push_back(Args.MakeArgString("-ffp-contract=" + FPContract));

		if (!RoundingFPMath)
		CmdArgs.push_back(Args.MakeArgString("-fno-rounding-math"));

		if (RoundingFPMath && RoundingMathPresent)
		CmdArgs.push_back(Args.MakeArgString("-frounding-math"));

		if (!FPExceptionBehavior.empty())
		CmdArgs.push_back(Args.MakeArgString("-ffp-exception-behavior=" +
		FPExceptionBehavior));

ParseMRecip(D, Args, CmdArgs);		ParseMRecip(D, Args, CmdArgs);

// -ffast-math enables the __FAST_MATH__ preprocessor macro, but check for the		// -ffast-math enables the __FAST_MATH__ preprocessor macro, but check for the
// individual features enabled by -ffast-math instead of the option itself as		// individual features enabled by -ffast-math instead of the option itself as
// that's consistent with gcc's behaviour.		// that's consistent with gcc's behaviour.
if (!HonorINFs && !HonorNaNs && !MathErrno && AssociativeMath &&		if (!HonorINFs && !HonorNaNs && !MathErrno && AssociativeMath &&
ReciprocalMath && !SignedZeros && !TrappingMath)		ReciprocalMath && !SignedZeros && !TrappingMath && !RoundingFPMath) {
CmdArgs.push_back("-ffast-math");		CmdArgs.push_back("-ffast-math");
		if (FPModel.equals("fast")) {
		if (FPContract.equals("fast"))
		// All set, do nothing.
		;
		else if (FPContract.empty())
		// Enable fp-contract=fast
		CmdArgs.push_back(Args.MakeArgString("-ffp-contract=fast"));
		else
		D.Diag(clang::diag::warn_drv_overriding_flag_option)
		<< "-ffp-model=fast"
		<< Args.MakeArgString("-ffp-contract=" + FPContract);
		}
		}

// Handle __FINITE_MATH_ONLY__ similarly.		// Handle __FINITE_MATH_ONLY__ similarly.
if (!HonorINFs && !HonorNaNs)		if (!HonorINFs && !HonorNaNs)
CmdArgs.push_back("-ffinite-math-only");		CmdArgs.push_back("-ffinite-math-only");

if (const Arg *A = Args.getLastArg(options::OPT_mfpmath_EQ)) {		if (const Arg *A = Args.getLastArg(options::OPT_mfpmath_EQ)) {
CmdArgs.push_back("-mfpmath");		CmdArgs.push_back("-mfpmath");
CmdArgs.push_back(A->getValue());		CmdArgs.push_back(A->getValue());
▲ Show 20 Lines • Show All 4,003 Lines • Show Last 20 Lines

clang/lib/Frontend/CompilerInvocation.cpp

Show First 20 Lines • Show All 3,084 Lines • ▼ Show 20 Lines	if (Arg *A = Args.getLastArg(OPT_ffp_contract)) {
else if (Val == "on")		else if (Val == "on")
Opts.setDefaultFPContractMode(LangOptions::FPC_On);		Opts.setDefaultFPContractMode(LangOptions::FPC_On);
else if (Val == "off")		else if (Val == "off")
Opts.setDefaultFPContractMode(LangOptions::FPC_Off);		Opts.setDefaultFPContractMode(LangOptions::FPC_Off);
else		else
Diags.Report(diag::err_drv_invalid_value) << A->getAsString(Args) << Val;		Diags.Report(diag::err_drv_invalid_value) << A->getAsString(Args) << Val;
}		}

		if (Args.hasArg(OPT_frounding_math)) {
		Opts.getFPMOptions().setFPRoundingModeSetting(LangOptions::FPRM_Dynamic);
		}

		if (Args.hasArg(OPT_fno_rounding_math)) {
		Opts.getFPMOptions().setFPRoundingModeSetting(LangOptions::FPRM_ToNearest);
		}

		jeroen.dobbelaereUnsubmitted Not Done Reply Inline Actions Calling 'Opts.setFPExceptionMode(xx)' here has no effect, as it will be overruled later on, on line 3174 ! Same is true on line 3159 jeroen.dobbelaere: Calling 'Opts.setFPExceptionMode(xx)' here has no effect, as it will be overruled later on, on…
		if (Args.hasArg(OPT_ftrapping_math)) {
		Opts.getFPMOptions().setFPExceptionBehaviorSetting(LangOptions::FPEB_Strict);
		}
		kpnUnsubmitted Not Done Reply Inline Actions Shouldn't this be a call to Diags.Report() like in the code just above it and below? Same question for _some_ other uses of llvm_unreachable(). kpn: Shouldn't this be a call to Diags.Report() like in the code just above it and below? Same…
		mibintcAuthorUnsubmitted Done Reply Inline Actions I put it in as unreachable because the clang driver shouldn't build this combination, but that's a good point I can just switch it to match the other code in this function, thanks. mibintc: I put it in as unreachable because the clang driver shouldn't build this combination, but…

		if (Args.hasArg(OPT_fno_trapping_math)) {
		Opts.getFPMOptions().setFPExceptionBehaviorSetting(LangOptions::FPEB_Ignore);
		}

		LangOptions::FPExceptionBehaviorKind FPEB = LangOptions::FPEB_Ignore;
		if (Arg *A = Args.getLastArg(OPT_ffp_exception_behavior_EQ)) {
		StringRef Val = A->getValue();
		if (Val.equals("ignore"))
		FPEB = LangOptions::FPEB_Ignore;
		else if (Val.equals("maytrap"))
		FPEB = LangOptions::FPEB_MayTrap;
		else if (Val.equals("strict"))
		FPEB = LangOptions::FPEB_Strict;
		else
		Diags.Report(diag::err_drv_invalid_value) << A->getAsString(Args) << Val;
		Opts.getFPMOptions().setFPExceptionBehaviorSetting(FPEB);
		}

		#if 0
		//don't need it
		if (FPM == LangOptions::FPM_Precise)
		// This doesn't correspond to constrained fp,
		// equivalent to -fp-contract=fast
		Opts.setDefaultFPContractMode(LangOptions::FPC_Fast);
		else if (FPM == LangOptions::FPM_Fast) {
		// This doesn't correspond to constrained fp, equivalent to -ffast-math
		Opts.FastMath = true;
		Opts.FiniteMathOnly = true;
		Opts.setDefaultFPContractMode(LangOptions::FPC_Fast);
		}
		#endif

Opts.RetainCommentsFromSystemHeaders =		Opts.RetainCommentsFromSystemHeaders =
Args.hasArg(OPT_fretain_comments_from_system_headers);		Args.hasArg(OPT_fretain_comments_from_system_headers);

unsigned SSP = getLastArgIntValue(Args, OPT_stack_protector, 0, Diags);		unsigned SSP = getLastArgIntValue(Args, OPT_stack_protector, 0, Diags);
switch (SSP) {		switch (SSP) {
default:		default:
Diags.Report(diag::err_drv_invalid_value)		Diags.Report(diag::err_drv_invalid_value)
<< Args.getLastArg(OPT_stack_protector)->getAsString(Args) << SSP;		<< Args.getLastArg(OPT_stack_protector)->getAsString(Args) << SSP;
▲ Show 20 Lines • Show All 566 Lines • Show Last 20 Lines

clang/test/CodeGen/fpconstrained.c

This file was added.

				// RUN: %clang_cc1 -ftrapping-math -frounding-math -ffp-exception-behavior=strict -emit-llvm -o - %s \| FileCheck %s -check-prefix=FPMODELSTRICT
				// RUN: %clang_cc1 -ffp-contract=fast -emit-llvm -o - %s \| FileCheck %s -check-prefix=PRECISE
				// RUN: %clang_cc1 -ffast-math -ffp-contract=fast -emit-llvm -o - %s \| FileCheck %s -check-prefix=FAST
				// RUN: %clang_cc1 -ffast-math -emit-llvm -o - %s \| FileCheck %s -check-prefix=FAST
				// RUN: %clang_cc1 -ffast-math -ffp-contract=fast -ffp-exception-behavior=ignore -emit-llvm -o - %s \| FileCheck %s -check-prefix=FAST
				// RUN: %clang_cc1 -ffast-math -ffp-contract=fast -ffp-exception-behavior=strict -emit-llvm -o - %s \| FileCheck %s -check-prefix=EXCEPT
				// RUN: %clang_cc1 -ffast-math -ffp-contract=fast -ffp-exception-behavior=maytrap -emit-llvm -o - %s \| FileCheck %s -check-prefix=MAYTRAP
				float f0, f1, f2;

				void foo(void) {
				// CHECK-LABEL: define {{.*}}void @foo()

				// MAYTRAP: llvm.experimental.constrained.fadd.f32(float %0, float %1, metadata !"round.tonearest", metadata !"fpexcept.maytrap")
				// EXCEPT: llvm.experimental.constrained.fadd.f32(float %0, float %1, metadata !"round.tonearest", metadata !"fpexcept.strict")
				// FPMODELSTRICT: llvm.experimental.constrained.fadd.f32(float %0, float %1, metadata !"round.dynamic", metadata !"fpexcept.strict")
				// STRICTEXCEPT: llvm.experimental.constrained.fadd.f32(float %0, float %1, metadata !"round.dynamic", metadata !"fpexcept.strict")
				// STRICTNOEXCEPT: llvm.experimental.constrained.fadd.f32(float %0, float %1, metadata !"round.dynamic", metadata !"fpexcept.ignore")
				// PRECISE: fadd contract float %0, %1
				// FAST: fadd fast
				f0 = f1 + f2;

				// CHECK: ret
				}
				kpnUnsubmitted Not Done Reply Inline Actions This is another case of "fast" and "precise" doing the same thing. If we're using the regular fadd then it cannot be that "the compiler preserves the source expression ordering and rounding properties of floating-point". kpn: This is another case of "fast" and "precise" doing the same thing. If we're using the regular…
				mibintcAuthorUnsubmitted Not Done Reply Inline Actions I need an fp wizard to address this point, @andrew.w.kaylor ?? The msdn documentation says that strict and precise both preserve ... mibintc: I need an fp wizard to address this point, @andrew.w.kaylor ?? The msdn documentation says…

clang/test/Driver/clang_f_opts.c

	Show First 20 Lines • Show All 314 Lines • ▼ Show 20 Lines
	// RUN: -fno-keep-inline-functions \			// RUN: -fno-keep-inline-functions \
	// RUN: -freorder-blocks \			// RUN: -freorder-blocks \
	// RUN: -ffloat-store \			// RUN: -ffloat-store \
	// RUN: -fgcse \			// RUN: -fgcse \
	// RUN: -fivopts \			// RUN: -fivopts \
	// RUN: -fprefetch-loop-arrays \			// RUN: -fprefetch-loop-arrays \
	// RUN: -fprofile-correction \			// RUN: -fprofile-correction \
	// RUN: -fprofile-values \			// RUN: -fprofile-values \
	// RUN: -frounding-math \
	rjmccallUnsubmitted Not Done Reply Inline Actions Looks like the intent of this test is that you pull this to the lines above, to test that we don't emit an error on it. You should also test `-ffp-model`. rjmccall: Looks like the intent of this test is that you pull this to the lines above, to test that we…
	// RUN: -fschedule-insns \			// RUN: -fschedule-insns \
	// RUN: -fsignaling-nans \			// RUN: -fsignaling-nans \
	// RUN: -fstrength-reduce \			// RUN: -fstrength-reduce \
	// RUN: -ftracer \			// RUN: -ftracer \
	// RUN: -funroll-all-loops \			// RUN: -funroll-all-loops \
	// RUN: -funswitch-loops \			// RUN: -funswitch-loops \
	// RUN: -flto=1 \			// RUN: -flto=1 \
	// RUN: -falign-labels \			// RUN: -falign-labels \
	▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines
	// CHECK-WARNING-DAG: optimization flag '-fno-keep-inline-functions' is not supported			// CHECK-WARNING-DAG: optimization flag '-fno-keep-inline-functions' is not supported
	// CHECK-WARNING-DAG: optimization flag '-freorder-blocks' is not supported			// CHECK-WARNING-DAG: optimization flag '-freorder-blocks' is not supported
	// CHECK-WARNING-DAG: optimization flag '-ffloat-store' is not supported			// CHECK-WARNING-DAG: optimization flag '-ffloat-store' is not supported
	// CHECK-WARNING-DAG: optimization flag '-fgcse' is not supported			// CHECK-WARNING-DAG: optimization flag '-fgcse' is not supported
	// CHECK-WARNING-DAG: optimization flag '-fivopts' is not supported			// CHECK-WARNING-DAG: optimization flag '-fivopts' is not supported
	// CHECK-WARNING-DAG: optimization flag '-fprefetch-loop-arrays' is not supported			// CHECK-WARNING-DAG: optimization flag '-fprefetch-loop-arrays' is not supported
	// CHECK-WARNING-DAG: optimization flag '-fprofile-correction' is not supported			// CHECK-WARNING-DAG: optimization flag '-fprofile-correction' is not supported
	// CHECK-WARNING-DAG: optimization flag '-fprofile-values' is not supported			// CHECK-WARNING-DAG: optimization flag '-fprofile-values' is not supported
	// CHECK-WARNING-DAG: optimization flag '-frounding-math' is not supported
	// CHECK-WARNING-DAG: optimization flag '-fschedule-insns' is not supported			// CHECK-WARNING-DAG: optimization flag '-fschedule-insns' is not supported
	// CHECK-WARNING-DAG: optimization flag '-fsignaling-nans' is not supported			// CHECK-WARNING-DAG: optimization flag '-fsignaling-nans' is not supported
	// CHECK-WARNING-DAG: optimization flag '-fstrength-reduce' is not supported			// CHECK-WARNING-DAG: optimization flag '-fstrength-reduce' is not supported
	// CHECK-WARNING-DAG: optimization flag '-ftracer' is not supported			// CHECK-WARNING-DAG: optimization flag '-ftracer' is not supported
	// CHECK-WARNING-DAG: optimization flag '-funroll-all-loops' is not supported			// CHECK-WARNING-DAG: optimization flag '-funroll-all-loops' is not supported
	// CHECK-WARNING-DAG: optimization flag '-funswitch-loops' is not supported			// CHECK-WARNING-DAG: optimization flag '-funswitch-loops' is not supported
	// CHECK-WARNING-DAG: unsupported argument '1' to option 'flto='			// CHECK-WARNING-DAG: unsupported argument '1' to option 'flto='
	// CHECK-WARNING-DAG: optimization flag '-falign-labels' is not supported			// CHECK-WARNING-DAG: optimization flag '-falign-labels' is not supported
	▲ Show 20 Lines • Show All 177 Lines • Show Last 20 Lines

clang/test/Driver/fast-math.c

	Show First 20 Lines • Show All 164 Lines • ▼ Show 20 Lines
	// program by adding a special preprocessor macro. Check that the frontend flag			// program by adding a special preprocessor macro. Check that the frontend flag
	// modeling this semantic change is provided. Also check that the flag is not			// modeling this semantic change is provided. Also check that the flag is not
	// present if any of the optimizations are disabled.			// present if any of the optimizations are disabled.
	// RUN: %clang -### -ffast-math -c %s 2>&1 \			// RUN: %clang -### -ffast-math -c %s 2>&1 \
	// RUN: \| FileCheck --check-prefix=CHECK-FAST-MATH %s			// RUN: \| FileCheck --check-prefix=CHECK-FAST-MATH %s
	// RUN: %clang -### -fno-fast-math -ffast-math -c %s 2>&1 \			// RUN: %clang -### -fno-fast-math -ffast-math -c %s 2>&1 \
	// RUN: \| FileCheck --check-prefix=CHECK-FAST-MATH %s			// RUN: \| FileCheck --check-prefix=CHECK-FAST-MATH %s
	// RUN: %clang -### -funsafe-math-optimizations -ffinite-math-only \			// RUN: %clang -### -funsafe-math-optimizations -ffinite-math-only \
	// RUN: -fno-math-errno -ffp-contract=fast -c %s 2>&1 \			// RUN: -fno-math-errno -ffp-contract=fast -fno-rounding-math -c %s 2>&1 \
	// RUN: \| FileCheck --check-prefix=CHECK-FAST-MATH %s			// RUN: \| FileCheck --check-prefix=CHECK-FAST-MATH %s
	// RUN: %clang -### -fno-honor-infinities -fno-honor-nans -fno-math-errno \			// RUN: %clang -### -fno-honor-infinities -fno-honor-nans -fno-math-errno \
	// RUN: -fassociative-math -freciprocal-math -fno-signed-zeros \			// RUN: -fassociative-math -freciprocal-math -fno-signed-zeros \
	// RUN: -fno-trapping-math -ffp-contract=fast -c %s 2>&1 \			// RUN: -fno-trapping-math -ffp-contract=fast -fno-rounding-math -c %s 2>&1 \
	// RUN: \| FileCheck --check-prefix=CHECK-FAST-MATH %s			// RUN: \| FileCheck --check-prefix=CHECK-FAST-MATH %s
	// CHECK-FAST-MATH: "-cc1"			// CHECK-FAST-MATH: "-cc1"
	// CHECK-FAST-MATH: "-ffast-math"			// CHECK-FAST-MATH: "-ffast-math"
	// CHECK-FAST-MATH: "-ffinite-math-only"			// CHECK-FAST-MATH: "-ffinite-math-only"
	//			//
	// RUN: %clang -### -ffast-math -fno-fast-math -c %s 2>&1 \			// RUN: %clang -### -ffast-math -fno-fast-math -c %s 2>&1 \
	// RUN: \| FileCheck --check-prefix=CHECK-NO-FAST-MATH %s			// RUN: \| FileCheck --check-prefix=CHECK-NO-FAST-MATH %s
	// RUN: %clang -### -ffast-math -fno-finite-math-only -c %s 2>&1 \			// RUN: %clang -### -ffast-math -fno-finite-math-only -c %s 2>&1 \
	▲ Show 20 Lines • Show All 136 Lines • Show Last 20 Lines

llvm/include/llvm/Target/TargetOptions.h

Show First 20 Lines • Show All 101 Lines • ▼ Show 20 Lines	enum class GlobalISelAbortMode {
Enable, // Enable the abort.		Enable, // Enable the abort.
DisableWithDiag // Disable the abort but emit a diagnostic on failure.		DisableWithDiag // Disable the abort but emit a diagnostic on failure.
};		};

class TargetOptions {		class TargetOptions {
public:		public:
TargetOptions()		TargetOptions()
: PrintMachineCode(false), UnsafeFPMath(false), NoInfsFPMath(false),		: PrintMachineCode(false), UnsafeFPMath(false), NoInfsFPMath(false),
NoNaNsFPMath(false), NoTrappingFPMath(false),		NoNaNsFPMath(false), NoTrappingFPMath(true), RoundingFPMath(false),
NoSignedZerosFPMath(false),		NoSignedZerosFPMath(false),
HonorSignDependentRoundingFPMathOption(false), NoZerosInBSS(false),		HonorSignDependentRoundingFPMathOption(false), NoZerosInBSS(false),
GuaranteedTailCallOpt(false), StackSymbolOrdering(true),		GuaranteedTailCallOpt(false), StackSymbolOrdering(true),
EnableFastISel(false), EnableGlobalISel(false), UseInitArray(false),		EnableFastISel(false), EnableGlobalISel(false), UseInitArray(false),
DisableIntegratedAS(false), RelaxELFRelocations(false),		DisableIntegratedAS(false), RelaxELFRelocations(false),
FunctionSections(false), DataSections(false),		FunctionSections(false), DataSections(false),
UniqueSectionNames(true), TrapUnreachable(false),		UniqueSectionNames(true), TrapUnreachable(false),
NoTrapAfterNoreturn(false), EmulatedTLS(false),		NoTrapAfterNoreturn(false), EmulatedTLS(false),
Show All 30 Lines	public:
/// assume the FP arithmetic arguments and results are never NaNs.		/// assume the FP arithmetic arguments and results are never NaNs.
unsigned NoNaNsFPMath : 1;		unsigned NoNaNsFPMath : 1;

/// NoTrappingFPMath - This flag is enabled when the		/// NoTrappingFPMath - This flag is enabled when the
/// -enable-no-trapping-fp-math is specified on the command line. This		/// -enable-no-trapping-fp-math is specified on the command line. This
/// specifies that there are no trap handlers to handle exceptions.		/// specifies that there are no trap handlers to handle exceptions.
unsigned NoTrappingFPMath : 1;		unsigned NoTrappingFPMath : 1;

		/// RoundingFPMath - This flag is enabled when the
		/// -enable-rounding-fp-math is specified on the command line. This
		/// specifies dynamic rounding mode.
		unsigned RoundingFPMath : 1;

/// NoSignedZerosFPMath - This flag is enabled when the		/// NoSignedZerosFPMath - This flag is enabled when the
/// -enable-no-signed-zeros-fp-math is specified on the command line. This		/// -enable-no-signed-zeros-fp-math is specified on the command line. This
/// specifies that optimizations are allowed to treat the sign of a zero		/// specifies that optimizations are allowed to treat the sign of a zero
/// argument or result as insignificant.		/// argument or result as insignificant.
unsigned NoSignedZerosFPMath : 1;		unsigned NoSignedZerosFPMath : 1;

/// HonorSignDependentRoundingFPMath - This returns true when the		/// HonorSignDependentRoundingFPMath - This returns true when the
/// -enable-sign-dependent-rounding-fp-math is specified. If this returns		/// -enable-sign-dependent-rounding-fp-math is specified. If this returns
▲ Show 20 Lines • Show All 144 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Add support for options -frounding-math, -ftrapping-math, -ffp-model=, and -ffp-exception-behavior=, : Specify floating point behaviorClosedPublic

Details

/fp (Specify floating-point behavior)

Syntax

Arguments

precise

strict

fast

except

Remarks

Diff Detail

Event Timeline

Please let me know if it isn't appropriate. Bug description:

Please let me know if it isn't appropriate. Bug description:

Revision Contents

Diff 223908

clang/docs/UsersManual.rst

clang/include/clang/Basic/CodeGenOptions.def

clang/include/clang/Basic/LangOptions.h

clang/include/clang/Driver/Options.td

clang/lib/CodeGen/BackendUtil.cpp

clang/lib/CodeGen/CodeGenFunction.h

clang/lib/CodeGen/CodeGenFunction.cpp

clang/lib/Driver/ToolChains/Clang.cpp

clang/lib/Frontend/CompilerInvocation.cpp

clang/test/CodeGen/fpconstrained.c

clang/test/Driver/clang_f_opts.c

clang/test/Driver/fast-math.c

llvm/include/llvm/Target/TargetOptions.h

Add support for options -frounding-math, -ftrapping-math, -ffp-model=, and -ffp-exception-behavior=, : Specify floating point behavior
ClosedPublic