User Details
- User Since
- Jul 31 2017, 10:42 PM (257 w, 7 h)
Feb 14 2019
tgamma stands for "true gamma", and it is the function name specified in the ANSI C standard.
Dec 11 2018
Where, when, how and by whom are these #pragmas inserted? Are they silently inserted by clang? Do they have to be manually inserted in existing code?
What is the effect of these #pragmas? The ARM closed-source commercial compiler requires OpenMP. How will these #pragmas be implemented in the open-source clang? Will they import OpenMP interfaces in the open-source clang as well?
If importing OpenMP interfaces is not triggered by the insertion of these #pragmas, why is the argument to -fveclib called 'openmp'? I would find it baffling to have to type -fveclib=openmp on compile-line, but ending up not using OpenMP.
The RFC for reimplementing -fveclib with OpenMP is titled "OpenMP" because the corresponding code for ARM HPC compiler requires OpenMP.
However, the proposal in RFC does not require OpenMP, as quoted below.
A header file that wasn't there before is now automagically included, and there's nothing they can do to not include it. Boom! - source code change. Happy re-certification planning.
That's because no commercial distro - if they ever decide to include SLEEF - will compile SLEEF for the maximum CPU feature-set possible. They do the exact opposite - they compile it for the lowest possible common denominator.
For example, on AArch64, you won't get distro libraries compiled specifically for -mcpu=thunderx2t99. Or on x86_64 you'll never get -mavx512ifma, for example.
I am taking it on faith that SLEEF won't keep changing their mangling and ABI.
Hello Renato,
Nov 9 2018
@shibatch, do you know for sure that we shouldn't use the 3.5 ULP version of the function?
Nov 5 2018
I am sorry for not reading the comments carefully, but I think 3.5-ULP functions should be used only when a fast-math option is specified.
Nov 1 2018
Is it easy to add pow and atan2?
Since these functions are relatively frequently used, I think people would appreciate if you could also add these two functions.