diff options
author | Adhemerval Zanella <adhemerval.zanella@linaro.org> | 2024-11-18 08:45:16 -0300 |
---|---|---|
committer | Adhemerval Zanella <adhemerval.zanella@linaro.org> | 2024-12-18 17:24:43 -0300 |
commit | a357d6273f796b384b7e36d58a8de947ebde9668 (patch) | |
tree | 60c206ae0acd2465638b5c35a12d95b29796e1cc /sysdeps/x86_64/fpu/libm-test-ulps | |
parent | ed608a40e2903351c64a3cea51ce8bd7c62996ef (diff) | |
download | glibc-a357d6273f796b384b7e36d58a8de947ebde9668.zip glibc-a357d6273f796b384b7e36d58a8de947ebde9668.tar.gz glibc-a357d6273f796b384b7e36d58a8de947ebde9668.tar.bz2 |
math: Use atanf from CORE-MATH
The CORE-MATH implementation is correctly rounded (for any rounding mode)
and shows slight better performance to the generic atanf.
The code was adapted to glibc style and to use the definition of
math_config.h (to handle errno, overflow, and underflow).
Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1,
gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1):
Latency master patched improvement
x86_64 56.8265 53.6842 5.53%
x86_64v2 54.8177 53.6842 2.07%
x86_64v3 46.2915 48.7034 -5.21%
i686 158.3760 108.9560 31.20%
aarch64 (Neoverse) 21.687 20.5893 5.06%
power10 13.1903 13.5012 -2.36%
reciprocal-throughput master patched improvement
x86_64 16.6787 16.7601 -0.49%
x86_64v2 16.6983 16.7601 -0.37%
x86_64v3 16.2268 12.1391 25.19%
i686 138.6840 36.0640 74.00%
aarch64 (Neoverse) 11.8012 10.3565 12.24%
power10 5.3212 4.2894 19.39%
Signed-off-by: Alexei Sibidanov <sibid@uvic.ca>
Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr>
Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Reviewed-by: DJ Delorie <dj@redhat.com>
Diffstat (limited to 'sysdeps/x86_64/fpu/libm-test-ulps')
-rw-r--r-- | sysdeps/x86_64/fpu/libm-test-ulps | 10 |
1 files changed, 3 insertions, 7 deletions
diff --git a/sysdeps/x86_64/fpu/libm-test-ulps b/sysdeps/x86_64/fpu/libm-test-ulps index 23c40d0..f054d83 100644 --- a/sysdeps/x86_64/fpu/libm-test-ulps +++ b/sysdeps/x86_64/fpu/libm-test-ulps @@ -208,7 +208,6 @@ ldouble: 2 Function: "atan": double: 1 -float: 1 float128: 1 ldouble: 1 @@ -281,19 +280,16 @@ ldouble: 2 Function: "atan_downward": double: 1 -float: 2 float128: 2 ldouble: 1 Function: "atan_towardzero": double: 1 -float: 1 float128: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 float128: 2 ldouble: 1 @@ -375,7 +371,7 @@ ldouble: 1 Function: "atanpi_towardzero": double: 1 -float: 1 +float: 2 float128: 1 ldouble: 1 @@ -431,7 +427,7 @@ ldouble: 6 Function: Real part of "cacos_towardzero": double: 3 -float: 2 +float: 3 float128: 3 ldouble: 2 @@ -485,7 +481,7 @@ ldouble: 5 Function: Imaginary part of "cacosh_towardzero": double: 3 -float: 2 +float: 3 float128: 3 ldouble: 2 |