diff options
author | Adhemerval Zanella <adhemerval.zanella@linaro.org> | 2024-10-30 11:50:03 -0300 |
---|---|---|
committer | Adhemerval Zanella <adhemerval.zanella@linaro.org> | 2024-11-22 10:52:27 -0300 |
commit | d846f4c12d7636efd5b7cff173456e616a185e24 (patch) | |
tree | 7107e7883729453e2e30bb5cd5fb8f7898e4ae06 /sysdeps/powerpc/fpu/libm-test-ulps | |
parent | baa495f2311f182d6a829cba1abc10363fd1e2cb (diff) | |
download | glibc-d846f4c12d7636efd5b7cff173456e616a185e24.zip glibc-d846f4c12d7636efd5b7cff173456e616a185e24.tar.gz glibc-d846f4c12d7636efd5b7cff173456e616a185e24.tar.bz2 |
math: Use lgammaf from CORE-MATH
The CORE-MATH implementation is correctly rounded (for any rounding mode)
and shows better performance to the generic lgammaf.
The code was adapted to glibc style, to use the definition of
math_config.h, to remove errno handling, to use math_narrow_eval
on overflow usage, and to adapt to make it reentrant.
Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (M1,
gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1):
latency master patched improvement
x86_64 86.5609 70.3278 18.75%
x86_64v2 78.3030 69.9709 10.64%
x86_64v3 74.7470 59.8457 19.94%
i686 387.355 229.761 40.68%
aarch64 40.8341 33.7563 17.33%
power10 26.5520 16.1672 39.11%
powerpc 28.3145 17.0625 39.74%
reciprocal-throughput master patched improvement
x86_64 68.0461 48.3098 29.00%
x86_64v2 55.3256 47.2476 14.60%
x86_64v3 52.3015 38.9028 25.62%
i686 340.848 195.707 42.58%
aarch64 36.8000 30.5234 17.06%
power10 20.4043 12.6268 38.12%
powerpc 22.6588 13.8866 38.71%
Signed-off-by: Alexei Sibidanov <sibid@uvic.ca>
Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr>
Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Reviewed-by: DJ Delorie <dj@redhat.com>
Diffstat (limited to 'sysdeps/powerpc/fpu/libm-test-ulps')
-rw-r--r-- | sysdeps/powerpc/fpu/libm-test-ulps | 4 |
1 files changed, 0 insertions, 4 deletions
diff --git a/sysdeps/powerpc/fpu/libm-test-ulps b/sysdeps/powerpc/fpu/libm-test-ulps index 4e448a2..56ca580 100644 --- a/sysdeps/powerpc/fpu/libm-test-ulps +++ b/sysdeps/powerpc/fpu/libm-test-ulps @@ -1431,25 +1431,21 @@ ldouble: 5 Function: "lgamma": double: 3 -float: 4 float128: 5 ldouble: 3 Function: "lgamma_downward": double: 4 -float: 4 float128: 8 ldouble: 15 Function: "lgamma_towardzero": double: 4 -float: 3 float128: 5 ldouble: 16 Function: "lgamma_upward": double: 4 -float: 5 float128: 8 ldouble: 11 |