riscv-gnu-toolchain/glibc.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-10-21 15:51:27 -0300
committer	Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-01 11:27:39 -0300
commit	9247f5321969abb701099b598b6eadb22b277ee4 (patch)
tree	3fd80bbb907ded24eca75de769474e7a3ca68278 /sysdeps/hppa/fpu
parent	bbd578b38df4ef417acc9af332fa2ddbb4952df4 (diff)
download	glibc-9247f5321969abb701099b598b6eadb22b277ee4.zip glibc-9247f5321969abb701099b598b6eadb22b277ee4.tar.gz glibc-9247f5321969abb701099b598b6eadb22b277ee4.tar.bz2

math: Use log10f from CORE-MATH

The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows better performance compared to the generic log10f. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 49.9017 33.5143 32.84% x86_64v2 50.4878 33.5623 33.52% x86_64v3 50.0991 27.6078 44.89% i686 140.874 106.086 24.69% aarch64 19.2846 11.3573 41.11% power10 14.0994 7.7739 44.86% powerpc 14.2898 7.92497 44.54% reciprocal-throughput master patched improvement x86_64 17.8336 12.9074 27.62% x86_64v2 16.4418 11.3220 31.14% x86_64v3 15.6002 10.5158 32.59% i686 66.0678 80.2287 -21.43% aarch64 9.4906 6.8393 27.94% power10 7.5255 5.5084 26.80% powerpc 9.5204 6.98055 26.68% The performance decrease for i686 is mostly due the use of x87 fpu, when building with '-msse2 -mfpmath=sse': master patched improvement latency 140.874 77.1137 45.26% reciprocal-throughput 64.481 56.4397 12.47% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>

Diffstat (limited to 'sysdeps/hppa/fpu')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: