diff options
author | H.J. Lu <hjl.tools@gmail.com> | 2017-10-22 07:55:49 -0700 |
---|---|---|
committer | H.J. Lu <hjl.tools@gmail.com> | 2017-10-22 07:57:50 -0700 |
commit | 5d15c9697577d2c342b5ade820bfd7b84a736b23 (patch) | |
tree | d16d35b14e05b635b9dc8d147fbd6c147852d446 /sysdeps/x86_64/fpu/multiarch/Makefile | |
parent | b2f6137ea570933fb4be286574cc58b794eb5b5f (diff) | |
download | glibc-5d15c9697577d2c342b5ade820bfd7b84a736b23.zip glibc-5d15c9697577d2c342b5ade820bfd7b84a736b23.tar.gz glibc-5d15c9697577d2c342b5ade820bfd7b84a736b23.tar.bz2 |
x86-64: Add exp2f with FMA
For workload-spec2017.wrf, on Skylake, it improves performance by:
Before After Improvement
reciprocal-throughput 13.0291 11.2225 16%
latency 44.5154 37.5766 18%
* sysdeps/x86_64/fpu/multiarch/Makefile (libm-sysdep_routines):
Add e_exp2f-fma.
(CFLAGS-e_exp2f-fma.c): New.
* sysdeps/x86_64/fpu/multiarch/e_exp2f-fma.c: New file.
* sysdeps/x86_64/fpu/multiarch/e_exp2f.c: Likewise.
Diffstat (limited to 'sysdeps/x86_64/fpu/multiarch/Makefile')
-rw-r--r-- | sysdeps/x86_64/fpu/multiarch/Makefile | 3 |
1 files changed, 2 insertions, 1 deletions
diff --git a/sysdeps/x86_64/fpu/multiarch/Makefile b/sysdeps/x86_64/fpu/multiarch/Makefile index 92e91d7..71a3b33 100644 --- a/sysdeps/x86_64/fpu/multiarch/Makefile +++ b/sysdeps/x86_64/fpu/multiarch/Makefile @@ -37,8 +37,9 @@ CFLAGS-slowpow-fma.c = -mfma -mavx2 CFLAGS-s_sin-fma.c = -mfma -mavx2 CFLAGS-s_tan-fma.c = -mfma -mavx2 -libm-sysdep_routines += e_expf-fma +libm-sysdep_routines += e_exp2f-fma e_expf-fma +CFLAGS-e_exp2f-fma.c = -mfma -mavx2 CFLAGS-e_expf-fma.c = -mfma -mavx2 libm-sysdep_routines += e_exp-fma4 e_log-fma4 e_pow-fma4 s_atan-fma4 \ |