Add new log2 implementation

Similar algorithm is used as in log: log2(2^k x) = k + log2(c) + log2(x/c) where the last term is approximated by a polynomial of x/c - 1, the first order coefficient is about 1/ln2 in this case. There is separate code path when fma instruction is not available for computing x/c - 1 precisely, for which the table size is doubled. The worst case error is 0.547 ULP (0.55 without fma), the read only global data size is 1168 bytes (2192 without fma) on aarch64. The non-nearest rounding error is less than 1 ULP. Improvements on Cortex-A72 compared to current glibc master: log2 thruput: 2.00x in [0.01 11.1] log2 latency: 2.04x in [0.01 11.1] log2 thruput: 2.17x in [0.999 1.001] log2 latency: 2.88x in [0.999 1.001] Tested on aarch64-linux-gnu (defined __FP_FAST_FMA) arm-linux-gnueabihf (!defined __FP_FAST_FMA) x86_64-linux-gnu (!defined __FP_FAST_FMA) powerpc64le-linxu-gnu (defined __FP_FAST_FMA) targets. * NEWS: Mention log2 improvements. * math/Makefile (type-double-routines): Add e_log2_data. * sysdeps/i386/fpu/e_log2_data.c: New file. * sysdeps/ia64/fpu/e_log2_data.c: New file. * sysdeps/ieee754/dbl-64/e_log2.c: Rewrite. * sysdeps/ieee754/dbl-64/e_log2_data.c: New file. * sysdeps/ieee754/dbl-64/math_config.h (__log2_data): Add. * sysdeps/ieee754/dbl-64/wordsize-64/e_log2.c: Remove. * sysdeps/m68k/m680x0/fpu/e_log2_data.c: New file.
author: Szabolcs Nagy <szabolcs.nagy@arm.com> 2018-06-13 17:48:52 +0100
committer: Szabolcs Nagy <szabolcs.nagy@arm.com> 2018-09-12 17:36:33 +0100
commit: 3e08ff544b86834cd24795de159f16b8c65c7b8f (patch)
tree: 550dc4a2c295fdfc9e747cb021d5a3860c23c301 /math
parent: f41b0a43e426831e391cafd8d0bd47a3efa4a840 (diff)
download: glibc-3e08ff544b86834cd24795de159f16b8c65c7b8f.zip
glibc-3e08ff544b86834cd24795de159f16b8c65c7b8f.tar.gz
glibc-3e08ff544b86834cd24795de159f16b8c65c7b8f.tar.bz2
1 files changed, 2 insertions, 1 deletions
diff --git a/math/Makefile b/math/Makefile
index 8bfbebc..2537b2a 100644
--- a/math/Makefile
+++ b/math/Makefile
@@ -127,7 +127,8 @@ type-ldouble-yes := ldouble
 type-double-suffix :=
 type-double-routines := branred doasin dosincos mpa mpatan2	\
 		       k_rem_pio2 mpatan mpsqrt mptan sincos32	\
-		       sincostab math_err e_exp_data e_log_data
+		       sincostab math_err e_exp_data e_log_data	\
+		       e_log2_data
 
 # float support
 type-float-suffix := f
author	Szabolcs Nagy <szabolcs.nagy@arm.com>	2018-06-13 17:48:52 +0100
committer	Szabolcs Nagy <szabolcs.nagy@arm.com>	2018-09-12 17:36:33 +0100
commit	3e08ff544b86834cd24795de159f16b8c65c7b8f (patch)
tree	550dc4a2c295fdfc9e747cb021d5a3860c23c301 /math
parent	f41b0a43e426831e391cafd8d0bd47a3efa4a840 (diff)
download	glibc-3e08ff544b86834cd24795de159f16b8c65c7b8f.zip glibc-3e08ff544b86834cd24795de159f16b8c65c7b8f.tar.gz glibc-3e08ff544b86834cd24795de159f16b8c65c7b8f.tar.bz2