aboutsummaryrefslogtreecommitdiff
path: root/ChangeLog
diff options
context:
space:
mode:
authorJoseph Myers <joseph@codesourcery.com>2017-09-20 16:54:05 +0000
committerJoseph Myers <joseph@codesourcery.com>2017-09-20 16:54:05 +0000
commitae8372d7e4c44f6839aa3d851d4d0cb486b81cd5 (patch)
tree83340587a4086402e9f1686c278aa1a264ef77e7 /ChangeLog
parenta856d4d4a8a56eaefdddb58884bfa2bfe922ee4c (diff)
downloadglibc-ae8372d7e4c44f6839aa3d851d4d0cb486b81cd5.zip
glibc-ae8372d7e4c44f6839aa3d851d4d0cb486b81cd5.tar.gz
glibc-ae8372d7e4c44f6839aa3d851d4d0cb486b81cd5.tar.bz2
Add SSE4.1 trunc, truncf (bug 20142).
This patch adds SSE4.1 versions of trunc and truncf, using the roundsd / roundss instructions, similar to the versions of ceil, floor, rint and nearbyint functions we already have. In my testing with the glibc benchtests these are about 30% faster than the C versions for double, 20% faster for float. Tested for x86_64. [BZ #20142] * sysdeps/x86_64/fpu/multiarch/Makefile (libm-sysdep_routines): Add s_trunc-c, s_truncf-c, s_trunc-sse4_1 and s_truncf-sse4_1. * sysdeps/x86_64/fpu/multiarch/s_trunc-c.c: New file. * sysdeps/x86_64/fpu/multiarch/s_trunc-sse4_1.S: Likewise. * sysdeps/x86_64/fpu/multiarch/s_trunc.c: Likewise. * sysdeps/x86_64/fpu/multiarch/s_truncf-c.c: Likewise. * sysdeps/x86_64/fpu/multiarch/s_truncf-sse4_1.S: Likewise. * sysdeps/x86_64/fpu/multiarch/s_truncf.c: Likewise.
Diffstat (limited to 'ChangeLog')
-rw-r--r--ChangeLog12
1 files changed, 12 insertions, 0 deletions
diff --git a/ChangeLog b/ChangeLog
index a07c903..d85bb87 100644
--- a/ChangeLog
+++ b/ChangeLog
@@ -1,3 +1,15 @@
+2017-09-20 Joseph Myers <joseph@codesourcery.com>
+
+ [BZ #20142]
+ * sysdeps/x86_64/fpu/multiarch/Makefile (libm-sysdep_routines):
+ Add s_trunc-c, s_truncf-c, s_trunc-sse4_1 and s_truncf-sse4_1.
+ * sysdeps/x86_64/fpu/multiarch/s_trunc-c.c: New file.
+ * sysdeps/x86_64/fpu/multiarch/s_trunc-sse4_1.S: Likewise.
+ * sysdeps/x86_64/fpu/multiarch/s_trunc.c: Likewise.
+ * sysdeps/x86_64/fpu/multiarch/s_truncf-c.c: Likewise.
+ * sysdeps/x86_64/fpu/multiarch/s_truncf-sse4_1.S: Likewise.
+ * sysdeps/x86_64/fpu/multiarch/s_truncf.c: Likewise.
+
2017-09-20 Paul Pluzhnikov <ppluzhnikov@google.com>
Ricky Zhou <rickyz@google.com>
Anoop V Chakkalakkal <anoop.vijayan@in.ibm.com>