aboutsummaryrefslogtreecommitdiff
path: root/ChangeLog
diff options
context:
space:
mode:
authorAdhemerval Zanella <azanella@linux.vnet.ibm.com>2014-12-31 11:47:41 -0500
committerAdhemerval Zanella <azanella@linux.vnet.ibm.com>2015-01-14 07:58:02 -0500
commita38f68f12fd03374d599eeb0b6943e50b0ff7348 (patch)
treee8a235f5f34fcb66ff5ff656b357bbd48dfa3a0e /ChangeLog
parent4242356131256e54ca3e96b0c6f2af773b7a69c8 (diff)
downloadglibc-a38f68f12fd03374d599eeb0b6943e50b0ff7348.zip
glibc-a38f68f12fd03374d599eeb0b6943e50b0ff7348.tar.gz
glibc-a38f68f12fd03374d599eeb0b6943e50b0ff7348.tar.bz2
powerpc: Optimized st{r,p}ncpy for POWER8/PPC64
This patch adds an optimized POWER8 st{r,p}ncpy using unaligned accesses. It shows 10%-80% improvement over the optimized POWER7 one that uses only aligned accesses, specially on unaligned inputs. The algorithm first read and check 16 bytes (if inputs do not cross a 4K page size). The it realign source to 16-bytes and issue a 16 bytes read and compare loop to speedup null byte checks for large strings. Also, different from POWER7 optimization, the null pad is done inline in the implementation using possible unaligned accesses, instead of realying on a memset call. Special case is added for page cross reads.
Diffstat (limited to 'ChangeLog')
-rw-r--r--ChangeLog15
1 files changed, 15 insertions, 0 deletions
diff --git a/ChangeLog b/ChangeLog
index 0c3f78d..383f1f5 100644
--- a/ChangeLog
+++ b/ChangeLog
@@ -1,5 +1,20 @@
2015-01-13 Adhemerval Zanella <azanella@linux.vnet.ibm.com>
+ * sysdeps/powerpc/powerpc64/multiarch/Makefile [sysdep_routines]:
+ Add strncpy-power8 and stpncpy-power8 objects.
+ * sysdeps/powerpc/powerpc64/multiarch/ifunc-impl-list.c
+ (__libc_ifunc_impl_list): Add __strncpy_power8 and stpncpy_power8
+ implementations.
+ * sysdeps/powerpc/powerpc64/multiarch/stpncpy-power8.S: New file.
+ * sysdeps/powerpc/powerpc64/multiarch/stpncpy.c (__stpncpy): Add
+ __stpncpy_power8 implementation.
+ * sysdeps/powerpc/powerpc64/multiarch/strncpy-power8.S: New file.
+ * sysdeps/powerpc/powerpc64/multiarch/strncpy.c (strncpy): Add
+ __strncpy_power8 implementation.
+ * sysdeps/powerpc/powerpc64/power8/stpncpy.S: New file.
+ * sysdeps/powerpc/powerpc64/power8/strncpy.S: New file.
+ * NEWS: Update.
+
* sysdeps/powerpc/powerpc64/multiarch/strncat-power7.c: New file.
* sysdeps/powerpc/powerpc64/multiarch/strncat-power7.S: Remove file.
* sysdeps/powerpc/powerpc64/power7/strncat.S: Likewise.