author | Ondrej Bilka <neleai@seznam.cz> | 2013-05-20 08:20:00 +0200 |
---|---|---|
committer | Ondrej Bilka <neleai@seznam.cz> | 2013-05-20 08:24:41 +0200 |
commit | 2d48b41c8fa610067c4d664ac2339ae6ca43e78c (patch) | |
tree | 4d1ca07a1228ba16c12a67ddb08595770f397da1 /ChangeLog | |
parent | 3e694268750d51acc6a68b0ee7ded25a52902c20 (diff) | |
Faster memcpy on x64.
We add a new memcpy version that uses unaligned loads, which are fast
on modern processors. This enables a second improvement: avoiding a
computed jump, which is a relatively expensive operation.
Tests available here:
http://kam.mff.cuni.cz/~ondra/memcpy_profile_result27_04_13.tar.bz2
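
The idea in the commit message can be illustrated with a small portable C sketch. This is not the code from memcpy-sse2-unaligned.S, which is hand-written assembly; the name `sketch_copy`, the helper functions, and the size thresholds are assumptions made only for illustration. The sketch copies the tail of a buffer with a load/store pair that may overlap bytes already copied, so it needs only a few ordinary size comparisons instead of a computed jump into a table of exact-length copy sequences. The fixed-size `memcpy` calls stand in for unaligned loads and stores; compilers typically lower them to single unaligned move instructions on x86-64.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical 16-byte chunk type standing in for an SSE2 register. */
typedef struct { unsigned char b[16]; } chunk16;

/* Fixed-size copies: no alignment requirement on p. */
static inline chunk16 load16(const unsigned char *p)
{
    chunk16 c;
    memcpy(&c, p, sizeof c);
    return c;
}

static inline void store16(unsigned char *p, chunk16 c)
{
    memcpy(p, &c, sizeof c);
}

/* Illustrative copy for non-overlapping buffers (memcpy semantics).
   Remaining bytes are handled by one load/store pair that may overlap
   bytes already written, so no per-length jump table is needed. */
void *sketch_copy(void *dst, const void *src, size_t n)
{
    unsigned char *d = dst;
    const unsigned char *s = src;

    if (n >= 16) {
        /* Copy whole 16-byte chunks from the front. */
        for (size_t i = 0; i + 16 <= n; i += 16)
            store16(d + i, load16(s + i));
        /* Cover the remaining 0..15 bytes with one chunk ending at n. */
        store16(d + n - 16, load16(s + n - 16));
        return dst;
    }
    if (n >= 8) {
        /* Two 8-byte accesses, overlapping in the middle when n < 16. */
        uint64_t head, tail;
        memcpy(&head, s, 8);
        memcpy(&tail, s + n - 8, 8);
        memcpy(d, &head, 8);
        memcpy(d + n - 8, &tail, 8);
        return dst;
    }
    if (n >= 4) {
        uint32_t head, tail;
        memcpy(&head, s, 4);
        memcpy(&tail, s + n - 4, 4);
        memcpy(d, &head, 4);
        memcpy(d + n - 4, &tail, 4);
        return dst;
    }
    if (n >= 2) {
        uint16_t head, tail;
        memcpy(&head, s, 2);
        memcpy(&tail, s + n - 2, 2);
        memcpy(d, &head, 2);
        memcpy(d + n - 2, &tail, 2);
        return dst;
    }
    if (n == 1)
        d[0] = s[0];
    return dst;
}
```

For example, a 13-byte copy takes the `n >= 8` branch and issues two 8-byte accesses, at offsets 0 and 5, that overlap in the middle. The overlap is only cheap when unaligned accesses are fast, which is the property the commit message relies on.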
Diffstat (limited to 'ChangeLog')
-rw-r--r-- | ChangeLog | 10 |
1 files changed, 10 insertions, 0 deletions
@@ -1,3 +1,13 @@
+2013-05-20  Ondřej Bílka  <neleai@seznam.cz>
+
+	* sysdeps/x86_64/multiarch/memcpy-sse2-unaligned.S: New file.
+	* sysdeps/x86_64/multiarch/memcpy.S (__new_memcpy): Add
+	__memcpy_sse2_unaligned ifunc selection.
+	* sysdeps/x86_64/multiarch/Makefile (sysdep_routines):
+	Add memcpy-sse2-unaligned.S.
+	* sysdeps/x86_64/multiarch/ifunc-impl-list.c (__libc_ifunc_impl_list):
+	Add: __memcpy_sse2_unaligned.
+
 2013-05-19  Joseph Myers  <joseph@codesourcery.com>
 
 	[BZ #15490]