author | Chris Metcalf <cmetcalf@tilera.com> | 2012-11-02 12:53:57 -0400 |
---|---|---|
committer | Chris Metcalf <cmetcalf@tilera.com> | 2012-11-06 09:24:19 -0500 |
commit | cd84016efe83d92ee3903fef37f79ca2bafb3985 (patch) | |
tree | 7a92df4a9a081a6f20b8fd68f31139b6336e595c /ports/ChangeLog.tile | |
parent | 82477c28f46c579a149a8333c07233e9f4e43408 (diff) | |
Optimize tile (mostly tilegx) memcpy and memmove performance.

- Override <memcopy.h> so we use full 8-byte word copies on tilegx32
  for memmove, then use op_t in memcpy instead of the previous
  locally-defined word_t just to avoid proliferating identical types.
- Fix bug in memcpy prefetch that caused us to never prefetch past
  the first cache line.
- Optimize misaligned memcpy by inlining _wordcopy_fwd_dest_aligned
  instead of just doing a dumb word-at-a-time copy.
- Make memcpy safe for forward copies by doing all the loads from
  a given cache line prior to doing a wh64 (cache line zero-fill)
  on the destination. Remove now-redundant src == dst check.
- Copy and optimize the generic wordcopy.c routines to use the tile
  "double align" instruction instead of the MERGE macro; to avoid
  offset addressing mode (which tile doesn't have) by rewriting the
  pointer math to load and store with a zero index; and to use
  post-increment addresses in the inner loops to improve scheduling.
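
For readers unfamiliar with the destination-aligned word copy mentioned above, here is a minimal portable sketch of the idea, not the code from this commit. It assumes an 8-byte op_t and uses the generic shift-and-or merge rather than the tile "double align" instruction. The names merge_words and copy_fwd_dest_aligned are hypothetical and chosen for illustration only.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Assumption: an 8-byte word type, mirroring the "full 8-byte word
   copies" described in the commit message.  */
typedef uint64_t op_t;
#define OPSIZ sizeof(op_t)

/* Hypothetical helper: build the word that starts shift_bytes into
   `lo` from two adjacent aligned words.  Requires
   0 < shift_bytes < OPSIZ.  This is the portable shift-and-or form of
   the merge; the commit replaces this step with a tile instruction.  */
static op_t merge_words(op_t lo, op_t hi, unsigned shift_bytes)
{
    unsigned sh1 = 8 * shift_bytes;
    unsigned sh2 = 8 * (unsigned)OPSIZ - sh1;
#if defined(__BYTE_ORDER__) && __BYTE_ORDER__ == __ORDER_BIG_ENDIAN__
    return (lo << sh1) | (hi >> sh2);
#else
    return (lo >> sh1) | (hi << sh2);
#endif
}

/* Hypothetical sketch: copy nwords words forward to an aligned dst
   from a src that may be misaligned.  Every load is an aligned word
   load and each source word is loaded once.  The caller must ensure
   the aligned words that contain the source bytes are readable.  */
static void copy_fwd_dest_aligned(op_t *dst, const unsigned char *src,
                                  size_t nwords)
{
    unsigned shift = (unsigned)((uintptr_t)src % OPSIZ);
    const op_t *asrc = (const op_t *)(src - shift);

    assert((uintptr_t)dst % OPSIZ == 0);

    if (shift == 0) {
        /* Source is aligned too: plain word copy, no merging.  */
        for (size_t i = 0; i < nwords; i++)
            dst[i] = asrc[i];
        return;
    }

    op_t w0 = asrc[0];
    for (size_t i = 0; i < nwords; i++) {
        op_t w1 = asrc[i + 1];      /* next aligned source word */
        dst[i] = merge_words(w0, w1, shift);
        w0 = w1;                    /* reuse it on the next iteration */
    }
}
```

Calling copy_fwd_dest_aligned(dst, src, n) on a source that starts a few bytes into an aligned buffer should produce the same bytes as memcpy(dst, src, n * OPSIZ), while issuing only aligned word loads. Because it reads whole aligned words, the sketch may touch bytes just before and just after the requested source range.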
Diffstat (limited to 'ports/ChangeLog.tile')
-rw-r--r-- | ports/ChangeLog.tile | 6 |
1 files changed, 6 insertions, 0 deletions
diff --git a/ports/ChangeLog.tile b/ports/ChangeLog.tile
index 0fecf63..bc3946c 100644
--- a/ports/ChangeLog.tile
+++ b/ports/ChangeLog.tile
@@ -1,3 +1,9 @@
+2012-11-02  Chris Metcalf  <cmetcalf@tilera.com>
+
+	* sysdeps/tile/tilegx/memcpy.c (__memcpy): Optimize.
+	* sysdeps/tile/memcopy.h: New file.
+	* sysdeps/tile/wordcopy.c: New file.
+
 2012-11-03  Joseph Myers  <joseph@codesourcery.com>
 
 	[BZ #3439]