diff options
author | Matthew Sterrett <matthew.sterrett@intel.com> | 2023-12-15 12:04:05 -0800 |
---|---|---|
committer | Noah Goldstein <goldstein.w.n@gmail.com> | 2023-12-18 12:38:01 -0600 |
commit | e957308723ac2e55dad360d602298632980bbd38 (patch) | |
tree | 75b99dddc0746f3e950e43eda54c51449dbbe612 /sysdeps/loongarch/dl-tls.h | |
parent | 442983319ba70de801fc856e8dd4748fba8f7f1b (diff) | |
download | glibc-e957308723ac2e55dad360d602298632980bbd38.zip glibc-e957308723ac2e55dad360d602298632980bbd38.tar.gz glibc-e957308723ac2e55dad360d602298632980bbd38.tar.bz2 |
x86: Unifies 'strlen-evex' and 'strlen-evex512' implementations.
This commit uses a common implementation 'strlen-evex-base.S' for both
'strlen-evex' and 'strlen-evex512'
The motivation is to reduce the number of implementations to maintain.
This incidentally gives a small performance improvement.
All tests pass on x86.
Benchmarks were taken on SKX.
https://www.intel.com/content/www/us/en/products/sku/123613/intel-core-i97900x-xseries-processor-13-75m-cache-up-to-4-30-ghz/specifications.html
Geometric mean for strlen-evex512 over all benchmarks (N=10) was (new/old) 0.939
Geometric mean for wcslen-evex512 over all benchmarks (N=10) was (new/old) 0.965
Code Size Changes:
strlen-evex512.S : +24 bytes
wcslen-evex512.S : +54 bytes
Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>
Diffstat (limited to 'sysdeps/loongarch/dl-tls.h')
0 files changed, 0 insertions, 0 deletions