Skip to content
Commit 3dc426b6 authored by Wilco Dijkstra's avatar Wilco Dijkstra
Browse files

AArch64: Improve generic strlen



Improve performance by handling another 16 bytes before entering the loop.
Use ADDHN in the loop to avoid SHRN+FMOV when it terminates.  Change final
size computation to avoid increasing latency.  On Neoverse V1 performance
of the random strlen benchmark improves by 4.6%.

Reviewed-by: default avatarAdhemerval Zanella <adhemerval.zanella@linaro.org>
parent d5ce0e96
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment