Skip to content
Commit 0237b615 authored by Xuelei Zhang's avatar Xuelei Zhang Committed by Adhemerval Zanella
Browse files

aarch64: Optimized implementation of strcpy



Optimize the strcpy implementation by using vector loads and operations
in main loop.Compared to aarch64/strcpy.S, it reduces latency of cases
in bench-strlen by 5%~18% when the length of src is greater than 64
bytes, with gains throughout the benchmark.

Checked on aarch64-linux-gnu.

Reviewed-by: default avatarWilco Dijkstra <Wilco.Dijkstra@arm.com>
parent 233efd43
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment