Commit 9bc2ed8f authored by Wilco Dijkstra

[2/5] AArch64: Improve A64FX memset for large sizes



Improve performance of large memsets. Simplify alignment code. For zero memset
use DC ZVA, which almost doubles performance. For non-zero memsets use the
unroll8 loop which is about 10% faster.
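
The zero-fill strategy described above can be sketched in portable C. This is only an illustration of the general idea (align to a block boundary, then clear whole blocks), not the assembly from this commit. The name `sketch_memset`, the helper `zero_block`, the 256-byte `ZVA_BLOCK` size, and the `2 * ZVA_BLOCK` threshold are all assumptions made for the example; real code would query the block size from `DCZID_EL0` and issue `dc zva` per block, and the non-zero path would be the unroll8 store loop rather than a call to `memset`.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Assumed block size for illustration; real code reads it from DCZID_EL0. */
#define ZVA_BLOCK 256

/* Hypothetical stand-in for a single `dc zva`: zero one aligned block. */
static void zero_block(unsigned char *p)
{
    memset(p, 0, ZVA_BLOCK);
}

/* Illustrative memset: block-zeroing path for large zero fills,
   plain stores for everything else. */
static void *sketch_memset(void *dst, int c, size_t n)
{
    unsigned char *p = dst;
    unsigned char *end = p + n;

    if (c == 0 && n >= 2 * ZVA_BLOCK) {
        /* Head: ordinary stores up to the next block boundary. */
        size_t head = (size_t)(-(uintptr_t)p & (ZVA_BLOCK - 1));
        memset(p, 0, head);
        p += head;

        /* Body: clear whole aligned blocks, one per iteration. */
        while ((size_t)(end - p) >= ZVA_BLOCK) {
            zero_block(p);
            p += ZVA_BLOCK;
        }
    }

    /* Tail of the zero path, or the entire buffer for non-zero values
       (where the real routine would run its unrolled store loop). */
    memset(p, c, (size_t)(end - p));
    return dst;
}
```

Aligning first keeps the block loop free of partial-block handling, which is presumably part of what "simplify alignment code" refers to; the head and tail are always shorter than one block.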

Reviewed-by: Naohiro Tamura <naohirot@fujitsu.com>
parent 07b42729