Commit 1d9f99ce authored by Naohiro Tamura, committed by Szabolcs Nagy

AArch64: Update A64FX memset not to degrade at 16KB



This patch updates the unroll8 code so that performance no longer
degrades at 16KB, the peak-performance size, on both FX1000 and FX700.

The two instructions inserted at the beginning of the unroll8 loop,
a cmp and a branch, are a workaround that was found heuristically.

Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>
parent f873adf3