Commit 233efd43 authored Dec 19, 2019 by Xuelei Zhang Committed by Adhemerval Zanella Dec 19, 2019

aarch64: Optimized implementation of memcmp



The loop body is expanded from a 16-byte comparison to a 64-byte
comparison, and the usage of ldp is replaced by the Post-index
mode to the Base plus offset mode. Hence, compare can faster 18%
around > 128 bytes in all.

Checked on aarch64-linux-gnu.

Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>

parent 442d9c9c

Show whitespace changes

Inline Side-by-side

Please register or to comment