AArch64: Optimize strchr
Simplify calculation of the mask using shrn. Unroll the main loop.
Small strings are 20% faster on modern CPUs.
Reviewed-by:
Szabolcs Nagy <szabolcs.nagy@arm.com>
Loading
Simplify calculation of the mask using shrn. Unroll the main loop.
Small strings are 20% faster on modern CPUs.
Reviewed-by:
Szabolcs Nagy <szabolcs.nagy@arm.com>