Skip to content
Commit d154758e authored by Noah Goldstein's avatar Noah Goldstein
Browse files

x86: Optimize str{n}casecmp TOLOWER logic in strcmp-sse42.S



Slightly faster method of doing TOLOWER that saves an
instruction.

Also replace the hard coded 5-byte no with .p2align 4. On builds with
CET enabled this misaligned entry to strcasecmp.

geometric_mean(N=40) of all benchmarks New / Original: .920

All string/memory tests pass.
Reviewed-by: default avatarH.J. Lu <hjl.tools@gmail.com>
parent 670b54bc
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment