x86: Small improvements for wcslen
Just a few QOL changes. 1. Prefer `add` > `lea` as it has high execution units it can run on. 2. Don't break macro-fusion between `test` and `jcc` 3. Reduce code size by removing gratuitous padding bytes (-90 bytes). geometric_mean(N=20) of all benchmarks New / Original: 0.959 All string/memory tests pass. Reviewed-by:H.J. Lu <hjl.tools@gmail.com> (cherry picked from commit 244b415d)
Loading
Please register or sign in to comment