x86: Optimize memchr-evex.S
No bug. This commit optimizes memchr-evex.S. The optimizations include replacing some branches with cmovcc, avoiding some branches entirely in the less_4x_vec case, making the page cross logic less strict, saving some ALU in the alignment process, and most importantly increasing ILP in the 4x loop. test-memchr, test-rawmemchr, and test-wmemchr are all passing. Signed-off-by:Noah Goldstein <goldstein.w.n@gmail.com> Reviewed-by:
H.J. Lu <hjl.tools@gmail.com> (cherry picked from commit 2a76821c)
Loading
Please register or sign in to comment