Commit 355afae9 authored May 03, 2021 by Noah Goldstein Committed by H.J. Lu Jan 27, 2022

x86: Optimize memchr-evex.S



No bug. This commit optimizes memchr-evex.S. The optimizations include
replacing some branches with cmovcc, avoiding some branches entirely
in the less_4x_vec case, making the page cross logic less strict,
saving some ALU in the alignment process, and most importantly
increasing ILP in the 4x loop. test-memchr, test-rawmemchr, and
test-wmemchr are all passing.

Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>
Reviewed-by: H.J. Lu <hjl.tools@gmail.com>
(cherry picked from commit 2a76821c)

parent b72b8970

Show whitespace changes

Inline Side-by-side

Please register or to comment