x86: Optimize memcmp-evex-movbe.S
No bug.

This commit optimizes memcmp-evex-movbe.S. The optimizations include adding a new vec compare path for small sizes, reorganizing the entry control flow, removing some unnecessary ALU instructions from the main loop, and, most importantly, replacing the heavy use of vpcmp + kand logic with vpxor + vptern.

test-memcmp and test-wmemcmp are both passing.

Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>
Reviewed-by: H.J. Lu <hjl.tools@gmail.com>
(cherry picked from commit 4ad473e9)
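
For readers unfamiliar with the xor + ternary-logic idea mentioned above, here is a minimal scalar sketch in portable C. It is not the code from this commit: the helper names (ternlog64, block32_differs), the 32-byte block size, and the immediate 0xF6 are assumptions chosen for illustration. The sketch models a three-input ternary-logic operation (as provided by the AVX-512 vpternlog instructions) and uses it to fold "xor this chunk pair, then or into the accumulator" into one step, so a single zero test covers the whole block instead of one compare mask per chunk.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Scalar model of a three-input bitwise ternary-logic operation:
   bit i of the result is bit ((a_i << 2) | (b_i << 1) | c_i) of imm8.
   Hypothetical helper, written only to illustrate the technique. */
static uint64_t ternlog64(uint64_t a, uint64_t b, uint64_t c, uint8_t imm8)
{
    uint64_t r = 0;
    for (int i = 0; i < 64; i++) {
        unsigned idx = (unsigned)((((a >> i) & 1) << 2)
                                  | (((b >> i) & 1) << 1)
                                  | ((c >> i) & 1));
        r |= (uint64_t)((imm8 >> idx) & 1) << i;
    }
    return r;
}

/* Truth table for a | (b ^ c). Illustrative choice, not taken from the commit. */
#define IMM_A_OR_B_XOR_C 0xF6

/* Returns nonzero iff the first 32 bytes of s1 and s2 differ.
   Each step xors one 8-byte chunk pair and ors the difference bits
   into the accumulator, so only one final test is needed. */
static int block32_differs(const void *s1, const void *s2)
{
    uint64_t x[4], y[4];
    memcpy(x, s1, sizeof x);
    memcpy(y, s2, sizeof y);

    uint64_t acc = x[0] ^ y[0];  /* plain xor for the first chunk */
    for (int i = 1; i < 4; i++)
        acc = ternlog64(acc, x[i], y[i], IMM_A_OR_B_XOR_C);
    return acc != 0;
}
```

block32_differs only answers whether the block differs. A real memcmp would still have to locate the first mismatching byte to produce an ordered result, which this sketch leaves out.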