aboutsummaryrefslogtreecommitdiff
path: root/dlfcn
diff options
context:
space:
mode:
authorH.J. Lu <hjl.tools@gmail.com>2021-10-29 12:56:53 -0700
committerAndreas K. Hüttel <dilfridge@gentoo.org>2022-05-14 23:09:22 +0200
commit4c02e665b205ac953d57df71c90421f6d7a4e312 (patch)
treeaa8caccf96e3d78b59440b3fc218c4a0b7144252 /dlfcn
parentx86-64: Improve EVEX strcmp with masked load (diff)
downloadglibc-4c02e665b205ac953d57df71c90421f6d7a4e312.tar.gz
glibc-4c02e665b205ac953d57df71c90421f6d7a4e312.tar.bz2
glibc-4c02e665b205ac953d57df71c90421f6d7a4e312.zip
x86-64: Remove Prefer_AVX2_STRCMP
Remove Prefer_AVX2_STRCMP to enable EVEX strcmp. When comparing 2 32-byte strings, EVEX strcmp has been improved to require 1 load, 1 VPTESTM, 1 VPCMP, 1 KMOVD and 1 INCL instead of 2 loads, 3 VPCMPs, 2 KORDs, 1 KMOVD and 1 TESTL while AVX2 strcmp requires 1 load, 2 VPCMPEQs, 1 VPMINU, 1 VPMOVMSKB and 1 TESTL. EVEX strcmp is now faster than AVX2 strcmp by up to 40% on Tiger Lake and Ice Lake. (cherry picked from commit 14dbbf46a007ae5df36646b51ad0c9e5f5259f30) (cherry picked from commit a182bb7a3922404f79def09d79ef89678b4049f0)
Diffstat (limited to 'dlfcn')
0 files changed, 0 insertions, 0 deletions