Commit de504945, authored by Andy Polyakov

Two extra instructions in RC4 character loop give 80% performance
improvement on Core2. I still need to detect Core2 and choose this
path...
Parent 3d1def01
@@ -221,6 +221,8 @@ $code.=<<___;
 	movb	$TY#b,($dat,$XX[0])
 	add	$TX[0]#b,$TY#b
 	add	\$1,$XX[0]#b
+	movzb	$TY#b,$TY#d
+	movzb	$XX[0]#b,$XX[0]#d
 	movzb	($dat,$TY),$TY#d
 	movzb	($dat,$XX[0]),$TX[0]#d
 	xorb	($inp),$TY#b
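
For readers who do not read perlasm, the byte-at-a-time loop touched by this hunk can be sketched in C. This is a minimal sketch of textbook RC4, not OpenSSL's code: the `rc4_sketch` type and both function names are hypothetical, and the state layout is an assumption made for illustration. The `& 0xff` masks stand in for the byte-wide adds in the assembly. The two `movzb` register-to-register instructions zero-extend the byte results to 32 bits before they are used as table indices. The commit message does not say why this helps on Core2; avoiding a partial-register stall is a plausible reading, but that is an assumption.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical state layout for illustration; not OpenSSL's RC4_KEY. */
typedef struct {
    uint8_t x, y;      /* the two RC4 indices */
    uint8_t s[256];    /* the permutation table ($dat in the perlasm) */
} rc4_sketch;

/* Textbook RC4 key schedule. */
static void rc4_sketch_init(rc4_sketch *st, const uint8_t *key, size_t keylen)
{
    assert(keylen > 0 && keylen <= 256);
    for (int i = 0; i < 256; i++)
        st->s[i] = (uint8_t)i;
    uint8_t j = 0;
    for (int i = 0; i < 256; i++) {
        j = (uint8_t)(j + st->s[i] + key[i % keylen]);
        uint8_t t = st->s[i];
        st->s[i] = st->s[j];
        st->s[j] = t;
    }
    st->x = 0;
    st->y = 0;
}

/* One output byte per iteration, like the "character loop" in the diff. */
static void rc4_sketch_crypt(rc4_sketch *st, const uint8_t *in,
                             uint8_t *out, size_t len)
{
    uint32_t x = st->x, y = st->y;
    for (size_t n = 0; n < len; n++) {
        x = (x + 1) & 0xff;               /* byte-wide increment of x */
        uint32_t tx = st->s[x];
        y = (y + tx) & 0xff;              /* byte-wide add into y */
        uint32_t ty = st->s[y];
        st->s[y] = (uint8_t)tx;           /* swap S[x] and S[y] */
        st->s[x] = (uint8_t)ty;
        uint32_t idx = (tx + ty) & 0xff;  /* byte sum, zero-extended to an index */
        out[n] = (uint8_t)(in[n] ^ st->s[idx]);
    }
    st->x = (uint8_t)x;
    st->y = (uint8_t)y;
}
```

The assembly orders the loads differently (it fetches the next `S[x]` before the current iteration finishes), so the sketch matches the loop's result rather than its instruction schedule.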