提交 · a8f302e5bae18ce129b81a3f7a5f3ea7f9785ca1 · OpenHarmony / Third Party Openssl

26 11月, 2017 1 次提交

poly1305/asm/poly1305-x86_64.pl: switch to pure AVX512F. · a8f302e5

由 Andy Polyakov 提交于 11月 20, 2017

Convert AVX512F+VL+BW code path to pure AVX512F, so that it can be
executed even on Knights Landing. Trigger for modification was
observation that AVX512 code paths can negatively affect overall
Skylake-X system performance. Since we are likely to suppress
AVX512F capability flag [at least on Skylake-X], conversion serves
as kind of "investment protection".
Reviewed-by: NRich Salz <rsalz@openssl.org>
(Merged from https://github.com/openssl/openssl/pull/4758)

a8f302e5

12 11月, 2017 1 次提交

Many spelling fixes/typo's corrected. · 46f4e1be

由 Josh Soref 提交于 11月 11, 2017

Around 138 distinct errors found and fixed; thanks!
Reviewed-by: NKurt Roeckx <kurt@roeckx.be>
Reviewed-by: NTim Hudson <tjh@openssl.org>
Reviewed-by: NRich Salz <rsalz@openssl.org>
(Merged from https://github.com/openssl/openssl/pull/3459)

46f4e1be

21 7月, 2017 1 次提交

x86_64 assembly pack: "optimize" for Knights Landing, add AVX-512 results. · 64d92d74

由 Andy Polyakov 提交于 7月 20, 2017

"Optimize" is in quotes because it's rather a "salvage operation"
for now. Idea is to identify processor capability flags that
drive Knights Landing to suboptimial code paths and mask them.
Two flags were identified, XSAVE and ADCX/ADOX. Former affects
choice of AES-NI code path specific for Silvermont (Knights Landing
is of Silvermont "ancestry"). And 64-bit ADCX/ADOX instructions are
effectively mishandled at decode time. In both cases we are looking
at ~2x improvement.

AVX-512 results cover even Skylake-X :-)

Hardware used for benchmarking courtesy of Atos, experiments run by
Romain Dolbeau <romain.dolbeau@atos.net>. Kudos!
Reviewed-by: NRich Salz <rsalz@openssl.org>

64d92d74

04 7月, 2017 1 次提交
- A
  x86_64 assembly pack: fill some blanks in Ryzen results. · 54f8f9a1
  由 Andy Polyakov 提交于 6月 30, 2017
```
Reviewed-by: NBernd Edlinger <bernd.edlinger@hotmail.de>
```
  54f8f9a1
22 3月, 2017 2 次提交

poly1305/asm/poly1305-x86_64.pl: add poly1305_blocks_vpmadd52_8x. · 0a5d1a38

由 Andy Polyakov 提交于 3月 18, 2017

As hinted by its name new subroutine processes 8 input blocks in
parallel by loading data to 512-bit registers. It still needs more
work, as it needs to handle some specific input lengths better.
In this sense it's yet another intermediate step...
Reviewed-by: NRich Salz <rsalz@openssl.org>

0a5d1a38

A
x86_64 assembly pack: add some Ryzen performance results. · 6cbfd94d
由 Andy Polyakov 提交于 3月 18, 2017
```
Reviewed-by: NTim Hudson <tjh@openssl.org>
```
6cbfd94d

14 3月, 2017 1 次提交

poly1305/asm/poly1305-x86_64.pl: add poly1305_blocks_vpmadd52_4x. · c2b93590

由 Andy Polyakov 提交于 3月 12, 2017

As hinted by its name new subroutine processes 4 input blocks in
parallel. It still operates on 256-bit registers and is just
another step toward full-blown AVX512IFMA procedure.
Reviewed-by: NRich Salz <rsalz@openssl.org>

c2b93590

27 2月, 2017 2 次提交
- A
  poly1305/asm/poly1305-x86_64.pl: minor AVX512 optimization. · e052083c
  由 Andy Polyakov 提交于 2月 25, 2017
```
Reviewed-by: NRich Salz <rsalz@openssl.org>
```
  e052083c
- A
  poly1305/asm/poly1305-x86_64.pl: add CFI annotations. · 1c47e883
  由 Andy Polyakov 提交于 2月 25, 2017
```
Reviewed-by: NRich Salz <rsalz@openssl.org>
```
  1c47e883
26 2月, 2017 3 次提交

A
poly1305/asm/poly1305-x86_64.pl: add VPMADD52 code path. · fd910ef9
由 Andy Polyakov 提交于 12月 30, 2016
```
This is initial and minimal single-block implementation.
Reviewed-by: NRich Salz <rsalz@openssl.org>
```
fd910ef9
A
poly1305/asm/poly1305-x86_64.pl: switch to vpermdd in table expansion. · 73e8a5c8
由 Andy Polyakov 提交于 12月 25, 2016
```
Effectively it's minor size optimization, 5-6% per affected subroutine.
Reviewed-by: NRich Salz <rsalz@openssl.org>
```
73e8a5c8

poly1305/asm/poly1305-x86_64.pl: optimize AVX512 code path. · c1e1fc50

由 Andy Polyakov 提交于 12月 25, 2016

On pre-Skylake best optimization strategy was balancing port-specific
instructions, while on Skylake minimizing the sheer amount appears
more sensible.
Reviewed-by: NRich Salz <rsalz@openssl.org>

c1e1fc50

16 12月, 2016 1 次提交
- A
  poly1305/asm/poly1305-x86_64.pl: allow nasm to assemble AVX512 code. · 1ea01427
  由 Andy Polyakov 提交于 12月 14, 2016
```
chacha/asm/chacha-x86_64.pl: refine nasm version detection logic.
Reviewed-by: NRichard Levitte <levitte@openssl.org>
```
  1ea01427
12 12月, 2016 1 次提交
- A
  x86_64 assembly pack: add AVX512 ChaCha20 and Poly1305 code paths. · abb8c44f
  由 Andy Polyakov 提交于 12月 09, 2016
```
Reviewed-by: NRich Salz <rsalz@openssl.org>
```
  abb8c44f
24 10月, 2016 1 次提交
- A
  x86_64 assembly pack: add Goldmont performance results. · ace05265
  由 Andy Polyakov 提交于 10月 14, 2016
```
Reviewed-by: NRichard Levitte <levitte@openssl.org>
```
  ace05265
29 5月, 2016 1 次提交

x86_64 assembly pack: tolerate spaces in source directory name. · cfe1d992

由 Andy Polyakov 提交于 5月 28, 2016

[as it is now quoting $output is not required, but done just in case]
Reviewed-by: NRichard Levitte <levitte@openssl.org>

cfe1d992

21 5月, 2016 1 次提交
- R
  Add OpenSSL copyright to .pl files · 6aa36e8e
  由 Rich Salz 提交于 5月 21, 2016
```
Reviewed-by: NRichard Levitte <levitte@openssl.org>
```
  6aa36e8e
06 5月, 2016 2 次提交
- A
  poly1305/asm/poly1305-x86_64.pl: contain symbols within shared lib. · 3992e8c0
  由 Andy Polyakov 提交于 5月 04, 2016
```
We don't need it, but external users might find it handy.
Reviewed-by: NRichard Levitte <levitte@openssl.org>
```
  3992e8c0
- A
  poly1305/asm/poly1305-x86_64.pl: make it cross-compile. · 28411657
  由 Andy Polyakov 提交于 5月 04, 2016
```
Reviewed-by: NRichard Levitte <levitte@openssl.org>
```
  28411657
20 4月, 2016 1 次提交
- A
  poly1305/asm/poly1305-x86_64.pl: not all assemblers manage << in constants. · 6ca3e6e7
  由 Andy Polyakov 提交于 4月 18, 2016
```
Reviewed-by: NRichard Levitte <levitte@openssl.org>
```
  6ca3e6e7
04 4月, 2016 1 次提交

crypto/poly1305: don't break carry chains. · 4b8736a2

由 Andy Polyakov 提交于 3月 29, 2016

RT#4483

[poly1305-armv4.pl: remove redundant #ifdef __thumb2__]
[poly1305-ppc*.pl: presumably more accurate benchmark results]
Reviewed-by: NRichard Levitte <levitte@openssl.org>

4b8736a2

16 3月, 2016 1 次提交
- A
  poly1305/asm/poly1305-x86_64.pl: make it work with linux-x32. · 2460c7f1
  由 Andy Polyakov 提交于 3月 15, 2016
```
Reviewed-by: NRichard Levitte <levitte@openssl.org>
```
  2460c7f1
02 3月, 2016 1 次提交

poly1305/asm/poly1305-*.pl: flip horizontal add and reduction. · 1ea8ae50

由 Andy Polyakov 提交于 2月 28, 2016

Formally only 32-bit AVX2 code path needs this, but I choose to
harmonize all vector code paths.

RT#4346
Reviewed-by: NRichard Levitte <levitte@openssl.org>

1ea8ae50

12 2月, 2016 2 次提交
- A
  poly1305/asm/poly1305-x86_64.pl: MacOS X portability fix. · 4ef29667
  由 Andy Polyakov 提交于 2月 11, 2016
```
Reviewed-by: NViktor Dukhovni <viktor@openssl.org>
```
  4ef29667
- A
  poly1305/asm/poly1305-x86_64.pl: fix mingw64 build. · a85dbf11
  由 Andy Polyakov 提交于 2月 11, 2016
```
Reviewed-by: NTim Hudson <tjh@openssl.org>
```
  a85dbf11
10 2月, 2016 1 次提交
- A
  x86[_64] assembly pack: add ChaCha20 and Poly1305 modules. · a98c648e
  由 Andy Polyakov 提交于 12月 13, 2015
```
Reviewed-by: NRich Salz <rsalz@openssl.org>
```
  a98c648e

OpenHarmony / Third Party Openssl 大约 1 年 前同步成功

OpenHarmony / Third Party Openssl
大约 1 年前同步成功