1. 22 Dec 2021, 1 commit
  2. 21 Dec 2021, 1 commit
  3. 17 Dec 2021, 3 commits
  4. 15 Dec 2021, 3 commits
  5. 03 Dec 2021, 1 commit
  6. 02 Dec 2021, 1 commit
  7. 30 Nov 2021, 6 commits
    • add sum of 1 input · 33e97e99
      Committed by Smirnov Egor
    • add default order to transpose · 11e6848b
      Committed by Smirnov Egor
    • add new (Log)SoftMax simplification passes · 82941072
      Committed by Smirnov Egor
    • add alpha parameter to ELU layer · 0e2a3686
      Committed by Smirnov Egor
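      For context, ELU with an explicit alpha is commonly defined as below (a generic sketch, not the OpenCV layer code); alpha scales the negative, saturating branch.

        #include <cmath>

        // f(x) = x                      for x > 0
        // f(x) = alpha * (exp(x) - 1)   for x <= 0
        static inline float elu(float x, float alpha)
        {
            return x > 0.f ? x : alpha * (std::exp(x) - 1.f);
        }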
    • Merge pull request #20658 from smbz:lstm_optimisation · ea7d4be3
      Committed by Andrew Ryrie
      * dnn: LSTM optimisation
      
      This uses the AVX-optimised fastGEMM1T for matrix multiplications where available, instead of the standard cv::gemm.
      
       fastGEMM1T is already used by the fully-connected layer.  This commit involves two minor modifications (a simplified sketch follows this list):
        - Use unaligned access.  I don't believe this involves any performance hit on modern CPUs (Nehalem and Bulldozer onwards) in the case where the address is actually aligned.
        - Allow for weight matrices where the number of columns is not a multiple of 8.
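
       A simplified sketch of the idea (illustrative only; the function name and shapes are assumptions, not the actual fastGEMM1T code): an AVX matrix-vector product using unaligned loads, with a scalar tail so the number of columns need not be a multiple of 8.

         #include <immintrin.h>

         void matvecAvxSketch(const float* W, const float* x, float* y,
                              int rows, int cols)   // W is rows x cols, row-major
         {
             for (int r = 0; r < rows; ++r)
             {
                 const float* w = W + (size_t)r * cols;
                 __m256 acc = _mm256_setzero_ps();
                 int c = 0;
                 for (; c + 8 <= cols; c += 8)      // unaligned loads, no alignment assumed
                     acc = _mm256_add_ps(acc, _mm256_mul_ps(_mm256_loadu_ps(w + c),
                                                            _mm256_loadu_ps(x + c)));
                 float buf[8];
                 _mm256_storeu_ps(buf, acc);
                 float s = buf[0] + buf[1] + buf[2] + buf[3]
                         + buf[4] + buf[5] + buf[6] + buf[7];
                 for (; c < cols; ++c)              // scalar tail when cols % 8 != 0
                     s += w[c] * x[c];
                 y[r] = s;
             }
         }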
      
      I have not enabled AVX-512 as I don't have an AVX-512 CPU to test on.
      
      * Fix warning about initialisation order
      
      * Remove C++11 syntax
      
      * Fix build when AVX(2) is not available
      
      In this case the CV_TRY_X macros are defined to 0, rather than being undefined.
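
       A minimal sketch of the guard this implies, using CV_TRY_AVX2 (provided by OpenCV's CPU dispatch headers) as the example macro:

         #if CV_TRY_AVX2            // correct: expands to 0 when dispatch is disabled
             // ... AVX2-specific code path ...
         #endif

         // #ifdef CV_TRY_AVX2      // wrong: still taken when the macro is defined as 0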
      
      * Minor changes as requested:
      
       - Don't check hardware support for AVX(2) when dispatch is disabled for these
       - Add braces
      
      * Fix out-of-bounds access in fully connected layer
      
       The old tail handling in fastGEMM1T implicitly rounded vecsize up to the next multiple of 8, and the fully connected layer implements padding up to the next multiple of 8 to cope with this.  The new tail handling does not round the vecsize upwards like this, but it does require that the vecsize is at least 8.  To adapt to the new tail handling, the fully connected layer now rounds vecsize itself at the same time as adding the padding (which makes more sense anyway).
      
      This also means that the fully connected layer always passes a vecsize of at least 8 to fastGEMM1T, which fixes the out-of-bounds access problems.
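
       A minimal sketch of that rounding (a hypothetical helper, not the actual layer code): the weight row is padded with zeros so the length handed to fastGEMM1T is always a multiple of 8, and therefore at least 8.

         #include <algorithm>
         #include <vector>

         // Pad one weight row up to the next multiple of 8; zeros in the
         // tail leave the dot product unchanged.
         static std::vector<float> padRowTo8(const float* row, int vecsize)
         {
             int aligned = (vecsize + 7) & ~7;
             std::vector<float> padded(aligned, 0.f);
             std::copy(row, row + vecsize, padded.begin());
             return padded;
         }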
      
      * Improve tail mask handling
      
       - Use static array for generating tail masks (as requested)
       - Apply tail mask to the weights as well as the input vectors to prevent spurious propagation of NaNs/Infs
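
       A sketch of that tail-mask scheme (illustrative; names are assumptions, not the exact OpenCV code): a static table of -1/0 words yields the mask for the final partial block of 8, and the mask is applied to both operands via masked loads so junk lanes (possibly NaN or Inf) never reach the accumulator.

         #include <immintrin.h>

         static const int maskTable[16] = { -1,-1,-1,-1,-1,-1,-1,-1,  0,0,0,0,0,0,0,0 };

         // tail = number of valid remaining elements, 1..7
         static inline __m256 maskedTailProduct(const float* w, const float* x, int tail)
         {
             __m256i mask = _mm256_loadu_si256((const __m256i*)(maskTable + 8 - tail));
             __m256 wv = _mm256_maskload_ps(w, mask);   // masked-off lanes load as 0.0f
             __m256 xv = _mm256_maskload_ps(x, mask);   // mask the weights and the input alike
             return _mm256_mul_ps(wv, xv);              // safe to accumulate: no junk lanes
         }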
      
      * Revert whitespace change
      
      * Improve readability of conditions for using AVX
      
      * dnn(lstm): minor coding style changes, replaced left aligned load
      ea7d4be3
    • fix Clip, LeakyReLU, LRN, Split defaults · 05db8784
      Committed by Smirnov Egor
  8. 28 Nov 2021, 3 commits
  9. 27 Nov 2021, 1 commit
  10. 12 Nov 2021, 1 commit
  11. 10 Nov 2021, 1 commit
    • Merge pull request #20904 from Crayon-new:fix_bug_in_maxLayer · 98b6ce35
      Committed by ZaKiiiiiiiii
       fix bug: wrong output dimension when "keep_dims" is false in the pooling layer (a shape sketch follows this entry).
      
      * fix bug in max layer
      
      * code align
      
      * delete permute layer and add test case
      
      * add name assert
      
      * check other cases
      
      * remove c++11 features
      
       * style: add "const", remove assert
       
       * style: sanitize file names
      98b6ce35
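       A shape sketch of the intended behaviour (generic reduce/pool semantics, not the actual layer code): for a global max pool over an N x C x H x W input, keep_dims decides whether the pooled spatial axes are kept as size-1 dimensions or dropped.

         #include <vector>

         std::vector<int> globalPoolOutShape(const std::vector<int>& in /* {N, C, H, W} */,
                                             bool keep_dims)
         {
             if (keep_dims)
                 return { in[0], in[1], 1, 1 };   // N x C x 1 x 1
             return { in[0], in[1] };             // N x C: spatial axes removed entirely
         }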
  12. 04 Nov 2021, 1 commit
  13. 03 Nov 2021, 1 commit
  14. 19 Oct 2021, 1 commit
  15. 12 Oct 2021, 1 commit
  16. 11 Oct 2021, 1 commit
  17. 08 Oct 2021, 2 commits
  18. 07 Oct 2021, 1 commit
    • Merge pull request #20725 from mologie:fix-dnn-tf-on-arm · a3d7811f
      Committed by Oliver Kuckertz
      * dnn: fix unaligned memory access crash on armv7
      
      The getTensorContent function would return a Mat pointing to some
      member of a Protobuf-encoded message. Protobuf does not make any
      alignment guarantees, which results in a crash on armv7 when loading
      models while bit 2 is set in /proc/cpu/alignment (or the relevant
      kernel feature for alignment compatibility is disabled). Any read
      attempt from the previously unaligned data member would send SIGBUS.
      
       As a workaround, this commit makes an aligned copy via the
       existing clone functionality in getTensorContent (sketched after
       this entry). The unsafe copy=false option is removed.
       Unfortunately, a rather crude hack in PReLUSubgraph in fact
       writes(!) to the Protobuf message. We limit ourselves to fixing
       the alignment issues in this commit, and add
       getTensorContentRefUnaligned to cover the write case with a safe
       memcpy. A FIXME marks the issue.
      
      * dnn: reduce amount of .clone() calls
      
      * dnn: update FIXME comment
       Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
      a3d7811f
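       A sketch of the workaround described above (simplified; the real getTensorContent reads fields of the parsed Protobuf tensor): copy the possibly unaligned Protobuf buffer into a freshly allocated cv::Mat, so later element access happens on properly aligned memory.

         #include <cstring>
         #include <opencv2/core.hpp>

         // buf points into a Protobuf message and may be unaligned; count is in floats.
         static cv::Mat alignedCopyFloat(const char* buf, size_t count)
         {
             cv::Mat m(1, (int)count, CV_32F);                  // freshly allocated, aligned
             std::memcpy(m.data, buf, count * sizeof(float));   // memcpy tolerates an unaligned source
             return m;
         }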
  19. 06 Oct 2021, 1 commit
  20. 05 Oct 2021, 1 commit
  21. 02 Oct 2021, 1 commit
  22. 29 Sep 2021, 1 commit
  23. 17 Sep 2021, 1 commit
  24. 15 Sep 2021, 1 commit
  25. 12 Sep 2021, 1 commit
  26. 11 Sep 2021, 1 commit
  27. 10 Sep 2021, 2 commits