Commits · b4e9b4fe7d6a00bb51aa2ed7f608a15a51a38c19 · Greenplum / Opencv

09 Aug, 2020 · 1 commit
- fix compile-time errors, disable unsupported tests · f0149cda
  Committed by YashasSamaga on Aug 09, 2020
04 Aug, 2020 · 1 commit
- Merge pull request #17967 from l-bat:non_const_weights_for_conv · d6952087
  Committed by Liubov Batanina on Aug 03, 2020
```
* Supported convolution with non-const weights

* Fix opencl blobs

* Update tests
```
20 Jul, 2020 · 1 commit
- Support Gather for variable inputs · a35d4f90
  Committed by Liubov Batanina on Jul 20, 2020
16 Jul, 2020 · 1 commit
- dnn(test): adjust tests for OpenVINO 2020.4 · 1c371d07
  Committed by Alexander Alekhin on Jul 15, 2020
11 Jun, 2020 · 1 commit
- allow multiple inputs to resize, fix tests · 265acccd
  Committed by YashasSamaga on Jun 11, 2020
12 May, 2020 · 2 commits
- Switch v1::Multiply to v0::Multiply · b27ae9c6
  Committed by Liubov Batanina on May 12, 2020
- Merge pull request #17233 from l-bat:onnx_bn · 79f8b7fd
  Committed by Liubov Batanina on May 12, 2020
```
* Added ONNX BatchNorm subgraph

* Move removing constant inputs to addConstantNodesForInitializers

* Added initializers to ONNXGraphWrapper
```
30 Apr, 2020 · 1 commit
- dnn(test): update skip tests on Win32 configuration · b805115c
  Committed by Alexander Alekhin on Apr 29, 2020
22 Apr, 2020 · 2 commits
- dnn(test): skip failed NGRAPH/MYRIAD tests · 83c4378d
  Committed by Alexander Alekhin on Apr 22, 2020
- add fused batchNorm Upsample · e0ac0cfb
  Committed by ashishiva3@gmail.com on Mar 19, 2020
21 Apr, 2020 · 1 commit
- modification for upsample node fused from unfused Resize subgraph · d37180a2
  Committed by AshihsKrShrivastava on Apr 10, 2020
13 Apr, 2020 · 1 commit
- Enable ONNX SSD from https://github.com/amdegroot/ssd.pytorch · d3f9ad11
  Committed by Dmitry Kurtaev on Mar 28, 2020
11 Apr, 2020 · 1 commit
- ReflectionPad2d and ZeroPad2d Subgraph fusion added · bef6b628
  Committed by AshihsKrShrivastava on Apr 05, 2020
07 Apr, 2020 · 1 commit

Merge pull request #16840 from l-bat:matmul_inputs · 73477141

Committed by Liubov Batanina on Apr 07, 2020

* Supported FullyConnected layer with two inputs

* Skipped test

* Fix conditions

* Added OpenCL support

* Supported ReduceMean3D

* Supported Expand layer

* Fix warning

* Added Normalize subgraph

* refactoring

* Used addLayer

* Fix check

* Used addLayer

* Skip failed test

* Added normalize1 subgraph

* Fix comments


04 Apr, 2020 · 1 commit
- Case sensitive dnn layers types · 8574a757
  Committed by Dmitry Kurtaev on Mar 22, 2020
23 Mar, 2020 · 1 commit
- Add checks for LSTM initial h and c · 467c3ef0
  Committed by Dmitry Kurtaev on Mar 22, 2020
22 Mar, 2020 · 1 commit
- Bidirectional LSTM · 84336202
  Committed by Dmitry Kurtaev on Mar 22, 2020
18 Mar, 2020 · 2 commits
- LSTM from ONNX works · 8d69dbdf
  Committed by Dmitry Kurtaev on Mar 15, 2020
- LSTM scalar · 14da5ec3
  Committed by Dmitry Kurtaev on Mar 15, 2020
17 Mar, 2020 · 1 commit

Merge pull request #16715 from l-bat:slice_onnx · 718d7e4b

Committed by Liubov Batanina on Mar 17, 2020

* Support Slice layer with multiple inputs

* Add test

* Supported Resize from PyTorch

* Rewrite test

* Remove Cast layer (supported in #16735)

* Support ConstantOfShape

* Fix tests

* Fix comments

* Remove useless condition

* Fixed failed tests


14 Mar, 2020 · 1 commit

Merge pull request #16735 from l-bat:flatten_const_onnx · 2645ee90

Committed by Liubov Batanina on Mar 14, 2020

* Supported Flatten for constant nodes

* Added default axis

* Refactoring

* Refactoring

* Added cast layer

* Fix comments

* Add Cast for layers


06 Mar, 2020 · 1 commit
- Broadcasting from ONNX · 9e332dc5
  Committed by Dmitry Kurtaev on Mar 05, 2020
04 Mar, 2020 · 1 commit
- Merge pull request #16722 from l-bat:reshape_opset_11 · 9ed13323
  Committed by Liubov Batanina on Mar 04, 2020
```
* Supported Div op for constants

* Added Mul test
```
03 Mar, 2020 · 1 commit
- Gather-Cast, Mul-Cast fusion · e18d5e94
  Committed by ashishiva3@gmail.com on Mar 01, 2020
02 Mar, 2020 · 1 commit
- Skipped ResizeUnfused test on Builder API · b1b78aed
  Committed by Liubov Batanina on Mar 02, 2020
29 Feb, 2020 · 1 commit
- ONNX: upsample subgraph fusion added · 8559237d
  Committed by ashishiva3@gmail.com on Feb 13, 2020
25 Feb, 2020 · 1 commit
- dnn(test): configure filtering for 32-bit systems (part 2) · c2f5f5a2
  Committed by Alexander Alekhin on Feb 24, 2020
23 Feb, 2020 · 1 commit
- dnn(test): configure filtering for 32-bit systems · 1540ae34
  Committed by Alexander Alekhin on Feb 20, 2020
18 Feb, 2020 · 1 commit

Merge pull request #16472 from l-bat:cp_vton · e970eccb

Committed by Liubov Batanina on Feb 17, 2020

Add CP-VTON sample

* Support resize from PyTorch

* Add CP-VTON sample

* Fix downsampling

* Fix test

* Add model links

* Add default args

* Speed up resize

* Fix TOM link

* Add default args

* Fix comments

* Set aspect ratio for input

* Update links

* Check files exist


15 Feb, 2020 · 1 commit

Merge pull request #16424 from czgdp1807:issue-16370 · a6f3a212

Committed by Gagandeep Singh on Feb 15, 2020

* fixed Split layer in ONNXImporter

* added test for fix of split layer

* fixed tests for Split layer

* applied reviews

* updated tests

* fixed paths in tests


14 Jan, 2020 · 1 commit
- ONNX graphs simplifier · c1c84d2f
  Committed by Dmitry Kurtaev on Jan 06, 2020
13 Jan, 2020 · 1 commit

Disable some tests for Myriad target of nGraph · 8f1e36f7

Committed by Dmitry Kurtaev on Dec 24, 2019

Add lightweight IE hardware targets checks

nGraph: Concat with paddings

Enable more nGraph tests

Restore FP32->FP16 for GPU plugin of IE

try to fix buildbot

Use lightweight IE targets check only starting from R4


20 Dec, 2019 · 1 commit

Merge pull request #16010 from YashasSamaga:cuda4dnn-fp16-tests · 1fac1421

Committed by Yashas Samaga B L on Dec 20, 2019

* enable tests for DNN_TARGET_CUDA_FP16

* disable deconvolution tests

* disable shortcut tests

* fix typos and some minor changes

* dnn(test): skip CUDA FP16 test too (run_pool_max)


06 Dec, 2019 · 1 commit
- add DIV support to EltwiseOp · a91eca6e
  Committed by YashasSamaga on Dec 06, 2019
02 Dec, 2019 · 1 commit
- Merge pull request #15537 from l-bat:ngraph · 7523c777
  Committed by Lubov Batanina on Dec 02, 2019
```
* Support nGraph

* Fix resize
```
09 Nov, 2019 · 1 commit

Merge pull request #15811 from l-bat:eltwise_div · cfc78194

Committed by Lubov Batanina on Nov 09, 2019

Supported ONNX Squeeze, ReduceL2 and Eltwise::DIV

* Support eltwise div

* Fix test

* OpenCL support added

* refactoring

* fix code style

* Only squeeze with axes supported


21 Oct, 2019 · 1 commit

Merge pull request #14827 from YashasSamaga:cuda4dnn-csl-low · 613c12e5

Committed by Yashas Samaga B L on Oct 21, 2019

CUDA backend for the DNN module

* stub cuda4dnn design

* minor fixes for tests and doxygen

* add csl public api directory to module headers

* add low-level CSL components

* add high-level CSL components

* integrate csl::Tensor into backbone code

* switch to CPU iff unsupported; otherwise, fail on error

* add fully connected layer

* add softmax layer

* add activation layers

* support arbitrary rank TensorDescriptor

* pass input wrappers to `initCUDA()`

* add 1d/2d/3d-convolution

* add pooling layer

* reorganize and refactor code

* fixes for gcc, clang and doxygen; remove cxx14/17 code

* add blank_layer

* add LRN layer

* add rounding modes for pooling layer

* split tensor.hpp into tensor.hpp and tensor_ops.hpp

* add concat layer

* add scale layer

* add batch normalization layer

* split math.cu into activations.cu and math.hpp

* add eltwise layer

* add flatten layer

* add tensor transform api

* add asymmetric padding support for convolution layer

* add reshape layer

* fix rebase issues

* add permute layer

* add padding support for concat layer

* refactor and reorganize code

* add normalize layer

* optimize bias addition in scale layer

* add prior box layer

* fix and optimize normalize layer

* add asymmetric padding support for pooling layer

* add event API

* improve pooling performance for some padding scenarios

* avoid over-allocation of compute resources to kernels

* improve prior box performance

* enable layer fusion

* add const layer

* add resize layer

* add slice layer

* add padding layer

* add deconvolution layer

* fix channelwise ReLU initialization

* add vector traits

* add vectorized versions of relu, clipped_relu, power

* add vectorized concat kernels

* improve concat_with_offsets performance

* vectorize scale and bias kernels

* add support for multi-billion element tensors

* vectorize prior box kernels

* fix address alignment check

* improve bias addition performance of conv/deconv/fc layers

* restructure code for supporting multiple targets

* add DNN_TARGET_CUDA_FP64

* add DNN_TARGET_FP16

* improve vectorization

* add region layer

* improve tensor API, add dynamic ranks

1. use ManagedPtr instead of a Tensor in backend wrapper
2. add new methods to tensor classes
  - size_range: computes the combined size for a given axis range
  - tensor span/view can be constructed from a raw pointer and shape
3. the tensor classes can change their rank at runtime (previously rank was fixed at compile-time)
4. remove device code from tensor classes (as they are unused)
5. enforce strict conditions on tensor class APIs to improve debugging ability

* fix parametric relu activation

* add squeeze/unsqueeze tensor API

* add reorg layer

* optimize permute and enable 2d permute

* enable 1d and 2d slice

* add split layer

* add shuffle channel layer

* allow tensors of different ranks in reshape primitive

* patch SliceOp to allow Crop Layer

* allow extra shape inputs in reshape layer

* use `std::move_backward` instead of `std::move` for insert in resizable_static_array

* improve workspace management

* add spatial LRN

* add nms (cpu) to region layer

* add max pooling with argmax ( and a fix to limits.hpp)

* add max unpooling layer

* rename DNN_TARGET_CUDA_FP32 to DNN_TARGET_CUDA

* update supportBackend to be more rigorous

* remove stray include from preventing non-cuda build

* include op_cuda.hpp outside condition #if

* refactoring, fixes and many optimizations

* drop DNN_TARGET_CUDA_FP64

* fix gcc errors

* increase max. tensor rank limit to six

* add Interp layer

* drop custom layers; use BackendNode

* vectorize activation kernels

* fixes for gcc

* remove wrong assertion

* fix broken assertion in unpooling primitive

* fix build errors in non-CUDA build

* completely remove workspace from public API

* fix permute layer

* enable accuracy and perf. tests for DNN_TARGET_CUDA

* add asynchronous forward

* vectorize eltwise ops

* vectorize fill kernel

* fixes for gcc

* remove CSL headers from public API

* remove csl header source group from cmake

* update min. cudnn version in cmake

* add numerically stable FP32 log1pexp

* refactor code

* add FP16 specialization to cudnn based tensor addition

* vectorize scale1 and bias1 + minor refactoring

* fix doxygen build

* fix invalid alignment assertion

* clear backend wrappers before allocateLayers

* ignore memory lock failures

* do not allocate internal blobs

* integrate NVTX

* add numerically stable half precision log1pexp

* fix indentation, follow coding style, improve docs

* remove accidental modification of IE code

* Revert "add asynchronous forward"

This reverts commit 1154b9da9da07e9b52f8a81bdcea48cf31c56f70.

* [cmake] throw error for unsupported CC versions

* fix rebase issues

* add more docs, refactor code, fix bugs

* minor refactoring and fixes

* resolve warnings/errors from clang

* remove haveCUDA() checks from supportBackend()

* remove NVTX integration

* changes based on review comments

* avoid exception when no CUDA device is present

* add color code for CUDA in Net::dump


04 Oct, 2019 · 2 commits
- Enable ENet with Inference Engine backend on CPU · e35fd463
  Committed by Dmitry Kurtaev on Oct 02, 2019
- dnn: update IE tests · fd11e3a8
  Committed by Alexander Alekhin on Oct 04, 2019
29 Jul, 2019 · 1 commit
- Add support for slice from ONNX with multiple outputs · f9f16040
  Committed by Dmitry Kurtaev on Jul 27, 2019

Greenplum / Opencv · synced successfully 11 months ago
