提交 · dab697c5a73bfd67df2f836c34dc2a18365197d0 · PaddlePaddle / Paddle-Lite

24 1月, 2020 1 次提交
- Z
  
  [NPU] clean code (#2798) · d23b3456
  由 zhupengyang 提交于 1月 24, 2020
  
  d23b3456
16 1月, 2020 1 次提交
- Z
  
  [NPU] enhance conv_transpose and ut (#2773) · b14e21c3
  由 zhupengyang 提交于 1月 16, 2020
  
  b14e21c3
15 1月, 2020 1 次提交
- H
  
  [LITE][NPU] Add layer_norm op bridge (#2767) · 974c50db
  由 hong19860320 提交于 1月 15, 2020
  
  974c50db
14 1月, 2020 3 次提交
- H
  [arm]add gemm + relu6/leakyrelu fusion (#2674) · 789accae
  由 HappyAngel 提交于 1月 14, 2020
```
add gemm + relu6/leakyrelu fusion
```
  789accae
- Z
  [NPU] enhance concat, nearest_interp, bilinear_interp ut (#2764) · 5209b4b6
  由 zhupengyang 提交于 1月 14, 2020
```
- enhance interp InferShape
```
  5209b4b6
- Support bitman backend,test=develop (#2761) · c4a87224
  由 myq406450149 提交于 1月 14, 2020
```
* Support bitman backend
```
  c4a87224
13 1月, 2020 2 次提交
- Z
  
  [NPU] enhance conv2d ut (#2753) · cd73447e
  由 zhupengyang 提交于 1月 13, 2020
  
  cd73447e
- H
  [LITE][NPU] Add instance_norm op bridge and unit test, refine the registration... · f7809701
  由 hong19860320 提交于 1月 13, 2020
```
[LITE][NPU] Add instance_norm op bridge and unit test, refine the registration of op bridges (#2747)
```
  f7809701
10 1月, 2020 1 次提交
- Z
  
  [XPU] cast op bridge and ut (#2738) · 4f917867
  由 zhupengyang 提交于 1月 10, 2020
  
  4f917867
09 1月, 2020 1 次提交
- Z
  
  [NPU] dropout op bridge and ut (#2745) · a0f455ee
  由 zhupengyang 提交于 1月 09, 2020
  
  a0f455ee
08 1月, 2020 1 次提交
- Z
  [NPU] enhance unittest for shuffle_channel, unsqueeze, pool (#2730) · d90f34de
  由 zhupengyang 提交于 1月 08, 2020
```
* [NPU] enhance unittest for shuffle_channel, unsqueeze, pool

test=develop
```
  d90f34de
07 1月, 2020 1 次提交
- Z
  [NPU] add host kernels, enhance reshape ut (#2733) · dabf181a
  由 zhupengyang 提交于 1月 07, 2020
```
test=develop
```
  dabf181a
03 1月, 2020 1 次提交
- Z
  [NPU] enhance unittest for bn, transpose (#2716) · eacc42f2
  由 zhupengyang 提交于 1月 03, 2020
```
test=develop
```
  eacc42f2
31 12月, 2019 3 次提交

X86 and cuda compile simutaneously cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo... · a48c8b23

由 Wilber 提交于 12月 31, 2019

X86 and cuda compile simutaneously cmake ..  -DCMAKE_BUILD_TYPE=RelWithDebInfo  -DWITH_MKL=ON           -DLITE_WITH_CUDA=ON           -DWITH_MKLDNN=OFF           -DLITE_WITH_X86=ON           -DLITE_WITH_PROFILE=OFF          -DWITH_LITE=OFF           -DLITE_WITH_LIGHT_WEIGHT_FRAMEWORK=OFF           -DWITH_PYTHON=OFF           -DWITH_TESTING=ON           -DLITE_WITH_ARM=OFF           -DLITE_ON_TINY_PUBLISH=OFF           -DCUDNN_ROOT=/usr/local/cudnn/           -DLITE_BUILD_EXTRA=ON (#2708)

x86 and cuda compile simutaneously

a48c8b23

Z
[XPU] bn unit test (#2706) · 9c124cb0
由 zhupengyang 提交于 12月 31, 2019
```
test=develop
```
9c124cb0

[LITE][NPU][XPU] Refine the registration and implementation of op bridges (#2700) · fb668935

由 hong19860320 提交于 12月 31, 2019

* Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU.

* Refine the registration of op bridges and make it similar to the registration of op and kernel.

* Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor.

* Add the unit test of the scale and softmax op bridge for NPU.

fb668935

26 12月, 2019 1 次提交
- Z
  [XPU] mul unittest (#2676) · 4df2ba00
  由 zhupengyang 提交于 12月 26, 2019
```
test=develop
```
  4df2ba00
24 12月, 2019 3 次提交
- H
  [LITE][NPU][XPU] Support multiple types for XPU and NPU op bridges (#2646) · dd5779d8
  由 hong19860320 提交于 12月 24, 2019
```
* Support multiple types for XPU and NPU op bridges

* Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT

* Fix the definition of lookup_table kernel for X86
```
  dd5779d8
- Z
  [XPU] add dropout bridge and unit test (#2650) · d185e8c1
  由 zhupengyang 提交于 12月 24, 2019
```
test=develop
```
  d185e8c1
- Z
  [XPU] elementwise_add, softmax unit test (#2653) · ae28c0f7
  由 zhupengyang 提交于 12月 24, 2019
```
* [XPU] elementwise_add unit test

* [XPU] softmax unit test

test=develop
```
  ae28c0f7
23 12月, 2019 1 次提交
- Y
  
  [ARM] add grid_sampler op and ut, test=develop (#2598) · 9875843e
  由 yiicy 提交于 12月 23, 2019
  
  9875843e
21 12月, 2019 1 次提交
- Z
  [XPU] add layer_norm bridge and unit test (#2640) · aadb65d2
  由 zhupengyang 提交于 12月 21, 2019
```
test=develop
```
  aadb65d2
20 12月, 2019 2 次提交
- Z
  [XPU] add reshape bridge and unit test (#2621) · 7bd142bd
  由 zhupengyang 提交于 12月 20, 2019
```
test=develop
```
  7bd142bd
- Z
  [XPU] add transpose bridge and unit test (#2630) · 3ef94cd4
  由 zhupengyang 提交于 12月 20, 2019
```
* [XPU] add transpose bridge and unit test

test=develop
```
  3ef94cd4
13 12月, 2019 1 次提交
- H
  [LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at... · 1dbcd51d
  由 hong19860320 提交于 12月 13, 2019
```
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
```
  1dbcd51d
10 12月, 2019 1 次提交
- Y
  
  [ARM] add instance norm op and ut, test=develop (#2578) · 933a3724
  由 yiicy 提交于 12月 10, 2019
  
  933a3724
07 12月, 2019 1 次提交

Support mask_rcnn (#2484) · cf31b835

由 juncaipeng 提交于 12月 07, 2019

* add arm split lod tensor, test=develop

* add arm merge lod tensor, test=develop

* update split merge lod tensor, test=develop

* add reduce_prob op, test=develop

* support mask_rcnn succeed, test=develop

cf31b835

16 11月, 2019 1 次提交
- H
  
  [LITE][X86] Add search_aligned_mat_mul and search_seq_fc op for X86 (#2428) · 2148bf49
  由 hong19860320 提交于 11月 16, 2019
  
  2148bf49
12 11月, 2019 1 次提交
- J
  Upgrade concat and unsqueeze, test=develop (#2378) · 284e8166
  由 juncaipeng 提交于 11月 12, 2019
```
* update concat and unsqueeze, test=develop
```
  284e8166
07 11月, 2019 1 次提交

check arm kernels type to make sure all_library_links work normally (#2386) · 6b38eab8

由 huzhiqiang 提交于 11月 06, 2019

We have changed 11 arm_kernels into extra type in #2347 , which has caused test_compiling failure. In this PR , we move their 11 related arm_kernel_test into build_extra=ON

6b38eab8

28 10月, 2019 1 次提交

[LITE][XPU] initial support for XPU (#2202) · ac1b2f9f

由 hong19860320 提交于 10月 28, 2019

* Initial support for XPU
* Fix compiling errors of XPU
* Move XPU op kernel bridges from backends to kernels to fix deps order
* Change the namespace and directory of XPU bridges
* Add XPU SDK
* Fix header files and namespace of XPU SDK
* Add unit tests for relu and conv2d ops
* Restore the modification of paddle_api_test
* Supports simple model which contains only a relu layer
* Add compiling scripts for XPU
* Fix compiling errors of XPU
* Add comments for XPU LoadModel and BuildModel

ac1b2f9f

18 9月, 2019 1 次提交

fix bias quantize error && fix clang build error (#2049) · 8d6f475e

由 Xiaoyang LI 提交于 9月 18, 2019

* fix gemm_int8, gemv-int8 and conv-int8 math function, add float bias

* change conv impl

* neon int8 kernel support float bias

* arm compute kernel support float bias

* add math_test target

* add tensor utils for testing, fix sgemm ut error

* add gemm_int8 unit test, support float bias

* fix build script

* add conv compute unit test for arm

* fix build script, test=develop

* fix fp32 dw conv3x3s1, test=develop

* add fp32 dw conv3x3s1, test=develop

* add armv7 fp32 dw conv3x3s1, test=develop

* add fp32 depthwise conv3x3s2, test=develop

* fix fp32 conv3x3 depthwise build error, test=develop

* fix gemm_like conv trans weights error, test=develop

* fix int8 depthwise conv3x3 error, test=develop

* turn on all test for arm fp32 conv, test=develop

* fix int8 conv1x1 error

* fix int8 direct conv3x3s1 error, test=develop

* fix int8 direct conv3x3s2, test=develop

* turn on all test for arm int8 conv, test=develop

* fix int8 fc error, change mobilenetv1-int8 ground-truth result to fluid, test=develop

* remove debug info, strip ut binary, test=develop

* fix conv compute error, test=develop

* change Init() to ReInitWhenNeeded(), test=develop

* fix code style, test=develop

* remote engine_test, test=develop

* fix building server tests error, test=develop

* fix sdot clang build error, test=develop

* fix sgemm ut timeout error, test=develop

* fix clang build error, test=develop

* turn off math basic test due to ci time out, test=develop

* fix conv_int8 ut error, test=develop

8d6f475e

12 9月, 2019 1 次提交

add unsqueeze and range op (x2paddle) (#1988) · cca0aec6

由 Wilber 提交于 9月 12, 2019

* add unsqueeze and range op. modify concat op test=develop

* modify exception in range_test_x86

cca0aec6

09 9月, 2019 1 次提交
- J
  add assign_value and hard_sigmoid, add fluid_type (#1983) · 9796c57d
  由 juncaipeng 提交于 9月 09, 2019
```
* add assign_value op, arm kernel and test, add fluid_type, test=develop

* add hard_sigmoid, test=develop
```
  9796c57d
04 9月, 2019 1 次提交
- W
  modify slice op and add slice test (#1944) · 52f933d4
  由 Wilber 提交于 9月 04, 2019
```
* modify slice op and add slice test

* modify slice op bug
```
  52f933d4
02 9月, 2019 1 次提交

Add ops and fix bugs for Faster RCNN (#1942) · cfd5abe5

由 juncaipeng 提交于 9月 02, 2019

* add ops for faster rcnn

* disable test for generate_proposals and roi_align, test=develop

* remove .swp file

* remove log in tensor slice

* finish the unit test for roi_align, test=develop

* add box_clip op and fix tensor slice bug

* remove add four op twice

* rewrite the implement for box_coder and sequence_expand, add faster_rcnn_test, test=develop

* fix test bug of box_clip in x86 server, test=develop

cfd5abe5

29 8月, 2019 3 次提交

Add yolo_box_cuda multiclass_nms_host kernel. (#1908) · 5752dbd7

由 Wilber 提交于 8月 29, 2019

* add yolo_box_compute cuda

* move multiclass_nms(arm) to host

* add lod in scale op

* add yolo_box_cuda cmake config

* modify shuffle_channel_fuse and transpose_softmax_transpose_fuse to support run ssd model. test=develop

* reshape and transpose op don't have xshape output.

* modify yolo_box_compute_cuda, use tensor to manage cuda memory test=develop

* add yolo_box use kernel test=develop

5752dbd7

L

add stack op and add reduce_mean op and their unit tests (#1888) · 8ccd01a6
由 liu zhengxi 提交于 8月 29, 2019

8ccd01a6

ad ops for faster rcnn, including affine_channel, anchor_generator,... · f3035827

由 juncaipeng 提交于 8月 29, 2019

ad ops for faster rcnn, including affine_channel, anchor_generator, generate_proposals and roi_align (#1895)

* add ops for faster rcnn

* disable test for generate_proposals and roi_align, test=develop

* remove .swp file

* remove log in tensor slice

* finish the unit test for roi_align, test=develop

f3035827

28 8月, 2019 1 次提交
- J
  Modify cast op and remove warning in argmax_test (#1894) · 5fe41d5c
  由 juncaipeng 提交于 8月 28, 2019
```
* modify cast op, test=develop

* modify cast op and remove warning in argmax_test, test=develop
```
  5fe41d5c