提交 · fa396e0d7aa7ec6f4984714f364ca7a6d96bc390 · PaddlePaddle / Paddle-Lite

17 2月, 2020 1 次提交
- G
  Add reduce sum op test (#2899) · fa396e0d
  由 GaoWei8 提交于 2月 17, 2020
```
* Add reduce sum op test
test=develop
```
  fa396e0d
24 1月, 2020 1 次提交
- Z
  
  [NPU] clean code (#2798) · 69ad4b80
  由 zhupengyang 提交于 1月 24, 2020
  
  69ad4b80
16 1月, 2020 1 次提交
- Z
  
  [NPU] enhance conv_transpose and ut (#2773) · d2fb7f8f
  由 zhupengyang 提交于 1月 16, 2020
  
  d2fb7f8f
15 1月, 2020 1 次提交
- H
  
  [LITE][NPU] Add layer_norm op bridge (#2767) · 2ac5fe33
  由 hong19860320 提交于 1月 15, 2020
  
  2ac5fe33
14 1月, 2020 3 次提交
- H
  [arm]add gemm + relu6/leakyrelu fusion (#2674) · c0af965c
  由 HappyAngel 提交于 1月 14, 2020
```
add gemm + relu6/leakyrelu fusion
```
  c0af965c
- Z
  [NPU] enhance concat, nearest_interp, bilinear_interp ut (#2764) · 7a8118b0
  由 zhupengyang 提交于 1月 14, 2020
```
- enhance interp InferShape
```
  7a8118b0
- Support bitman backend,test=develop (#2761) · 14811017
  由 myq406450149 提交于 1月 14, 2020
```
* Support bitman backend
```
  14811017
13 1月, 2020 2 次提交
- Z
  
  [NPU] enhance conv2d ut (#2753) · 1816f57f
  由 zhupengyang 提交于 1月 13, 2020
  
  1816f57f
- H
  [LITE][NPU] Add instance_norm op bridge and unit test, refine the registration... · 91f0ef0b
  由 hong19860320 提交于 1月 13, 2020
```
[LITE][NPU] Add instance_norm op bridge and unit test, refine the registration of op bridges (#2747)
```
  91f0ef0b
10 1月, 2020 1 次提交
- Z
  
  [XPU] cast op bridge and ut (#2738) · e5c62f96
  由 zhupengyang 提交于 1月 10, 2020
  
  e5c62f96
09 1月, 2020 1 次提交
- Z
  
  [NPU] dropout op bridge and ut (#2745) · b678e43c
  由 zhupengyang 提交于 1月 09, 2020
  
  b678e43c
08 1月, 2020 1 次提交
- Z
  [NPU] enhance unittest for shuffle_channel, unsqueeze, pool (#2730) · 08afd3aa
  由 zhupengyang 提交于 1月 08, 2020
```
* [NPU] enhance unittest for shuffle_channel, unsqueeze, pool

test=develop
```
  08afd3aa
07 1月, 2020 1 次提交
- Z
  [NPU] add host kernels, enhance reshape ut (#2733) · 8fef7532
  由 zhupengyang 提交于 1月 07, 2020
```
test=develop
```
  8fef7532
03 1月, 2020 1 次提交
- Z
  [NPU] enhance unittest for bn, transpose (#2716) · 59d079e8
  由 zhupengyang 提交于 1月 03, 2020
```
test=develop
```
  59d079e8
31 12月, 2019 3 次提交

X86 and cuda compile simutaneously cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo... · f1cedb8f

由 Wilber 提交于 12月 31, 2019

X86 and cuda compile simutaneously cmake ..  -DCMAKE_BUILD_TYPE=RelWithDebInfo  -DWITH_MKL=ON           -DLITE_WITH_CUDA=ON           -DWITH_MKLDNN=OFF           -DLITE_WITH_X86=ON           -DLITE_WITH_PROFILE=OFF          -DWITH_LITE=OFF           -DLITE_WITH_LIGHT_WEIGHT_FRAMEWORK=OFF           -DWITH_PYTHON=OFF           -DWITH_TESTING=ON           -DLITE_WITH_ARM=OFF           -DLITE_ON_TINY_PUBLISH=OFF           -DCUDNN_ROOT=/usr/local/cudnn/           -DLITE_BUILD_EXTRA=ON (#2708)

x86 and cuda compile simutaneously

f1cedb8f

Z
[XPU] bn unit test (#2706) · bc6d5adc
由 zhupengyang 提交于 12月 31, 2019
```
test=develop
```
bc6d5adc

[LITE][NPU][XPU] Refine the registration and implementation of op bridges (#2700) · a29c84a2

由 hong19860320 提交于 12月 31, 2019

* Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU.

* Refine the registration of op bridges and make it similar to the registration of op and kernel.

* Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor.

* Add the unit test of the scale and softmax op bridge for NPU.

a29c84a2

26 12月, 2019 1 次提交
- Z
  [XPU] mul unittest (#2676) · 6bce0133
  由 zhupengyang 提交于 12月 26, 2019
```
test=develop
```
  6bce0133
24 12月, 2019 3 次提交
- H
  [LITE][NPU][XPU] Support multiple types for XPU and NPU op bridges (#2646) · 05da0c72
  由 hong19860320 提交于 12月 24, 2019
```
* Support multiple types for XPU and NPU op bridges

* Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT

* Fix the definition of lookup_table kernel for X86
```
  05da0c72
- Z
  [XPU] add dropout bridge and unit test (#2650) · d904c9dd
  由 zhupengyang 提交于 12月 24, 2019
```
test=develop
```
  d904c9dd
- Z
  [XPU] elementwise_add, softmax unit test (#2653) · 64d01cb9
  由 zhupengyang 提交于 12月 24, 2019
```
* [XPU] elementwise_add unit test

* [XPU] softmax unit test

test=develop
```
  64d01cb9
23 12月, 2019 1 次提交
- Y
  
  [ARM] add grid_sampler op and ut, test=develop (#2598) · 3723451b
  由 yiicy 提交于 12月 23, 2019
  
  3723451b
21 12月, 2019 1 次提交
- Z
  [XPU] add layer_norm bridge and unit test (#2640) · 4dd6a4b8
  由 zhupengyang 提交于 12月 21, 2019
```
test=develop
```
  4dd6a4b8
20 12月, 2019 2 次提交
- Z
  [XPU] add reshape bridge and unit test (#2621) · a13c592d
  由 zhupengyang 提交于 12月 20, 2019
```
test=develop
```
  a13c592d
- Z
  [XPU] add transpose bridge and unit test (#2630) · b53ece7a
  由 zhupengyang 提交于 12月 20, 2019
```
* [XPU] add transpose bridge and unit test

test=develop
```
  b53ece7a
13 12月, 2019 1 次提交
- H
  [LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at... · d5434aa2
  由 hong19860320 提交于 12月 13, 2019
```
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
```
  d5434aa2
10 12月, 2019 1 次提交
- Y
  
  [ARM] add instance norm op and ut, test=develop (#2578) · 9a3552db
  由 yiicy 提交于 12月 10, 2019
  
  9a3552db
07 12月, 2019 1 次提交

Support mask_rcnn (#2484) · c2f72cb3

由 juncaipeng 提交于 12月 07, 2019

* add arm split lod tensor, test=develop

* add arm merge lod tensor, test=develop

* update split merge lod tensor, test=develop

* add reduce_prob op, test=develop

* support mask_rcnn succeed, test=develop

c2f72cb3

16 11月, 2019 1 次提交
- H
  
  [LITE][X86] Add search_aligned_mat_mul and search_seq_fc op for X86 (#2428) · 78f76834
  由 hong19860320 提交于 11月 16, 2019
  
  78f76834
12 11月, 2019 1 次提交
- J
  Upgrade concat and unsqueeze, test=develop (#2378) · 26470600
  由 juncaipeng 提交于 11月 12, 2019
```
* update concat and unsqueeze, test=develop
```
  26470600
07 11月, 2019 1 次提交

check arm kernels type to make sure all_library_links work normally (#2386) · 916e80c2

由 huzhiqiang 提交于 11月 06, 2019

We have changed 11 arm_kernels into extra type in #2347 , which has caused test_compiling failure. In this PR , we move their 11 related arm_kernel_test into build_extra=ON

916e80c2

28 10月, 2019 1 次提交

[LITE][XPU] initial support for XPU (#2202) · 06d058fe

由 hong19860320 提交于 10月 28, 2019

* Initial support for XPU
* Fix compiling errors of XPU
* Move XPU op kernel bridges from backends to kernels to fix deps order
* Change the namespace and directory of XPU bridges
* Add XPU SDK
* Fix header files and namespace of XPU SDK
* Add unit tests for relu and conv2d ops
* Restore the modification of paddle_api_test
* Supports simple model which contains only a relu layer
* Add compiling scripts for XPU
* Fix compiling errors of XPU
* Add comments for XPU LoadModel and BuildModel

06d058fe

18 9月, 2019 1 次提交

fix bias quantize error && fix clang build error (#2049) · 81dffbe8

由 Xiaoyang LI 提交于 9月 18, 2019

* fix gemm_int8, gemv-int8 and conv-int8 math function, add float bias

* change conv impl

* neon int8 kernel support float bias

* arm compute kernel support float bias

* add math_test target

* add tensor utils for testing, fix sgemm ut error

* add gemm_int8 unit test, support float bias

* fix build script

* add conv compute unit test for arm

* fix build script, test=develop

* fix fp32 dw conv3x3s1, test=develop

* add fp32 dw conv3x3s1, test=develop

* add armv7 fp32 dw conv3x3s1, test=develop

* add fp32 depthwise conv3x3s2, test=develop

* fix fp32 conv3x3 depthwise build error, test=develop

* fix gemm_like conv trans weights error, test=develop

* fix int8 depthwise conv3x3 error, test=develop

* turn on all test for arm fp32 conv, test=develop

* fix int8 conv1x1 error

* fix int8 direct conv3x3s1 error, test=develop

* fix int8 direct conv3x3s2, test=develop

* turn on all test for arm int8 conv, test=develop

* fix int8 fc error, change mobilenetv1-int8 ground-truth result to fluid, test=develop

* remove debug info, strip ut binary, test=develop

* fix conv compute error, test=develop

* change Init() to ReInitWhenNeeded(), test=develop

* fix code style, test=develop

* remote engine_test, test=develop

* fix building server tests error, test=develop

* fix sdot clang build error, test=develop

* fix sgemm ut timeout error, test=develop

* fix clang build error, test=develop

* turn off math basic test due to ci time out, test=develop

* fix conv_int8 ut error, test=develop

81dffbe8

12 9月, 2019 1 次提交

add unsqueeze and range op (x2paddle) (#1988) · 3c08f676

由 Wilber 提交于 9月 12, 2019

* add unsqueeze and range op. modify concat op test=develop

* modify exception in range_test_x86

3c08f676

09 9月, 2019 1 次提交
- J
  add assign_value and hard_sigmoid, add fluid_type (#1983) · 92eeabeb
  由 juncaipeng 提交于 9月 09, 2019
```
* add assign_value op, arm kernel and test, add fluid_type, test=develop

* add hard_sigmoid, test=develop
```
  92eeabeb
04 9月, 2019 1 次提交
- W
  modify slice op and add slice test (#1944) · cfc7af76
  由 Wilber 提交于 9月 04, 2019
```
* modify slice op and add slice test

* modify slice op bug
```
  cfc7af76
02 9月, 2019 1 次提交

Add ops and fix bugs for Faster RCNN (#1942) · 635b4958

由 juncaipeng 提交于 9月 02, 2019

* add ops for faster rcnn

* disable test for generate_proposals and roi_align, test=develop

* remove .swp file

* remove log in tensor slice

* finish the unit test for roi_align, test=develop

* add box_clip op and fix tensor slice bug

* remove add four op twice

* rewrite the implement for box_coder and sequence_expand, add faster_rcnn_test, test=develop

* fix test bug of box_clip in x86 server, test=develop

635b4958

29 8月, 2019 3 次提交

Add yolo_box_cuda multiclass_nms_host kernel. (#1908) · de43e479

由 Wilber 提交于 8月 29, 2019

* add yolo_box_compute cuda

* move multiclass_nms(arm) to host

* add lod in scale op

* add yolo_box_cuda cmake config

* modify shuffle_channel_fuse and transpose_softmax_transpose_fuse to support run ssd model. test=develop

* reshape and transpose op don't have xshape output.

* modify yolo_box_compute_cuda, use tensor to manage cuda memory test=develop

* add yolo_box use kernel test=develop

de43e479

L

add stack op and add reduce_mean op and their unit tests (#1888) · 20001636
由 liu zhengxi 提交于 8月 29, 2019

20001636

ad ops for faster rcnn, including affine_channel, anchor_generator,... · 53b05ce8

由 juncaipeng 提交于 8月 29, 2019

ad ops for faster rcnn, including affine_channel, anchor_generator, generate_proposals and roi_align (#1895)

* add ops for faster rcnn

* disable test for generate_proposals and roi_align, test=develop

* remove .swp file

* remove log in tensor slice

* finish the unit test for roi_align, test=develop

53b05ce8