提交 · 6135fd4a79d2f757c54124e794088fe2fa294c56 · PaddlePaddle / Paddle-Lite

08 1月, 2020 3 次提交
- Z
  [NPU] enhance unittest for shuffle_channel, unsqueeze, pool (#2730) · 08afd3aa
  由 zhupengyang 提交于 1月 08, 2020
```
* [NPU] enhance unittest for shuffle_channel, unsqueeze, pool

test=develop
```
  08afd3aa
- H
  fix the issue that: loading model consumes too much time test=decelop (#2726) · 8e7906d0
  由 huzhiqiang 提交于 1月 08, 2020
```
* fix the issue that: loading model consumes too much time test=decelop
```
  8e7906d0
- X
  
  fix: fix op dropout and fc infershape lod bug (#2732) · 4430df40
  由 xiaogang 提交于 1月 08, 2020
  
  4430df40
07 1月, 2020 3 次提交
- Y
  
  add yolov3 demo, test=develop (#2731) · b918067c
  由 yiicy 提交于 1月 07, 2020
  
  b918067c
- Z
  [NPU] add host kernels, enhance reshape ut (#2733) · 8fef7532
  由 zhupengyang 提交于 1月 07, 2020
```
test=develop
```
  8fef7532
- H
  
  Add CI Task: IOS Compiling Task (#2725) · 2ad0e84a
  由 huzhiqiang 提交于 1月 07, 2020
  
  2ad0e84a
06 1月, 2020 2 次提交
- 石
  
  fix build errors, test=develop (#2728) · 947cda26
  由石晓伟提交于 1月 06, 2020
  
  947cda26
- L
  [X86] Alter the api name to set_x86_math_library_math_threads (#2720) · 7950edd7
  由 liu zhengxi 提交于 1月 06, 2020
```
* alter the api name from cpu to x86, test=develop

* correct the step_rnn model test, test=develop
```
  7950edd7
03 1月, 2020 3 次提交
- Z
  [NPU] enhance unittest for bn, transpose (#2716) · 59d079e8
  由 zhupengyang 提交于 1月 03, 2020
```
test=develop
```
  59d079e8
- W
  temporarily remove cuda fc fuse because we don't support cuda fc now. test=develop (#2715) · a1527e80
  由 Wilber 提交于 1月 03, 2020
```
temporarily remove cuda fc fuse because we don't support cuda fc now
```
  a1527e80
- H
  
  [LITE][NPU][XPU] Fix the data feeding of the input tensors in subgraph_pass_test (#2714) · e4aa194b
  由 hong19860320 提交于 1月 03, 2020
  
  e4aa194b
02 1月, 2020 3 次提交
- 石
  
  Jit macro definition ambiguity fix, test=develop (#2713) · 0176d4bf
  由石晓伟提交于 1月 02, 2020
  
  0176d4bf
- G
  [X86] Enhance fc_fuse_pass to enable fusing relu to fc_op (#2701) · 73450636
  由 GaoWei8 提交于 1月 02, 2020
```
* Enhance fc_fuse_pass to enable fusing relu to fc_op
test=develop

* restrict fusing relu in x86
test=develop
```
  73450636
- H
  
  [LITE][XPU] Supporting llvm and xpu device target (#2711) · 8c0397c6
  由 hong19860320 提交于 1月 02, 2020
  
  8c0397c6
31 12月, 2019 3 次提交

X86 and cuda compile simutaneously cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo... · f1cedb8f

由 Wilber 提交于 12月 31, 2019

X86 and cuda compile simutaneously cmake ..  -DCMAKE_BUILD_TYPE=RelWithDebInfo  -DWITH_MKL=ON           -DLITE_WITH_CUDA=ON           -DWITH_MKLDNN=OFF           -DLITE_WITH_X86=ON           -DLITE_WITH_PROFILE=OFF          -DWITH_LITE=OFF           -DLITE_WITH_LIGHT_WEIGHT_FRAMEWORK=OFF           -DWITH_PYTHON=OFF           -DWITH_TESTING=ON           -DLITE_WITH_ARM=OFF           -DLITE_ON_TINY_PUBLISH=OFF           -DCUDNN_ROOT=/usr/local/cudnn/           -DLITE_BUILD_EXTRA=ON (#2708)

x86 and cuda compile simutaneously

f1cedb8f

Z
[XPU] bn unit test (#2706) · bc6d5adc
由 zhupengyang 提交于 12月 31, 2019
```
test=develop
```
bc6d5adc

[LITE][NPU][XPU] Refine the registration and implementation of op bridges (#2700) · a29c84a2

由 hong19860320 提交于 12月 31, 2019

* Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU.

* Refine the registration of op bridges and make it similar to the registration of op and kernel.

* Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor.

* Add the unit test of the scale and softmax op bridge for NPU.

a29c84a2

30 12月, 2019 2 次提交
- J
  Fix yolo_box bug (#2704) · bc5bd154
  由 juncaipeng 提交于 12月 30, 2019
```
* fix yolov3 bug when run several times, test=develop
```
  bc5bd154
- Y
  Optimize the execution of RuntimeProgram by saving the bool whether the op is... · bb1cf7ff
  由 Yiqun Liu 提交于 12月 30, 2019
```
Optimize the execution of RuntimeProgram by saving the bool whether the op is feed/fetch op. (#2703)

test=develop
```
  bb1cf7ff
28 12月, 2019 1 次提交
- H
  
  Upgrade of Model_optimize_tool (#2624) · 4300ef75
  由 huzhiqiang 提交于 12月 28, 2019
  
  4300ef75
27 12月, 2019 4 次提交
- 石
  
  update profiler, test=develop (#2644) · 9171b70e
  由石晓伟提交于 12月 27, 2019
  
  9171b70e
- Y
  
  move flatten op from extra to basic, test=develop (#2659) · ba32906a
  由 yiicy 提交于 12月 27, 2019
  
  ba32906a
- H
  
  [LITE][NPU][XPU] Add kernel context to NPU/XPU subgraph engine (#2686) · 1e10b471
  由 hong19860320 提交于 12月 27, 2019
  
  1e10b471
- H
  remove test_models ci projects, because these project hass been removed in ci... · ad1dfbf2
  由 huzhiqiang 提交于 12月 27, 2019
```
remove test_models ci projects, because these project hass been removed in ci test test=develop (#2669)
```
  ad1dfbf2
26 12月, 2019 3 次提交
- W
  fix fluid-lite-subgraph x86 compile error test=develop (#2682) · 53a5906c
  由 Wilber 提交于 12月 26, 2019
```
-fix fluid-lite-subgraph x86 compile error
    - Replace FLAGS with environment variables
```
  53a5906c
- X
  add multi_thread ut (#2677) · 19c08de2
  由 xiaogang 提交于 12月 26, 2019
```
* feat: add multi_thread ut
```
  19c08de2
- Z
  [XPU] mul unittest (#2676) · 6bce0133
  由 zhupengyang 提交于 12月 26, 2019
```
test=develop
```
  6bce0133
25 12月, 2019 6 次提交

J
fix mask rcnn error when run twice, test=develop (#2675) · cd49b0a3
由 juncaipeng 提交于 12月 25, 2019
```
add clear for tensor
```
cd49b0a3
J
fix op inputs and outputs type (#2647) · 168ce9a9
由 juncaipeng 提交于 12月 25, 2019
```
* fix op inputs and outputs type, test=develop
```
168ce9a9
W
optimize softmax cuda kernel test=develop (#2660) · 8f593443
由 Wilber 提交于 12月 25, 2019
```
optimize softmax cuda kernel
```
8f593443
J

add benchmark in cmakefile, test=develop (#2663) · 00fee283
由 juncaipeng 提交于 12月 25, 2019

00fee283
H

[LITE][XPU] Fix matmul op bridge (#2668) · 3fe5cddf
由 hong19860320 提交于 12月 25, 2019

3fe5cddf

[X86] Polish the implementation of fc and imporve the unittest (#2656) · 28481458

由 Yiqun Liu 提交于 12月 25, 2019

* Remove GEMM padding in fc_compute.
test=develop

* Write a common ParallelFor function to run the for loop in parallel.

* Add the codes of padding GEMM back in fc.

* Refine the code of fc when padding_weight is false to avoid the definition of temporary Tensor.

* Refine the unit test of fc and add testing case of padding and parallel.
test=develop

* Enable more test cases in common fc unittest, including padding and parallel for x86 target.

* Remove the fc test under kernels/x86.
test=develop

* Disable relu in test of fc for non-x86 target.
test=develop

* Change the eps of arm.
test=develop

28481458

24 12月, 2019 7 次提交
- Z
  
  [XPU] matmul bridge and unit test (#2666) · d345a7fc
  由 zhupengyang 提交于 12月 24, 2019
  
  d345a7fc
- H
  
  [LITE][XPU] Fix dropout op bridge and unit test for BERT (#2665) · d444ecbf
  由 hong19860320 提交于 12月 24, 2019
  
  d444ecbf
- H
  
  conclude model_test in CI into Android test (#2639) · 18f9ea1b
  由 huzhiqiang 提交于 12月 24, 2019
  
  18f9ea1b
- H
  [LITE][NPU][XPU] Support multiple types for XPU and NPU op bridges (#2646) · 05da0c72
  由 hong19860320 提交于 12月 24, 2019
```
* Support multiple types for XPU and NPU op bridges

* Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT

* Fix the definition of lookup_table kernel for X86
```
  05da0c72
- Y
  
  [ARM] multiclass_nms op add index output, test=develop (#2654) · e1c4adfd
  由 yiicy 提交于 12月 24, 2019
  
  e1c4adfd
- Z
  [XPU] add dropout bridge and unit test (#2650) · d904c9dd
  由 zhupengyang 提交于 12月 24, 2019
```
test=develop
```
  d904c9dd
- Z
  fix npu compile (#2658) · d4a89129
  由 zhupengyang 提交于 12月 24, 2019
```
test=develop
```
  d4a89129