提交 · d1b7aec5a5bc2448c7975fba9e8ab86d6ee75f84 · PaddlePaddle / Paddle-Lite

18 12月, 2019 2 次提交

J
Support Mask RCNN2 (#2588) · d1b7aec5
由 juncaipeng 提交于 12月 18, 2019
```
* Support Mask RCNN2 (#2588)
```
d1b7aec5

Add set_cpu_math_library_math_threads for CxxConfig (#2592) · b8992673

由 liu zhengxi 提交于 12月 18, 2019

* add set_cpu_math_library_math_threads for lite x86 platform, test=develop

* update the #if defined and add a condition LITE_WITH_X86, test=develop

* add if not defined LITE_ON_MODEL_OPTIMIZE_TOOL, test=develop

b8992673

17 12月, 2019 6 次提交

H

[lite][arm] add conv+relu6/leakyRelu fusion (#2599) · 3455ab0a
由 HappyAngel 提交于 12月 17, 2019

3455ab0a

[lite]add some fusion (#2604) · ec8353e8

由 HappyAngel 提交于 12月 17, 2019

* add cv image process

* fix arm liunx build error

* add LITE_WITH_CV defien to make cv, test=develop

* fix cv format, annd add describe in utils/cv

* delete some Meaningless comments, test=develop

* set LITE_WITH_CV=OFF in build.sh, test=develop

* delete cv_enum.h in utils/cv, push the contents in cv_ennum.h to paddle_image_preprocess.h, test=develop

* according to reviews to redefine paddle_image_preprocess.h, test=develop

* add detailed note of flipParam, test=develop

* fix format in paddle_image_preprocess.h, test=develop

* fix error when build x86. test=develop

lite_with_X86 does not contain lite_with_cv

* fix cmake error in llite/CMakeLists.txt, missing mkdir cxx, test=develop

* according to review change, test=develop

* chang grb to rgb, test=develop

* add elemetnwise mul constant elimination and deconv+relu, deconv+batchnorm fusion, test=develop

* fix format, test=develop

ec8353e8

G
[ARMLinux] Fix the error that armlinux can not compile (#2612) · 1e9823a0
由 guofei 提交于 12月 17, 2019
```
test=develop
```
1e9823a0

Develop lite reshape (#2613) · e3808d79

由 xiebaiyuan 提交于 12月 17, 2019

* add reshape opencl kernel && optimise conv 1x1 ,test=develop

* add reshape opencl kernel && optimise conv 1x1 &&code style ,test=develop

* add reshape opencl kernel && optimise conv 1x1 &&code style ,test=develop

e3808d79

Y

add fp16 opencl kernel. fix nhwc as imagedefault for opencl layout. test=develop (#2616) · 24c98e60
由 Yuan Shuai 提交于 12月 17, 2019

24c98e60
H

change the url_source of readme img into github network (#2614) · 9bb68a8f
由 huzhiqiang 提交于 12月 17, 2019

9bb68a8f

16 12月, 2019 4 次提交

T
update fpga KD patch (#2609) · 90e5895a
由 TianXiaogang 提交于 12月 16, 2019
```
* fix: update backend fpga patch
```
90e5895a

[LITE][OPENCL] Add relu image2d kernel unit test, Fix conv2d_1x1, relu, layout... · ee177c6b

由 Yuan Shuai 提交于 12月 16, 2019

[LITE][OPENCL] Add relu image2d kernel unit test, Fix conv2d_1x1, relu, layout using new Image2D Layout (#2564)

* add 3 layout for opencl image. test=develop

* add relu image2d test. test=develop

ee177c6b

[LITE][OPENCL] Add depthwise_conv_3x3 opencl kernel (#2601) · 600c8c20

由 Jiaying Zhao 提交于 12月 16, 2019

* [LITE][OPENCL] Add depthwise_conv_3x3 opencl kernel

* [LITE][OPENCL] Add depthwise_conv_3x3 opencl kernel. test=develop

* [LITE][OPENCL] Add Pool opencl kernel. test=develop

600c8c20

石
update profiler, test=develop (#2607) · af37a14f
由石晓伟提交于 12月 16, 2019
```
* update profiler, test=develop

* warm up times of profiler, test=develop
```
af37a14f

15 12月, 2019 1 次提交
- W
  optimize search_grnn test=develop (#2608) · dad43f81
  由 Wilber 提交于 12月 15, 2019
```
optimize search_grnn
```
  dad43f81
13 12月, 2019 1 次提交
- H
  [LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at... · d5434aa2
  由 hong19860320 提交于 12月 13, 2019
```
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
```
  d5434aa2
12 12月, 2019 4 次提交

H

modify ci_build.sh to ensure the num of adb devices on mac ci (#2597) · d8750966
由 huzhiqiang 提交于 12月 12, 2019

d8750966

[LITE][OPENCL] Add conv2d_1x1 opencl kernel (#2591) · e583a55d

由 xiebaiyuan 提交于 12月 12, 2019

* add opencl conv1x1 image impl and unit test pass with relu & bias,
add layout_compute --> buffer2image float32 --> with unit test pass
suite checked test for more situation , test=develop

* add opencl conv1x1 image impl and unit test pass with relu & bias,
add layout_compute --> buffer2image float32 --> with unit test pass
suite checked test for more situation , test=develop

* fix white space cpp lint , test=develop

e583a55d

fix 1x1_wrapped crashed in huawei && reinit super image . test = dev… (#2595) · aab5f53f

由 xiebaiyuan 提交于 12月 12, 2019

* fix 1x1_wrapped crashed in huawei && reinit super image  . test = develop

* fix 1x1_wrapped crashed in huawei && reinit super image  . test = mobile

aab5f53f

H

modify the test of adb deveices num test=develop (#2596) · ae0a7a9d
由 huzhiqiang 提交于 12月 12, 2019

ae0a7a9d

11 12月, 2019 2 次提交
- T
  
  add winograd f23 implement (#2584) · f99c34c8
  由 TianXiaogang 提交于 12月 11, 2019
  
  f99c34c8
- Y
  [LITE][OPENCL] add 3 data layouts for opencl image2d (#2561) · fbb0d3b5
  由 Yuan Shuai 提交于 12月 11, 2019
```
* add 3 layout for opencl image. test=develop
```
  fbb0d3b5
10 12月, 2019 7 次提交

H

modify code_style of CMakeList.txt to make ci_build_test_server able to work on gcc_4.8.2 · 4ae43560
由 huzhiqiang 提交于 12月 10, 2019

4ae43560

fix a bug in ewadd in FPGA v2, test=mobile close#2574 (#2575) · 482a2aa8

由 qnqinan 提交于 12月 10, 2019

* update proposal and psroipool kernel file in FPGA V2 track

* update, test=develop

* update FPGA v2 pe cpp file and ew kernel files, test=develop

* fix a bug of sigmoid kernel in FPGA v2 track, test=develop

* fix bugs of concat, reshape and slice op and add usleep in fpga regpoll, test=develop

* add interupt clear operation before op compute in FPGA V2 track, test=develop

* fix a bug in ewadd in FPGA v2, test=mobile

482a2aa8

H

modify ci_build.sh to support ci test with actual mobilephone test=develop (#2525) · 381fa7f6
由 huzhiqiang 提交于 12月 10, 2019

381fa7f6
W
fix type_target_cast pass. support only copy once for multiple use arg. test=develop (#2572) · 8903c795
由 Wilber 提交于 12月 10, 2019
```
For multiple-use parameters, only copy once
```
8903c795

modify static_kernel_pass to support select the kernel according to input type (#2488) · 7ef0e7fe

由 Wilber 提交于 12月 10, 2019

修改了选kernel的逻辑，默认从模型文件中读取出lod_tensor的data type，在static_kernel_pick pass中如果kernel输入输出的类型与读取的data type完全一致，则选择该Kernel的概率增大。

- 增加 从模型文件__model__读取lod_tensor的data type到cpp::vardesc

- program中增加unordered_map<string, type>字段，并在 Program::PrepareWorkspace中对该字段赋值

- 修改了node.h文件，将const Type* 更改为Type*，并在SSAGraph::Build过程中为符合条件的type*赋值

- static_kernel_pick_pass中添加新规则，如果kernel的输入类型输出类型与__model__中存储的类型的一致，则score*=2。

- 支持模型中用到sequence_reverse_float kernel（输入输出均为float）和sequence_reverse_int64 kernel（输入输出均为int64），能够根据输入输出type选kernel

7ef0e7fe

Y

[ARM] add instance norm op and ut, test=develop (#2578) · 9a3552db
由 yiicy 提交于 12月 10, 2019

9a3552db
Z
[NPU] add argamx op bridge and unit test (#2580) · c64036ca
由 zhupengyang 提交于 12月 10, 2019
```
test=develop
```
c64036ca

09 12月, 2019 4 次提交
- Y
  
  fix ios demo build error, test=develop (#2579) · 0c44ac9c
  由 yiicy 提交于 12月 09, 2019
  
  0c44ac9c
- Y
  
  [JAVA API]java tensor api setData and getData support Int type, test=develop (#2583) · 71aa1b49
  由 yiicy 提交于 12月 09, 2019
  
  71aa1b49
- Z
  [NPU] support relu6 (#2582) · a2f981a4
  由 zhupengyang 提交于 12月 09, 2019
```
test=develop
```
  a2f981a4
- H
  Static libraty in tiny pub (#2560) · ea837ec7
  由 huzhiqiang 提交于 12月 09, 2019
```
* add static library in tiny_publish
* move flto and ffunction-sections cmake option into the tiny publish so result of java,cxx,python 
```
  ea837ec7
08 12月, 2019 1 次提交
- L
  
  Add fc op on lite x86 platform (#2568) · d76c529a
  由 liu zhengxi 提交于 12月 08, 2019
  
  d76c529a
07 12月, 2019 2 次提交

Z
[NPU] add unsqueeze op bridge and unit test (#2570) · 3f28e00e
由 zhupengyang 提交于 12月 07, 2019
```
test=develop
```
3f28e00e

Support mask_rcnn (#2484) · c2f72cb3

由 juncaipeng 提交于 12月 07, 2019

* add arm split lod tensor, test=develop

* add arm merge lod tensor, test=develop

* update split merge lod tensor, test=develop

* add reduce_prob op, test=develop

* support mask_rcnn succeed, test=develop

c2f72cb3

06 12月, 2019 1 次提交
- W
  fix fill_constant bug and add int64->int32 cast test=develop (#2566) · e17295cc
  由 Wilber 提交于 12月 06, 2019
```
- fix fill_constant bug.
- cast op support int64_t->int32_t
```
  e17295cc
04 12月, 2019 3 次提交
- W
  update cuda kernels to run content-dnn models test=develop (#2554) · aa67c28e
  由 Wilber 提交于 12月 04, 2019
```
update cuda kernels to run content-dnn model
```
  aa67c28e
- Z
  [cuda] [int8] resnet50 cuda int8 support (#2417) · f7574646
  由 Zhaolong Xing 提交于 12月 04, 2019
```
* init resnet cuda int8 support
test=develop

* refine cuda unit test
test=develop

* add the forgeted file.
test=develop
```
  f7574646
- 石
  
  refactor profile tools, test=develop (#2536) · 8a634b71
  由石晓伟提交于 12月 04, 2019
  
  8a634b71
03 12月, 2019 2 次提交

[Demo] add cxx mobilenetv1-ssd detection demo, test=develop (#2541) · c8b51f82

由 yiicy 提交于 12月 03, 2019

* [Demo] add cxx mobilenetv1-ssd detection demo, test=develop

* add makefile to mobile detection demo, test=develop

* [Demo] add cxx mobilenetv1-ssd detection demo, test=develop

* [demo] fix mobile_detection code style, test=develop

* [Demo] fix demo code style, test=develop

* [Demo] fix detection demo makefile dependency, test=develop

c8b51f82

Z
fix quant dequant fuse pass bug (#2552) · 137d7a6d
由 Zhaolong Xing 提交于 12月 03, 2019
```
test=develop
```
137d7a6d