Commits · 1b74fded360313bba9772c319dcfb4ac2dd80fc0 · PaddlePaddle / Paddle-Lite

Dec 23, 2019: 4 commits
- add sequence_pool_concat fuse and kernel test=develop (#2645) · 1b74fded
  Committed by Wilber on Dec 23, 2019
```
add sequence_pool_concat fuse pass

add fuse kernel
```
- [ARM] add grid_sampler op and ut, test=develop (#2598) · 3723451b
  Committed by yiicy on Dec 23, 2019
- [FIX] model_test and benchmark remove x86 targets (#2600) · df160500
  Committed by yiicy on Dec 23, 2019
- fix_typo in set_cpu_math_library_num_threads, test=develop (#2642) · f3ef8f52
  Committed by liu zhengxi on Dec 23, 2019
Dec 21, 2019: 1 commit
- [XPU] add layer_norm bridge and unit test (#2640) · 4dd6a4b8
  Committed by zhupengyang on Dec 21, 2019
```
test=develop
```
Dec 20, 2019: 3 commits
- [XPU] add reshape bridge and unit test (#2621) · a13c592d
  Committed by zhupengyang on Dec 20, 2019
```
test=develop
```
- add var_conv_2d_relu pass test=develop (#2631) · 8304bc84
  Committed by Wilber on Dec 20, 2019
```
add var_conv_2d + relu fuse pass
```
- [XPU] add transpose bridge and unit test (#2630) · b53ece7a
  Committed by zhupengyang on Dec 20, 2019
```
* [XPU] add transpose bridge and unit test

test=develop
```
Dec 19, 2019: 5 commits
- fix cmake flaws, test=develop (#2468) · 23ae767e
  Committed by 石晓伟 on Dec 19, 2019
- optimize cuda kernel test=develop (#2628) · 09aa15a5
  Committed by Wilber on Dec 19, 2019
```
* optimize content-dnn cuda kernel
```
- feature: update fpga kernel patch (#2627) · 3f35879b
  Committed by xiaogang on Dec 19, 2019
```
* feature: update fpga kernel patch
```
- fix: fix sgemm_c4 bug when n=1 (#2615) · c1d397e3
  Committed by TianXiaogang on Dec 19, 2019
- [ARM] change global pooling choose kernel policy, test=develop (#2602) · 49f03648
  Committed by yiicy on Dec 19, 2019
```
* [ARM] change global pooling choose kernel policy, test=develop
```
Dec 18, 2019: 2 commits

Support Mask RCNN2 (#2588) · d1b7aec5
Committed by juncaipeng on Dec 18, 2019
```
* Support Mask RCNN2 (#2588)
```

Add set_cpu_math_library_math_threads for CxxConfig (#2592) · b8992673

Committed by liu zhengxi on Dec 18, 2019

* add set_cpu_math_library_math_threads for lite x86 platform, test=develop

* update the #if defined and add a condition LITE_WITH_X86, test=develop

* add if not defined LITE_ON_MODEL_OPTIMIZE_TOOL, test=develop

Dec 17, 2019: 5 commits

[lite][arm] add conv+relu6/leakyRelu fusion (#2599) · 3455ab0a
Committed by HappyAngel on Dec 17, 2019

[lite]add some fusion (#2604) · ec8353e8

Committed by HappyAngel on Dec 17, 2019

* add cv image process

* fix arm linux build error

* add LITE_WITH_CV define to make cv, test=develop

* fix cv format, and add description in utils/cv

* delete some meaningless comments, test=develop

* set LITE_WITH_CV=OFF in build.sh, test=develop

* delete cv_enum.h in utils/cv, push the contents in cv_enum.h to paddle_image_preprocess.h, test=develop

* according to reviews to redefine paddle_image_preprocess.h, test=develop

* add detailed note of flipParam, test=develop

* fix format in paddle_image_preprocess.h, test=develop

* fix error when build x86. test=develop

lite_with_X86 does not contain lite_with_cv

* fix cmake error in lite/CMakeLists.txt, missing mkdir cxx, test=develop

* according to review change, test=develop

* change grb to rgb, test=develop

* add elementwise mul constant elimination and deconv+relu, deconv+batchnorm fusion, test=develop

* fix format, test=develop

[ARMLinux] Fix the error that armlinux can not compile (#2612) · 1e9823a0
Committed by guofei on Dec 17, 2019
```
test=develop
```

Develop lite reshape (#2613) · e3808d79

Committed by xiebaiyuan on Dec 17, 2019

* add reshape opencl kernel && optimise conv 1x1 ,test=develop

* add reshape opencl kernel && optimise conv 1x1 &&code style ,test=develop

* add reshape opencl kernel && optimise conv 1x1 &&code style ,test=develop

add fp16 opencl kernel. fix nhwc as imagedefault for opencl layout. test=develop (#2616) · 24c98e60
Committed by Yuan Shuai on Dec 17, 2019

Dec 16, 2019: 4 commits

update fpga KD patch (#2609) · 90e5895a
Committed by TianXiaogang on Dec 16, 2019
```
* fix: update backend fpga patch
```

[LITE][OPENCL] Add relu image2d kernel unit test, Fix conv2d_1x1, relu, layout... · ee177c6b

Committed by Yuan Shuai on Dec 16, 2019

[LITE][OPENCL] Add relu image2d kernel unit test, Fix conv2d_1x1, relu, layout using new Image2D Layout (#2564)

* add 3 layout for opencl image. test=develop

* add relu image2d test. test=develop

[LITE][OPENCL] Add depthwise_conv_3x3 opencl kernel (#2601) · 600c8c20

Committed by Jiaying Zhao on Dec 16, 2019

* [LITE][OPENCL] Add depthwise_conv_3x3 opencl kernel

* [LITE][OPENCL] Add depthwise_conv_3x3 opencl kernel. test=develop

* [LITE][OPENCL] Add Pool opencl kernel. test=develop

update profiler, test=develop (#2607) · af37a14f
Committed by 石晓伟 on Dec 16, 2019
```
* update profiler, test=develop

* warm up times of profiler, test=develop
```

Dec 15, 2019: 1 commit
- optimize search_grnn test=develop (#2608) · dad43f81
  Committed by Wilber on Dec 15, 2019
```
optimize search_grnn
```
Dec 13, 2019: 1 commit
- [LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at... · d5434aa2
  Committed by hong19860320 on Dec 13, 2019
```
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
```
Dec 12, 2019: 3 commits

modify ci_build.sh to ensure the num of adb devices on mac ci (#2597) · d8750966
Committed by huzhiqiang on Dec 12, 2019

[LITE][OPENCL] Add conv2d_1x1 opencl kernel (#2591) · e583a55d

Committed by xiebaiyuan on Dec 12, 2019

* add opencl conv1x1 image impl and unit test pass with relu & bias,
add layout_compute --> buffer2image float32 --> with unit test pass
suite checked test for more situation , test=develop

* add opencl conv1x1 image impl and unit test pass with relu & bias,
add layout_compute --> buffer2image float32 --> with unit test pass
suite checked test for more situation , test=develop

* fix white space cpp lint , test=develop

modify the test of adb deveices num test=develop (#2596) · ae0a7a9d
Committed by huzhiqiang on Dec 12, 2019

Dec 11, 2019: 2 commits
- add winograd f23 implement (#2584) · f99c34c8
  Committed by TianXiaogang on Dec 11, 2019
- [LITE][OPENCL] add 3 data layouts for opencl image2d (#2561) · fbb0d3b5
  Committed by Yuan Shuai on Dec 11, 2019
```
* add 3 layout for opencl image. test=develop
```
Dec 10, 2019: 6 commits

modify code_style of CMakeList.txt to make ci_build_test_server able to work on gcc_4.8.2 · 4ae43560
Committed by huzhiqiang on Dec 10, 2019

modify ci_build.sh to support ci test with actual mobilephone test=develop (#2525) · 381fa7f6
Committed by huzhiqiang on Dec 10, 2019

fix type_target_cast pass. support only copy once for multiple use arg. test=develop (#2572) · 8903c795
Committed by Wilber on Dec 10, 2019
```
For multiple-use parameters, only copy once
```
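
The "only copy once" behaviour described for the type_target_cast fix can be pictured with a small sketch. This is an illustration only, not the pass's actual code: the function name `plan_copies`, the tuple shapes, and the `arg/copy_to_target` naming are all hypothetical. The idea shown is a cache keyed by (argument, target), so that several consumers on the same target share one copy.

```python
def plan_copies(uses, produced_on):
    """Plan cross-target copies so each (argument, target) pair is copied once.

    uses:        list of (op_name, arg_name, target) tuples, one per consumer
    produced_on: dict mapping arg_name -> target where the argument lives
    All names here are hypothetical and only illustrate the caching idea.
    """
    copy_of = {}   # (arg_name, target) -> name of the copied variable
    copies = []    # (source, destination) copy instructions, in order
    rewired = []   # (op_name, variable the op should read)
    for op_name, arg_name, target in uses:
        if produced_on[arg_name] == target:
            # The argument is already on the consumer's target: no copy.
            rewired.append((op_name, arg_name))
            continue
        key = (arg_name, target)
        if key not in copy_of:
            # First consumer on this target: create the copy and remember it.
            copy_of[key] = f"{arg_name}/copy_to_{target}"
            copies.append((arg_name, copy_of[key]))
        # Later consumers on the same target reuse the cached copy.
        rewired.append((op_name, copy_of[key]))
    return copies, rewired


uses = [("fc_0", "w", "cuda"), ("fc_1", "w", "cuda"), ("relu_0", "w", "host")]
copies, rewired = plan_copies(uses, {"w": "host"})
print(copies)
print(rewired)
```

In this example two CUDA consumers read the same host-side argument `w`, yet only one copy instruction is planned, and the host consumer keeps reading the original variable.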

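modify static_kernel_pass to support select the kernel according to input type (#2488) · 7ef0e7fe

Committed by Wilber on Dec 10, 2019

Changes the kernel selection logic: the data type of each lod_tensor is now read from the model file by default, and in the static_kernel_pick pass a kernel whose input and output types exactly match those data types becomes more likely to be selected.

- Read the lod_tensor data type from the model file __model__ into cpp::vardesc

- Add an unordered_map<string, type> field to program and populate it in Program::PrepareWorkspace

- Modify node.h, changing const Type* to Type*, and assign the qualifying type* values during SSAGraph::Build

- Add a new rule to static_kernel_pick_pass: if a kernel's input and output types match the types stored in __model__, then score *= 2.

- Support models that use both the sequence_reverse_float kernel (float inputs and outputs) and the sequence_reverse_int64 kernel (int64 inputs and outputs), so the kernel can be chosen according to the input and output types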
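
The scoring rule in this commit message can be sketched in a few lines. The commit only states that the score is doubled when a kernel's input and output types match the types read from the model, so everything else below is an assumption made for illustration: the `KernelCandidate` class, the `base_score` field (standing in for whatever the other pick rules produce), and the type map keyed by argument name. The real pass is C++ and is not reproduced here.

```python
from dataclasses import dataclass


@dataclass
class KernelCandidate:
    """Hypothetical stand-in for a registered kernel and its declared I/O types."""
    name: str
    input_types: dict    # argument name -> data type string
    output_types: dict   # argument name -> data type string
    base_score: int = 1  # placeholder for the score from the other pick rules


def score_kernel(kernel, var_types):
    """Double the base score when every declared I/O type matches the model's."""
    declared = list(kernel.input_types.items()) + list(kernel.output_types.items())
    all_match = all(var_types.get(arg) == dtype for arg, dtype in declared)
    return kernel.base_score * 2 if all_match else kernel.base_score


def pick_kernel(candidates, var_types):
    """Return the highest-scoring candidate; the first one wins on ties."""
    return max(candidates, key=lambda k: score_kernel(k, var_types))


# Types as they might be read from the model file (assumed values).
var_types = {"X": "int64", "Out": "int64"}
candidates = [
    KernelCandidate("sequence_reverse_float", {"X": "float"}, {"Out": "float"}),
    KernelCandidate("sequence_reverse_int64", {"X": "int64"}, {"Out": "int64"}),
]
chosen = pick_kernel(candidates, var_types)
print(chosen.name)  # prints "sequence_reverse_int64"
```

With int64 inputs and outputs recorded for the variables, the int64 candidate scores 2 and the float candidate scores 1, which mirrors the sequence_reverse case named in the last bullet.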

[ARM] add instance norm op and ut, test=develop (#2578) · 9a3552db
Committed by yiicy on Dec 10, 2019

[NPU] add argamx op bridge and unit test (#2580) · c64036ca
Committed by zhupengyang on Dec 10, 2019
```
test=develop
```

Dec 9, 2019: 3 commits
- fix ios demo build error, test=develop (#2579) · 0c44ac9c
  Committed by yiicy on Dec 9, 2019
- [JAVA API]java tensor api setData and getData support Int type, test=develop (#2583) · 71aa1b49
  Committed by yiicy on Dec 9, 2019
- [NPU] support relu6 (#2582) · a2f981a4
  Committed by zhupengyang on Dec 9, 2019
```
test=develop
```