提交 · 08a3ed12af356b075ff9f8e499610bb6270184e4 · PaddlePaddle / Paddle-Lite

10 3月, 2020 1 次提交
- H
  
  [CORE] Support the fully quantized model for MTK and RK NPU (#3096) · 08a3ed12
  由 hong19860320 提交于 3月 10, 2020
  
  08a3ed12
09 3月, 2020 1 次提交
- Z
  
  avoid reusing non-tensor in memery_optimize_pass (#3111) · e0bf1152
  由 zhupengyang 提交于 3月 09, 2020
  
  e0bf1152
06 3月, 2020 1 次提交
- Z
  [MLU] resnet50 supported on MLU,test=develop (#3087) · 95e7f6f3
  由 zhangshijin 提交于 3月 06, 2020
```
* [MLU] support resnet50 on MLU

* [MLU] support resnet50 on MLU
```
  95e7f6f3
05 3月, 2020 1 次提交
- Y
  [LITE][PASS][OPENCL] Fix memory_resuse for opencl (#3077) · b601d81f
  由 Yuan Shuai 提交于 3月 05, 2020
```
* Fix memory_resuse for opencl. test=develop

* remove useless code. test=develop
```
  b601d81f
03 3月, 2020 1 次提交
- H
  
  [Core] Fix memory_optmize_pass for reshape/reshape2 op with inplace=True (#3045) · 331aaecd
  由 hong19860320 提交于 3月 03, 2020
  
  331aaecd
02 3月, 2020 1 次提交

[LITE][OPENCL] Support video-sr feature using OpenCL FP16 Image (#3049) · 3d06dcfe

由 Yuan Shuai 提交于 3月 02, 2020

* [LITE][OPENCL] Support video-sr feature using OpenCL FP16 Image. test=develop

* optimize image2d_to_buffer_with_post255. test=develop

* add def debug in cl kernel. test=develop

* remove conv image code in conv buffer. test=develop

3d06dcfe

01 3月, 2020 1 次提交
- C
  
  Support quantizing softmax op, test=develop (#3051) · 13568f85
  由 cc 提交于 3月 01, 2020
  
  13568f85
28 2月, 2020 1 次提交
- H
  
  remove API headfile test=develop (#3027) · 35ea6f9c
  由 huzhiqiang 提交于 2月 28, 2020
  
  35ea6f9c
26 2月, 2020 1 次提交
- H
  
  [opencl]add pre_process attribute into layoutop (#3001) · 360b4013
  由 huzhiqiang 提交于 2月 26, 2020
  
  360b4013
24 2月, 2020 2 次提交

石

fix data type of asserts, test=develop (#2787) · 2b6fb9ba
由石晓伟提交于 2月 24, 2020

2b6fb9ba

[LITE][OPENCL] support fp16 for cl_image_converter, layout, activation all... · 8b90a0c7

由 Yuan Shuai 提交于 2月 24, 2020

[LITE][OPENCL] support fp16 for cl_image_converter, layout, activation all OpenCL image kernel. test=develop (#2964)

* [LITE][OPENCL] support fp16 for cl_image_converter, layout, activation image kernel. test=develop

* add conv, depthwise and UT. test=develop

* add pool, conv, nearest_interp kernel. test=develop

* support fp16 for scale, reshape, concat, fc buffer opencl kernel. test=develop

* refactor for mul opencl buffer kernel. test=develop

* support fp16 for elementwise_mul opecl image kernel. test=develop

* support fp16 for elementwise_mul opencl image kernel. test=develop

* support fp16 for ele_add, fuse_ele_add_act opencl kernel. test=develop

* rename io_copy. test=develop

* mobilenetv1,v2 passed on 855. test=develop

* fix opt for opencl. test=develop

8b90a0c7

22 2月, 2020 1 次提交
- H
  
  【arm】fix pooling no-equal padding problem (#2956) · 12a17969
  由 HappyAngel 提交于 2月 22, 2020
  
  12a17969
21 2月, 2020 1 次提交
- H
  
  [NPU][XPU][BM] Remove the dependencies from X86 and ARM kernels (#2963) · 294375f9
  由 hong19860320 提交于 2月 21, 2020
  
  294375f9
20 2月, 2020 1 次提交
- C
  skip fusing quantized conv2d + relu6 for now, test=develop (#2952) · 9c561663
  由 cc 提交于 2月 20, 2020
```
skip fusing quantized conv2d + relu6 for now
```
  9c561663
16 2月, 2020 1 次提交
- H
  
  【opt fix】change the optimized_model name of opt (#2892) · 0961a938
  由 huzhiqiang 提交于 2月 16, 2020
  
  0961a938
14 2月, 2020 1 次提交
- X
  fix: fix fpga run the feed/fetch op (#2868) · 27ec5deb
  由 xiaogang 提交于 2月 14, 2020
```
fix fpga lite_tensor compile bug
     add fake quantize_abs_max op
     test=develop
```
  27ec5deb
13 2月, 2020 1 次提交
- H
  
  【MODEL UPDATE】: combine params and model (#2800) · 63eca0a9
  由 huzhiqiang 提交于 2月 13, 2020
  
  63eca0a9
12 2月, 2020 1 次提交

[arm] add conv+relu6/leakyRelu in conv_activation (#2833) · 3d0b463a

由 HappyAngel 提交于 2月 12, 2020

* fix con+relu6/leakyRelu fusion in Fp32, test=develop

* note m=397 in sgemv_int8 ut, test=develop

* fix ios build error. test=develop

3d0b463a

06 2月, 2020 1 次提交

Support weight quantization (#2791) · 6329a9a2

由 juncaipeng 提交于 2月 06, 2020

* optimize quant_dequant_fuse_pass, test=develop

* update, test=develop

* update, test=develop

* fix bug for accessing the removed node, test=develop

* set the bias of int8 conv as float, test=develop

* support weight quantization, test=develop

* up, test=develop

* up, test=develop

* up, test=develop

6329a9a2

22 1月, 2020 1 次提交
- H
  
  fix optimize_tool bug (#2779) · 124c43a0
  由 HappyAngel 提交于 1月 22, 2020
  
  124c43a0
16 1月, 2020 1 次提交
- H
  [arm] add conv_5x5s2_dw to support any padding (#2770) · c35d8e14
  由 HappyAngel 提交于 1月 16, 2020
```
1. add conv_5x5s2_dw to support any padding
2. add 1x1s2pooling impl
3. fix conv dw 3x3 s1p01 bug
```
  c35d8e14
14 1月, 2020 1 次提交
- Support bitman backend,test=develop (#2761) · 14811017
  由 myq406450149 提交于 1月 14, 2020
```
* Support bitman backend
```
  14811017
09 1月, 2020 1 次提交
- W
  temporarily remove x86 fuse test=develop (#2742) · b30dc65b
  由 Wilber 提交于 1月 09, 2020
```
* temporarily remove x86 fuse test=develop

* remove useless logs test=develop
```
  b30dc65b
06 1月, 2020 1 次提交
- 石
  
  fix build errors, test=develop (#2728) · 947cda26
  由石晓伟提交于 1月 06, 2020
  
  947cda26
03 1月, 2020 2 次提交
- W
  temporarily remove cuda fc fuse because we don't support cuda fc now. test=develop (#2715) · a1527e80
  由 Wilber 提交于 1月 03, 2020
```
temporarily remove cuda fc fuse because we don't support cuda fc now
```
  a1527e80
- H
  
  [LITE][NPU][XPU] Fix the data feeding of the input tensors in subgraph_pass_test (#2714) · e4aa194b
  由 hong19860320 提交于 1月 03, 2020
  
  e4aa194b
02 1月, 2020 1 次提交
- G
  [X86] Enhance fc_fuse_pass to enable fusing relu to fc_op (#2701) · 73450636
  由 GaoWei8 提交于 1月 02, 2020
```
* Enhance fc_fuse_pass to enable fusing relu to fc_op
test=develop

* restrict fusing relu in x86
test=develop
```
  73450636
31 12月, 2019 1 次提交

[LITE][NPU][XPU] Refine the registration and implementation of op bridges (#2700) · a29c84a2

由 hong19860320 提交于 12月 31, 2019

* Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU.

* Refine the registration of op bridges and make it similar to the registration of op and kernel.

* Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor.

* Add the unit test of the scale and softmax op bridge for NPU.

a29c84a2

24 12月, 2019 1 次提交

[LITE][NPU][XPU] Support multiple types for XPU and NPU op bridges (#2646) · 05da0c72

由 hong19860320 提交于 12月 24, 2019

* Support multiple types for XPU and NPU op bridges

* Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT

* Fix the definition of lookup_table kernel for X86

05da0c72

23 12月, 2019 1 次提交
- W
  add sequence_pool_concat fuse and kernel test=develop (#2645) · 1b74fded
  由 Wilber 提交于 12月 23, 2019
```
add sequence_pool_concat fuse pass

add fuse kernel
```
  1b74fded
20 12月, 2019 1 次提交
- W
  add var_conv_2d_relu pass test=develop (#2631) · 8304bc84
  由 Wilber 提交于 12月 20, 2019
```
add var_conv_2d + relu fuse pass
```
  8304bc84
18 12月, 2019 1 次提交
- J
  Support Mask RCNN2 (#2588) · d1b7aec5
  由 juncaipeng 提交于 12月 18, 2019
```
* Support Mask RCNN2 (#2588)
```
  d1b7aec5
17 12月, 2019 2 次提交

[lite]add some fusion (#2604) · ec8353e8

由 HappyAngel 提交于 12月 17, 2019

* add cv image process

* fix arm liunx build error

* add LITE_WITH_CV defien to make cv, test=develop

* fix cv format, annd add describe in utils/cv

* delete some Meaningless comments, test=develop

* set LITE_WITH_CV=OFF in build.sh, test=develop

* delete cv_enum.h in utils/cv, push the contents in cv_ennum.h to paddle_image_preprocess.h, test=develop

* according to reviews to redefine paddle_image_preprocess.h, test=develop

* add detailed note of flipParam, test=develop

* fix format in paddle_image_preprocess.h, test=develop

* fix error when build x86. test=develop

lite_with_X86 does not contain lite_with_cv

* fix cmake error in llite/CMakeLists.txt, missing mkdir cxx, test=develop

* according to review change, test=develop

* chang grb to rgb, test=develop

* add elemetnwise mul constant elimination and deconv+relu, deconv+batchnorm fusion, test=develop

* fix format, test=develop

ec8353e8

G
[ARMLinux] Fix the error that armlinux can not compile (#2612) · 1e9823a0
由 guofei 提交于 12月 17, 2019
```
test=develop
```
1e9823a0

13 12月, 2019 1 次提交
- H
  [LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at... · d5434aa2
  由 hong19860320 提交于 12月 13, 2019
```
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
```
  d5434aa2
10 12月, 2019 2 次提交

W
fix type_target_cast pass. support only copy once for multiple use arg. test=develop (#2572) · 8903c795
由 Wilber 提交于 12月 10, 2019
```
For multiple-use parameters, only copy once
```
8903c795

modify static_kernel_pass to support select the kernel according to input type (#2488) · 7ef0e7fe

由 Wilber 提交于 12月 10, 2019

修改了选kernel的逻辑，默认从模型文件中读取出lod_tensor的data type，在static_kernel_pick pass中如果kernel输入输出的类型与读取的data type完全一致，则选择该Kernel的概率增大。

- 增加 从模型文件__model__读取lod_tensor的data type到cpp::vardesc

- program中增加unordered_map<string, type>字段，并在 Program::PrepareWorkspace中对该字段赋值

- 修改了node.h文件，将const Type* 更改为Type*，并在SSAGraph::Build过程中为符合条件的type*赋值

- static_kernel_pick_pass中添加新规则，如果kernel的输入类型输出类型与__model__中存储的类型的一致，则score*=2。

- 支持模型中用到sequence_reverse_float kernel（输入输出均为float）和sequence_reverse_int64 kernel（输入输出均为int64），能够根据输入输出type选kernel

7ef0e7fe

03 12月, 2019 1 次提交
- Z
  fix quant dequant fuse pass bug (#2552) · 137d7a6d
  由 Zhaolong Xing 提交于 12月 03, 2019
```
test=develop
```
  137d7a6d
29 11月, 2019 2 次提交
- Y
  [LITE][PASS] Fix static kernel pick pass, if op is not int8, but kernel is... · ddce609e
  由 Yuan Shuai 提交于 11月 29, 2019
```
[LITE][PASS] Fix static kernel pick pass, if op is not int8, but kernel is int8. test=develop (#2526)
```
  ddce609e
- Z
  [NPU] add reduce_mean op bridge and unit test (#2522) · c809321d
  由 zhupengyang 提交于 11月 29, 2019
```
* [NPU] add reduce_mean op bridge and unit test

test=develop

* refine xpu_pass place order; add bridges use

test=develop
```
  c809321d