提交 · 381fa7f66f252b0628c97205df3669f0d0f5a9a3 · PaddlePaddle / Paddle-Lite

10 12月, 2019 5 次提交

H

modify ci_build.sh to support ci test with actual mobilephone test=develop (#2525) · 381fa7f6
由 huzhiqiang 提交于 12月 10, 2019

381fa7f6
W
fix type_target_cast pass. support only copy once for multiple use arg. test=develop (#2572) · 8903c795
由 Wilber 提交于 12月 10, 2019
```
For multiple-use parameters, only copy once
```
8903c795

modify static_kernel_pass to support select the kernel according to input type (#2488) · 7ef0e7fe

由 Wilber 提交于 12月 10, 2019

修改了选kernel的逻辑，默认从模型文件中读取出lod_tensor的data type，在static_kernel_pick pass中如果kernel输入输出的类型与读取的data type完全一致，则选择该Kernel的概率增大。

- 增加 从模型文件__model__读取lod_tensor的data type到cpp::vardesc

- program中增加unordered_map<string, type>字段，并在 Program::PrepareWorkspace中对该字段赋值

- 修改了node.h文件，将const Type* 更改为Type*，并在SSAGraph::Build过程中为符合条件的type*赋值

- static_kernel_pick_pass中添加新规则，如果kernel的输入类型输出类型与__model__中存储的类型的一致，则score*=2。

- 支持模型中用到sequence_reverse_float kernel（输入输出均为float）和sequence_reverse_int64 kernel（输入输出均为int64），能够根据输入输出type选kernel

7ef0e7fe

Y

[ARM] add instance norm op and ut, test=develop (#2578) · 9a3552db
由 yiicy 提交于 12月 10, 2019

9a3552db
Z
[NPU] add argamx op bridge and unit test (#2580) · c64036ca
由 zhupengyang 提交于 12月 10, 2019
```
test=develop
```
c64036ca

09 12月, 2019 4 次提交
- Y
  
  fix ios demo build error, test=develop (#2579) · 0c44ac9c
  由 yiicy 提交于 12月 09, 2019
  
  0c44ac9c
- Y
  
  [JAVA API]java tensor api setData and getData support Int type, test=develop (#2583) · 71aa1b49
  由 yiicy 提交于 12月 09, 2019
  
  71aa1b49
- Z
  [NPU] support relu6 (#2582) · a2f981a4
  由 zhupengyang 提交于 12月 09, 2019
```
test=develop
```
  a2f981a4
- H
  Static libraty in tiny pub (#2560) · ea837ec7
  由 huzhiqiang 提交于 12月 09, 2019
```
* add static library in tiny_publish
* move flto and ffunction-sections cmake option into the tiny publish so result of java,cxx,python 
```
  ea837ec7
08 12月, 2019 1 次提交
- L
  
  Add fc op on lite x86 platform (#2568) · d76c529a
  由 liu zhengxi 提交于 12月 08, 2019
  
  d76c529a
07 12月, 2019 2 次提交

Z
[NPU] add unsqueeze op bridge and unit test (#2570) · 3f28e00e
由 zhupengyang 提交于 12月 07, 2019
```
test=develop
```
3f28e00e

Support mask_rcnn (#2484) · c2f72cb3

由 juncaipeng 提交于 12月 07, 2019

* add arm split lod tensor, test=develop

* add arm merge lod tensor, test=develop

* update split merge lod tensor, test=develop

* add reduce_prob op, test=develop

* support mask_rcnn succeed, test=develop

c2f72cb3

06 12月, 2019 1 次提交
- W
  fix fill_constant bug and add int64->int32 cast test=develop (#2566) · e17295cc
  由 Wilber 提交于 12月 06, 2019
```
- fix fill_constant bug.
- cast op support int64_t->int32_t
```
  e17295cc
04 12月, 2019 3 次提交
- W
  update cuda kernels to run content-dnn models test=develop (#2554) · aa67c28e
  由 Wilber 提交于 12月 04, 2019
```
update cuda kernels to run content-dnn model
```
  aa67c28e
- Z
  [cuda] [int8] resnet50 cuda int8 support (#2417) · f7574646
  由 Zhaolong Xing 提交于 12月 04, 2019
```
* init resnet cuda int8 support
test=develop

* refine cuda unit test
test=develop

* add the forgeted file.
test=develop
```
  f7574646
- 石
  
  refactor profile tools, test=develop (#2536) · 8a634b71
  由石晓伟提交于 12月 04, 2019
  
  8a634b71
03 12月, 2019 5 次提交
- Y
  [Demo] add cxx mobilenetv1-ssd detection demo, test=develop (#2541) · c8b51f82
  由 yiicy 提交于 12月 03, 2019
```
* [Demo] add cxx mobilenetv1-ssd detection demo, test=develop

* add makefile to mobile detection demo, test=develop

* [Demo] add cxx mobilenetv1-ssd detection demo, test=develop

* [demo] fix mobile_detection code style, test=develop

* [Demo] fix demo code style, test=develop

* [Demo] fix detection demo makefile dependency, test=develop
```
  c8b51f82
- Z
  fix quant dequant fuse pass bug (#2552) · 137d7a6d
  由 Zhaolong Xing 提交于 12月 03, 2019
```
test=develop
```
  137d7a6d
- T
  Armv8 4x4 gemm (#2528) · 1ebac1c0
  由 TianXiaogang 提交于 12月 03, 2019
```
* feat: add sgemm4x4 for armv8

* fix: fix armv7 gemm choose condition
```
  1ebac1c0
- J
  set the bias of int8 conv as float, test=develop (#2553) · 04d2b4eb
  由 juncaipeng 提交于 12月 03, 2019
```
* set the bias of int8 conv as float, test=develop
```
  04d2b4eb
- Z
  [NPU] support 1-dimension input y in elemetwise ops (#2546) · 919282cf
  由 zhupengyang 提交于 12月 03, 2019
```
test=develop
```
  919282cf
02 12月, 2019 2 次提交
- L
  
  delete useless code for x86 platform (#2535) · 1b875ae8
  由 liu zhengxi 提交于 12月 02, 2019
  
  1b875ae8
- L
  
  Fix the conv compute shape (#2534) · 0a100120
  由 liu zhengxi 提交于 12月 02, 2019
  
  0a100120
30 11月, 2019 1 次提交
- Z
  [NPU] fix act; refine act unit tests; fix batch_norm (#2533) · 0349bfd0
  由 zhupengyang 提交于 11月 30, 2019
```
test=develop
```
  0349bfd0
29 11月, 2019 3 次提交
- Y
  [LITE][PASS] Fix static kernel pick pass, if op is not int8, but kernel is... · ddce609e
  由 Yuan Shuai 提交于 11月 29, 2019
```
[LITE][PASS] Fix static kernel pick pass, if op is not int8, but kernel is int8. test=develop (#2526)
```
  ddce609e
- Z
  [NPU] fix pool padding type (#2524) · 40137111
  由 zhupengyang 提交于 11月 29, 2019
```
test=develop
```
  40137111
- Z
  [NPU] add reduce_mean op bridge and unit test (#2522) · c809321d
  由 zhupengyang 提交于 11月 29, 2019
```
* [NPU] add reduce_mean op bridge and unit test

test=develop

* refine xpu_pass place order; add bridges use

test=develop
```
  c809321d
28 11月, 2019 5 次提交
- 石
  
  fix cuda cmake, test=develop (#2514) · 1984a785
  由石晓伟提交于 11月 28, 2019
  
  1984a785
- H
  
  add dependency of fluid_data_type into gather_compute_x86 test=develop (#2516) · 1d232071
  由 huzhiqiang 提交于 11月 28, 2019
  
  1d232071
- W
  remove opencl place in benchmark test=develop (#2517) · 279ee29d
  由 Wilber 提交于 11月 28, 2019
```
remove opencl place in benchmark
```
  279ee29d
- Z
  [NPU] add sqrt op bridge and unit test (#2515) · 2900f3ee
  由 zhupengyang 提交于 11月 28, 2019
```
test=develop
```
  2900f3ee
- Y
  
  [cherry-pick][ARM] conv_transpose operator support padding_algorithm, test=develop (#2500) · 5fac0949
  由 yiicy 提交于 11月 28, 2019
  
  5fac0949
27 11月, 2019 5 次提交

fix winograd reinitwhenneed (#2511) · f8cc8783

由 TianXiaogang 提交于 11月 27, 2019


* add winograd c4 implement (#2494)
test=develop
fix: fix conv_block prepack_input_nxwc4 bug
* fix: optimize sgemm_c4 in armv7
     change condition of choose winograd kernel
* fix: change conv choose kernel condition
test=develop

f8cc8783

Z
[NPU] fix elementwise_add op bridge and unit test (#2503) · fc913904
由 zhupengyang 提交于 11月 27, 2019
```
add elementwise_sub, mul, div op bridge

test=develop
```
fc913904
H

add into beam_search.cc test=develop (#2506) · e8ea4a56
由 huzhiqiang 提交于 11月 27, 2019

e8ea4a56

fix bugs of concat, reshape and slice op and add usleep in fpga regpoll,... · 1fa9d2e8

由 qnqinan 提交于 11月 27, 2019

fix bugs of concat, reshape and slice op and add usleep in fpga regpoll, test=develop，  close #2501 (#2502)

* update proposal and psroipool kernel file in FPGA V2 track

* update, test=develop

* update FPGA v2 pe cpp file and ew kernel files, test=develop

* fix a bug of sigmoid kernel in FPGA v2 track, test=develop

* fix bugs of concat, reshape and slice op and add usleep in fpga regpoll, test=develop

* add interupt clear operation before op compute in FPGA V2 track, test=develop

1fa9d2e8

fill_constant op support param shape can be tensor or tensorlist, test=develop (#2459) · 89df8f01
由 myq406450149 提交于 11月 27, 2019
```
* fill_constant can support shape is tensor or tensorlist
```
89df8f01

26 11月, 2019 3 次提交
- H
  
  add graph_op into basic type test=develop (#2504) · 826f6605
  由 huzhiqiang 提交于 11月 26, 2019
  
  826f6605
- T
  add winograd c4 implement (#2494) · e0eee83c
  由 TianXiaogang 提交于 11月 26, 2019
```
fix: fix conv_block prepack_input_nxwc4 bug
* fix: optimize sgemm_c4 in armv7
     change condition of choose winograd kernel
* fix: change conv choose kernel condition
```
  e0eee83c
- Z
  [NPU] add square op bridge and unit test (#2498) · 93cfddb5
  由 zhupengyang 提交于 11月 26, 2019
```
test=develop
```
  93cfddb5