Commits · 94244715c4f4cccc678b19347755ed154b6284d3 · PaddlePaddle / Paddle-Lite

29 Aug, 2019 · 11 commits

[NPU] fix npu compile of publish_inference (#1911) · 94244715
Committed by tensor-tang on Aug 29, 2019

add conv2d transpose fuse (#1909) · ce319638
Committed by tensor-tang on Aug 29, 2019

Add yolo_box_cuda multiclass_nms_host kernel. (#1908) · 5752dbd7

Committed by Wilber on Aug 29, 2019

* add yolo_box_compute cuda

* move multiclass_nms(arm) to host

* add lod in scale op

* add yolo_box_cuda cmake config

* modify shuffle_channel_fuse and transpose_softmax_transpose_fuse to support run ssd model. test=develop

* reshape and transpose op don't have xshape output.

* modify yolo_box_compute_cuda, use tensor to manage cuda memory test=develop

* add yolo_box use kernel test=develop

[Java API][Comment] update java api & delete some comments (#1912) · 75f8bf3d
Committed by sangoly on Aug 29, 2019

refine toolchain test=develop (#1904) · b120bccb

Committed by Yanzhan Yang on Aug 29, 2019

* refine toolchain test=develop

* fix wrap compilation error

* fix yolov3 armv8 compilation test=develop

* revert to armv7 as default test=develop

* fix fpga compilation test=develop

add stack op and add reduce_mean op and their unit tests (#1888) · 8ccd01a6
Committed by liu zhengxi on Aug 29, 2019

Add load from memory interface (#1903) · 2109d231

Committed by Zhaolong Xing on Aug 29, 2019

* paddle lite cuda init
can run model with leaky_relu

* add the missing file.
test=develop

* add the load from memory interface.
test=develop

* refine this pr. fix comments
fix ci error
test=develop

[Java API] add setThreads & setPowerMode interface (#1907) · ab84a3d5
Committed by sangoly on Aug 29, 2019

[NPU] enable npu program rollback (#1906) · 28aca4a8
Committed by tensor-tang on Aug 29, 2019
```
test=develop
```

add ops for faster rcnn, including affine_channel, anchor_generator, generate_proposals and roi_align (#1895) · f3035827

Committed by juncaipeng on Aug 29, 2019

* add ops for faster rcnn

* disable test for generate_proposals and roi_align, test=develop

* remove .swp file

* remove log in tensor slice

* finish the unit test for roi_align, test=develop

[NPU] refine npu subgraph and clean code (#1902) · 829fad6d

Committed by tensor-tang on Aug 29, 2019

* add npu script and tester

* fix npu armv7 so and refine tests

test=develop

* update fix and refine log

test=develop

* refine npu generate api

* refine npu subgraph

* refine npu gen and clean code

* fix model load

* refine node2rm in subgraph

* refine the build npu functions

test=develop

28 Aug, 2019 · 14 commits
- Modify cast op and remove warning in argmax_test (#1894) · 5fe41d5c
  Committed by juncaipeng on Aug 28, 2019
```
* modify cast op, test=develop

* modify cast op and remove warning in argmax_test, test=develop
```
- fix yolov3 concat op test=develop (#1901) · 0af82ab3
  Committed by Yanzhan Yang on Aug 28, 2019
- fix elementwise_add && fix run.py && add namespace change script (#1898) · e520e49e
  Committed by Yanzhan Yang on Aug 28, 2019
- change num proc; test=develop (#1889) · 97c73cae
  Committed by Yan Chunwei on Aug 28, 2019
- add nearest_interp op converter (#1879) · 1ca6bf92
  Committed by zhupengyang on Aug 28, 2019
```
test=developt branch
```
- refine wrap to support GPU test=develop (#1892) · c0a0ea60
  Committed by Yanzhan Yang on Aug 28, 2019
- add floor op, elementwise_div op and assign op test=develop (#1882) · ca6974c5
  Committed by huzhiqiang on Aug 28, 2019
- add transpose-softmax-transpose fuse pass (#1863) · 58700062
  Committed by zhupengyang on Aug 28, 2019
```
* add transpose-softmax-transpose fuse pass

test=develop

* enable supported lite-npu ops

test=develop
```
- add x86 math: sequence_scale, sequence_padding, sequence2batch, sequence_pooling. test=develop (#1884) · 0fc8b4d4
  Committed by huzhiqiang on Aug 28, 2019
- [NPU] fix conv2d npu bridge, supports bias from input map (#1839) · eff086e4
  Committed by hong19860320 on Aug 28, 2019
```
* [NPU] fix conv2d npu bridge, supports bias from input map
test=develop

* [NPU] support more dimensions for the bias of conv2d NPU bridge
test=develop
```
- [Publish] include light api impl in full publish lib test=develop (#1885) · e986dce6
  Committed by sangoly on Aug 27, 2019
- [Protobuf] add combined-param model save/load supported test=develop (#1876) · fd07242c
  Committed by sangoly on Aug 27, 2019
- 1.add density_prior_box for gpu. (#1877) · 32033452
  Committed by Huie on Aug 28, 2019
```
2.add flatten2 for gpu.
3.add concat 4 inputs size for gpu.
4.fix pool.
5.fix transpose2
test=develop
```
- fix infershape when cmakelist option CPU&GPU_CL is ON, test=develop (#1880) · aae3f926
  Committed by zp7 on Aug 28, 2019
27 Aug, 2019 · 5 commits
- lite cuda init: can run a simple model with leaky_relu (#1860) · a270d326
  Committed by Zhaolong Xing on Aug 27, 2019
```
* paddle lite cuda init
can run model with leaky_relu

* add the missing file.
test=develop
```
- [test=develop]1.fix crash when gpu op scale&elementwise_add input dim… (#1856) · cdd5745d
  Committed by zp7 on Aug 27, 2019
```
* [test=develop]1.fix crash when gpu op scale&elementwise_add input dim size equal 2
2.add gpu op mul

* fix code style test=develop
```
- [NPU] add script and refine tests (#1873) · de2e8d84
  Committed by tensor-tang on Aug 27, 2019
```
* add npu script and tester

* fix npu armv7 so and refine tests

test=develop

* update fix and refine log

test=develop
```
- fix yolov3 test=develop (#1875) · b08e931d
  Committed by Yanzhan Yang on Aug 27, 2019
- add BUILD_EXTRA option to build.sh and ci_build.sh (#1849) · 2b60e813
  Committed by Yan Chunwei on Aug 27, 2019
26 Aug, 2019 · 7 commits
- Catch mobile exceptions test=develop (#1867) · 690ab8be
  Committed by Jiaying Zhao on Aug 26, 2019
```
* Catch mobile exceptions test=develop

* code style format test=develop
```
- [API Generate] fix generate api bug test=develop (#1872) · 413db281
  Committed by sangoly on Aug 26, 2019
- modify benchmark.sh test=develop (#1871) · 3bbeffa7
  Committed by juncaipeng on Aug 26, 2019
- Add matmul op (#1837) · a1f4059f
  Committed by Wilber on Aug 26, 2019
```
* test=develop add matmul_op

* use lite::arm::math::sgemm func to implement matmul

* test=develop  pre-commit command to run clang-format

* Revert "test=develop  pre-commit command to run clang-format"

This reverts commit 3f56474f.

* test=develop pre-commit command to run clang-format
```
- fix benchmark threads, test=develop (#1870) · 8955124b
  Committed by juncaipeng on Aug 26, 2019
- [Naive buffer] make load/save be compatible with 32 and 64 arch test=develop (#1858) · 94671412
  Committed by sangoly on Aug 26, 2019
- enhance thirdparty download (#1857) · 249cbf81
  Committed by Yan Chunwei on Aug 26, 2019
25 Aug, 2019 · 2 commits
- leave tiny-publish out of third-party dependencies (#1853) · e5a76c98
  Committed by Yan Chunwei on Aug 25, 2019
- Modify benchmark (#1851) · 6ba3869b
  Committed by juncaipeng on Aug 25, 2019
24 Aug, 2019 · 1 commit
- enable fast test compilation && push opencl kernels by run.py (#1852) · ff36d9fc
  Committed by Yanzhan Yang on Aug 24, 2019
```
* enable fast test compilation && push opencl kernels by run.py

* merge cl kernels into so

* restore pre-commit
```