提交 · d3d6ed4bd36ed3d49cc10f77395c92d2c4c7edb2 · PaddlePaddle / Paddle-Lite

21 10月, 2019 2 次提交

to support yolov3 unet alexnet can run on tx2 (#2216) · d3d6ed4b
由 myq406450149 提交于 10月 21, 2019
```
* add gpu kernel mul pool relu scale softmax dropout bilinear_interp and can run in tx2

* rm GREATER_EQUAL
```
d3d6ed4b

add cuda op(pool & softmax), support conv with padding_algorithm · 1a18d682

由 yiicy 提交于 10月 21, 2019

* cuda add softmax and pool op

* * fix armlinux can find sys/system_properties.h
* conv add padding_algorithm
test=develop

* delete padding_algorithm in op param, test=develop

* fix bugs, test=develop

1a18d682

18 10月, 2019 1 次提交
- W
  fix yolobox_cuda_test (#2208) · 2f57f5b4
  由 Wilber 提交于 10月 18, 2019
```
fix yolobox_cuda test precision error
```
  2f57f5b4
17 10月, 2019 1 次提交
- J
  
  add bilinear_interp_cuda_op, test=develop (#2197) · cb6b1b1c
  由 juncaipeng 提交于 10月 17, 2019
  
  cb6b1b1c
14 10月, 2019 1 次提交
- Z
  align yolov3 cuda int8 (#2183) · ed38d79b
  由 Zhaolong Xing 提交于 10月 14, 2019
```
test=develop
```
  ed38d79b
11 10月, 2019 1 次提交

CUDA: can run yolov3 int8 (#2172) · 29f448c6

由 Zhaolong Xing 提交于 10月 11, 2019

* add conv int8 support(in condition which the input or output channel not be the times of 4)
add add_kernel for cuda.

* can run yolov3 fp32
test=develop

* 1. fix bug with yolov3 run
test=develop

* can run yolov3 int8 test=develop

29f448c6

10 10月, 2019 1 次提交
- W
  fix yolobox_cuda bug · 8bc7c043
  由 Wilber 提交于 10月 10, 2019
```
* fix yolobox_cuda bug 
* update code format
```
  8bc7c043
27 9月, 2019 1 次提交

can run yolov3 fp32 on cuda devices (#2092) · c4b5e32c

由 Zhaolong Xing 提交于 9月 27, 2019

* add conv int8 support(in condition which the input or output channel not be the times of 4)
add add_kernel for cuda.

* can run yolov3 fp32
test=develop

* 1. fix bug with yolov3 run
test=develop

c4b5e32c

20 9月, 2019 1 次提交
- P
  
  refine concat cuda kernel, test=develop (#2081) · 716478ce
  由 Pei Yang 提交于 9月 20, 2019
  
  716478ce
19 9月, 2019 1 次提交

石

add full_api_static target and fix building errors, test=develop (#2064) · 4a948cfc

由石晓伟提交于 9月 19, 2019

* add full_api_static target and fix building errors, test=develop

* fix build errors, test=develop

* fix code style, test=develop

* fix lite/model_parser/pb/var_desc.cc, test=develop

* fix building errors, test=develop

* modify lite/tools/debug/CMakeLists.txt, test=develop

4a948cfc

12 9月, 2019 1 次提交
- W
  add transpose kernel for cuda test=develop (#1997) · fb40c748
  由 Wilber 提交于 9月 12, 2019
```
add transpose kernel for cuda
```
  fb40c748
09 9月, 2019 2 次提交

Add concat and elementwise_add cuda kernel (#1979) · ad8f9faa

由 Pei Yang 提交于 9月 09, 2019

* add nearest_interp_cuda kernel, test=develop

* add concat op and elementwise_add op

* remove eigen dependency from nearest_interp cuda kernel, test=develop

* free cuda pointers, test=develop

ad8f9faa

Z
add calib cuda kernel. (#1977) · 9681b642
由 Zhen Wang 提交于 9月 09, 2019
```
* add calib cuda kernel.

* add unit test for calib cuda kernel. test=develop
```
9681b642

06 9月, 2019 1 次提交

add cudnn conv fp32, int8 support (#1974) · 23d83c04

由 Zhaolong Xing 提交于 9月 06, 2019

* paddle lite cuda init
can run model with leaky_relu

* add the missing file.
test=develop

* add the load from memory interface.
test=develop

* refine this pr. fix comments
fix ci error
test=develop

* conv impl
fp32:
conv, conv+bais, conv+bias+relu, conv+bias+leaky_relu

int8:
conv, conv+bais+relu(int8 or fp32 output), conv+bias+leaky_relu(int8 or fp32 output)

can run conv+ bias+relu using cxx_api
test=develop

* move the lite/cuda/math to backends/cuda/math
test=develop

23d83c04

03 9月, 2019 1 次提交
- H
  
  create backends directory and move hardware backends into it (#1954) · fede4a1c
  由 huzhiqiang 提交于 9月 03, 2019
  
  fede4a1c
30 8月, 2019 1 次提交
- P
  add nearest_interp_cuda kernel, test=develop (#1920) · 7931c758
  由 Pei Yang 提交于 8月 30, 2019
```
add nearest_interp cuda kernel for Paddle-Lite
```
  7931c758
29 8月, 2019 1 次提交

Add yolo_box_cuda multiclass_nms_host kernel. (#1908) · 5752dbd7

由 Wilber 提交于 8月 29, 2019

* add yolo_box_compute cuda

* move multiclass_nms(arm) to host

* add lod in scale op

* add yolo_box_cuda cmake config

* modify shuffle_channel_fuse and transpose_softmax_transpose_fuse to support run ssd model. test=develop

* reshape and transpose op don't have xshape output.

* modify yolo_box_compute_cuda, use tensor to manage cuda memory test=develop

* add yolo_box use kernel test=develop

5752dbd7

27 8月, 2019 1 次提交
- Z
  lite cuda init: can run a simple model with leaky_relu (#1860) · a270d326
  由 Zhaolong Xing 提交于 8月 27, 2019
```
* paddle lite cuda init
can run model with leaky_relu

* add the missing file.
test=develop
```
  a270d326
16 8月, 2019 1 次提交
- Y
  
  publish lite (#1800) · 7a9e16c0
  由 Yan Chunwei 提交于 8月 16, 2019
  
  7a9e16c0