提交 · 8bc7c04358e78912398aa8c364fabb5c46600a16 · PaddlePaddle / Paddle-Lite

10 10月, 2019 1 次提交
- W
  fix yolobox_cuda bug · 8bc7c043
  由 Wilber 提交于 10月 10, 2019
```
* fix yolobox_cuda bug 
* update code format
```
  8bc7c043
09 10月, 2019 1 次提交

由 yiicy 提交于 10月 09, 2019

*  imporve prepack_input func speed in int8 3x3s1 dw conv

* fix code style

* fix code style

* improve 3x3s1 dw fp32 conv speed a little

* arm add 5x5s1 int8 dw conv, test=develop

498a30cf

27 9月, 2019 1 次提交

can run yolov3 fp32 on cuda devices (#2092) · c4b5e32c

由 Zhaolong Xing 提交于 9月 27, 2019

* add conv int8 support(in condition which the input or output channel not be the times of 4)
add add_kernel for cuda.

* can run yolov3 fp32
test=develop

* 1. fix bug with yolov3 run
test=develop

c4b5e32c

25 9月, 2019 1 次提交
- X
  
  add workspace compute funcs for direct conv, test=develop (#2132) · f03217b4
  由 Xiaoyang LI 提交于 9月 25, 2019
  
  f03217b4
19 9月, 2019 3 次提交

石

add full_api_static target and fix building errors, test=develop (#2064) · 4a948cfc

由石晓伟提交于 9月 19, 2019

* add full_api_static target and fix building errors, test=develop

* fix build errors, test=develop

* fix code style, test=develop

* fix lite/model_parser/pb/var_desc.cc, test=develop

* fix building errors, test=develop

* modify lite/tools/debug/CMakeLists.txt, test=develop

4a948cfc

fix building model_optimize_tool error on mac (#2075) · 26925ab9

由 Xiaoyang LI 提交于 9月 19, 2019

* fix building model_optimize_tool error on mac, test=develop

* fix model_optimize_tool build error, test=develop

26925ab9

Bug fix for model save and load (#1992) · 5404c2ee

由 TianXiaogang 提交于 9月 19, 2019

* fix: fix model parser and save bug

* style: delete debug code

* fix: fix light_predictor program run model with subblock bug

5404c2ee

18 9月, 2019 1 次提交

fix bias quantize error && fix clang build error (#2049) · 8d6f475e

由 Xiaoyang LI 提交于 9月 18, 2019

* fix gemm_int8, gemv-int8 and conv-int8 math function, add float bias

* change conv impl

* neon int8 kernel support float bias

* arm compute kernel support float bias

* add math_test target

* add tensor utils for testing, fix sgemm ut error

* add gemm_int8 unit test, support float bias

* fix build script

* add conv compute unit test for arm

* fix build script, test=develop

* fix fp32 dw conv3x3s1, test=develop

* add fp32 dw conv3x3s1, test=develop

* add armv7 fp32 dw conv3x3s1, test=develop

* add fp32 depthwise conv3x3s2, test=develop

* fix fp32 conv3x3 depthwise build error, test=develop

* fix gemm_like conv trans weights error, test=develop

* fix int8 depthwise conv3x3 error, test=develop

* turn on all test for arm fp32 conv, test=develop

* fix int8 conv1x1 error

* fix int8 direct conv3x3s1 error, test=develop

* fix int8 direct conv3x3s2, test=develop

* turn on all test for arm int8 conv, test=develop

* fix int8 fc error, change mobilenetv1-int8 ground-truth result to fluid, test=develop

* remove debug info, strip ut binary, test=develop

* fix conv compute error, test=develop

* change Init() to ReInitWhenNeeded(), test=develop

* fix code style, test=develop

* remote engine_test, test=develop

* fix building server tests error, test=develop

* fix sdot clang build error, test=develop

* fix sgemm ut timeout error, test=develop

* fix clang build error, test=develop

* turn off math basic test due to ci time out, test=develop

* fix conv_int8 ut error, test=develop

8d6f475e

17 9月, 2019 1 次提交
- J
  fix yolo_box bug (#2034) · 7497a7d2
  由 juncaipeng 提交于 9月 17, 2019
```
* fix yolo_box bug, test=develop

* fix test bug for yolo_box, test=develop
```
  7497a7d2
16 9月, 2019 2 次提交
- L
  Gru op (#2002) · 1cb36af6
  由 lhl960107 提交于 9月 16, 2019
```
* add x86 gru&&relu&&sequence_expand_as op test=develop
```
  1cb36af6
- X
  
  fix math dependencies error (#2023) · 79a03c2b
  由 Xiaoyang LI 提交于 9月 16, 2019
  
  79a03c2b
12 9月, 2019 4 次提交
- Z
  fix bilinear-interp arm compute (#2029) · af083da3
  由 zhupengyang 提交于 9月 12, 2019
```
fix bilinear-interp unit test for more cases

test=develop
```
  af083da3
- H
  add x86 math lstm and selected_rows test=develop (#1991) · 8b0dc8a3
  由 huzhiqiang 提交于 9月 12, 2019
```
add math function: lstm  and selected_rows into lite/x86/math
add selected_rows and rw_lock into lite/fluid
add lstm_cpu_kernel and  lstm_kernel into lite/x86/detail
```
  8b0dc8a3
- W
  
  add min_max_aspect_ratios_order attr in prior box op test=develop (#2016) · f04ed39c
  由 Wilber 提交于 9月 12, 2019
  
  f04ed39c
- W
  add transpose kernel for cuda test=develop (#1997) · fb40c748
  由 Wilber 提交于 9月 12, 2019
```
add transpose kernel for cuda
```
  fb40c748
11 9月, 2019 1 次提交
- 石
  make passes related to the device type, test=develop (#2012) · 3c0e8a6a
  由石晓伟提交于 9月 11, 2019
```
* make passes related to the device type, test=develop

* improve tips, test=develop
```
  3c0e8a6a
10 9月, 2019 3 次提交
- L
  
  add x86 softmax kernel and fix jit compute bugs test=develop (#2007) · 6006a87c
  由 lijianshe02 提交于 9月 10, 2019
  
  6006a87c
- W
  
  add elementwise_sub and modify argmax (#1964) · 192320c4
  由 Wilber 提交于 9月 10, 2019
  
  192320c4
- T
  
  fix fpga compile problem and kernels (#1989) · 49d495c0
  由 TianXiaogang 提交于 9月 10, 2019
  
  49d495c0
09 9月, 2019 1 次提交
- J
  add assign_value and hard_sigmoid, add fluid_type (#1983) · 9796c57d
  由 juncaipeng 提交于 9月 09, 2019
```
* add assign_value op, arm kernel and test, add fluid_type, test=develop

* add hard_sigmoid, test=develop
```
  9796c57d
07 9月, 2019 1 次提交

add lite x86 ops for ASR test=develop (#1981) · 25b775d6

由 lijianshe02 提交于 9月 07, 2019

* add lite x86 ops for ASR test=develop

* add lite x86 ops for ASR test=develop

* fix x86 ci run test problems test=develop

* fix mkl path for CI test=develop

25b775d6

06 9月, 2019 2 次提交

W
modify nearest_interpolate when attr align_corners=false (bug_fix) (#1969) · 6b48313c
由 Wilber 提交于 9月 06, 2019
```
* modify slice op and add slice test

* modify nearest_polate when align_corners=false (bugfix)
```
6b48313c

add cudnn conv fp32, int8 support (#1974) · 23d83c04

由 Zhaolong Xing 提交于 9月 06, 2019

* paddle lite cuda init
can run model with leaky_relu

* add the missing file.
test=develop

* add the load from memory interface.
test=develop

* refine this pr. fix comments
fix ci error
test=develop

* conv impl
fp32:
conv, conv+bais, conv+bias+relu, conv+bias+leaky_relu

int8:
conv, conv+bais+relu(int8 or fp32 output), conv+bias+leaky_relu(int8 or fp32 output)

can run conv+ bias+relu using cxx_api
test=develop

* move the lite/cuda/math to backends/cuda/math
test=develop

23d83c04

04 9月, 2019 1 次提交
- W
  modify slice op and add slice test (#1944) · 52f933d4
  由 Wilber 提交于 9月 04, 2019
```
* modify slice op and add slice test

* modify slice op bug
```
  52f933d4
03 9月, 2019 2 次提交
- H
  
  move npu into backends(directory) and move python/ into tools/python (#1958) · 0328b5c2
  由 huzhiqiang 提交于 9月 03, 2019
  
  0328b5c2
- H
  
  create backends directory and move hardware backends into it (#1954) · fede4a1c
  由 huzhiqiang 提交于 9月 03, 2019
  
  fede4a1c