提交 · 59d079e8a69cffaefa1867e9d07cc33fc176a5b7 · PaddlePaddle / Paddle-Lite

03 1月, 2020 1 次提交
- Z
  [NPU] enhance unittest for bn, transpose (#2716) · 59d079e8
  由 zhupengyang 提交于 1月 03, 2020
```
test=develop
```
  59d079e8
31 12月, 2019 3 次提交

X86 and cuda compile simutaneously cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo... · f1cedb8f

由 Wilber 提交于 12月 31, 2019

X86 and cuda compile simutaneously cmake ..  -DCMAKE_BUILD_TYPE=RelWithDebInfo  -DWITH_MKL=ON           -DLITE_WITH_CUDA=ON           -DWITH_MKLDNN=OFF           -DLITE_WITH_X86=ON           -DLITE_WITH_PROFILE=OFF          -DWITH_LITE=OFF           -DLITE_WITH_LIGHT_WEIGHT_FRAMEWORK=OFF           -DWITH_PYTHON=OFF           -DWITH_TESTING=ON           -DLITE_WITH_ARM=OFF           -DLITE_ON_TINY_PUBLISH=OFF           -DCUDNN_ROOT=/usr/local/cudnn/           -DLITE_BUILD_EXTRA=ON (#2708)

x86 and cuda compile simutaneously

f1cedb8f

Z
[XPU] bn unit test (#2706) · bc6d5adc
由 zhupengyang 提交于 12月 31, 2019
```
test=develop
```
bc6d5adc

[LITE][NPU][XPU] Refine the registration and implementation of op bridges (#2700) · a29c84a2

由 hong19860320 提交于 12月 31, 2019

* Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU.

* Refine the registration of op bridges and make it similar to the registration of op and kernel.

* Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor.

* Add the unit test of the scale and softmax op bridge for NPU.

a29c84a2

26 12月, 2019 1 次提交
- Z
  [XPU] mul unittest (#2676) · 6bce0133
  由 zhupengyang 提交于 12月 26, 2019
```
test=develop
```
  6bce0133
25 12月, 2019 2 次提交

J
fix op inputs and outputs type (#2647) · 168ce9a9
由 juncaipeng 提交于 12月 25, 2019
```
* fix op inputs and outputs type, test=develop
```
168ce9a9

[X86] Polish the implementation of fc and imporve the unittest (#2656) · 28481458

由 Yiqun Liu 提交于 12月 25, 2019

* Remove GEMM padding in fc_compute.
test=develop

* Write a common ParallelFor function to run the for loop in parallel.

* Add the codes of padding GEMM back in fc.

* Refine the code of fc when padding_weight is false to avoid the definition of temporary Tensor.

* Refine the unit test of fc and add testing case of padding and parallel.
test=develop

* Enable more test cases in common fc unittest, including padding and parallel for x86 target.

* Remove the fc test under kernels/x86.
test=develop

* Disable relu in test of fc for non-x86 target.
test=develop

* Change the eps of arm.
test=develop

28481458

24 12月, 2019 5 次提交
- Z
  
  [XPU] matmul bridge and unit test (#2666) · d345a7fc
  由 zhupengyang 提交于 12月 24, 2019
  
  d345a7fc
- H
  
  [LITE][XPU] Fix dropout op bridge and unit test for BERT (#2665) · d444ecbf
  由 hong19860320 提交于 12月 24, 2019
  
  d444ecbf
- H
  [LITE][NPU][XPU] Support multiple types for XPU and NPU op bridges (#2646) · 05da0c72
  由 hong19860320 提交于 12月 24, 2019
```
* Support multiple types for XPU and NPU op bridges

* Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT

* Fix the definition of lookup_table kernel for X86
```
  05da0c72
- Z
  [XPU] add dropout bridge and unit test (#2650) · d904c9dd
  由 zhupengyang 提交于 12月 24, 2019
```
test=develop
```
  d904c9dd
- Z
  [XPU] elementwise_add, softmax unit test (#2653) · 64d01cb9
  由 zhupengyang 提交于 12月 24, 2019
```
* [XPU] elementwise_add unit test

* [XPU] softmax unit test

test=develop
```
  64d01cb9
23 12月, 2019 1 次提交
- Y
  
  [ARM] add grid_sampler op and ut, test=develop (#2598) · 3723451b
  由 yiicy 提交于 12月 23, 2019
  
  3723451b
21 12月, 2019 1 次提交
- Z
  [XPU] add layer_norm bridge and unit test (#2640) · 4dd6a4b8
  由 zhupengyang 提交于 12月 21, 2019
```
test=develop
```
  4dd6a4b8
20 12月, 2019 2 次提交
- Z
  [XPU] add reshape bridge and unit test (#2621) · a13c592d
  由 zhupengyang 提交于 12月 20, 2019
```
test=develop
```
  a13c592d
- Z
  [XPU] add transpose bridge and unit test (#2630) · b53ece7a
  由 zhupengyang 提交于 12月 20, 2019
```
* [XPU] add transpose bridge and unit test

test=develop
```
  b53ece7a
18 12月, 2019 1 次提交
- J
  Support Mask RCNN2 (#2588) · d1b7aec5
  由 juncaipeng 提交于 12月 18, 2019
```
* Support Mask RCNN2 (#2588)
```
  d1b7aec5
13 12月, 2019 1 次提交
- H
  [LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at... · d5434aa2
  由 hong19860320 提交于 12月 13, 2019
```
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
```
  d5434aa2
10 12月, 2019 1 次提交
- Y
  
  [ARM] add instance norm op and ut, test=develop (#2578) · 9a3552db
  由 yiicy 提交于 12月 10, 2019
  
  9a3552db
07 12月, 2019 1 次提交

Support mask_rcnn (#2484) · c2f72cb3

由 juncaipeng 提交于 12月 07, 2019

* add arm split lod tensor, test=develop

* add arm merge lod tensor, test=develop

* update split merge lod tensor, test=develop

* add reduce_prob op, test=develop

* support mask_rcnn succeed, test=develop

c2f72cb3

28 11月, 2019 1 次提交
- Y
  
  [cherry-pick][ARM] conv_transpose operator support padding_algorithm, test=develop (#2500) · 5fac0949
  由 yiicy 提交于 11月 28, 2019
  
  5fac0949
27 11月, 2019 1 次提交
- fill_constant op support param shape can be tensor or tensorlist, test=develop (#2459) · 89df8f01
  由 myq406450149 提交于 11月 27, 2019
```
* fill_constant can support shape is tensor or tensorlist
```
  89df8f01
19 11月, 2019 1 次提交
- Y
  
  fix lrn param, align to fluid, test=develop (#2452) · 94255f6c
  由 yiicy 提交于 11月 19, 2019
  
  94255f6c
16 11月, 2019 1 次提交
- H
  
  [LITE][X86] Add search_aligned_mat_mul and search_seq_fc op for X86 (#2428) · 78f76834
  由 hong19860320 提交于 11月 16, 2019
  
  78f76834
13 11月, 2019 2 次提交
- L
  Update the ops to fluid (#2406) · 518a87ef
  由 liu zhengxi 提交于 11月 13, 2019
```
align the lite nearest， bilinear op to fluid on arm and cuda
```
  518a87ef
- J
  fix error for AxesTensorList in unsqueeze op, test=develop (#2411) · f4e06650
  由 juncaipeng 提交于 11月 13, 2019
```
* fix error for AxesTensorList in unsqueeze op
```
  f4e06650
12 11月, 2019 1 次提交
- J
  Upgrade concat and unsqueeze, test=develop (#2378) · 26470600
  由 juncaipeng 提交于 11月 12, 2019
```
* update concat and unsqueeze, test=develop
```
  26470600
07 11月, 2019 1 次提交

check arm kernels type to make sure all_library_links work normally (#2386) · 916e80c2

由 huzhiqiang 提交于 11月 06, 2019

We have changed 11 arm_kernels into extra type in #2347 , which has caused test_compiling failure. In this PR , we move their 11 related arm_kernel_test into build_extra=ON

916e80c2

06 11月, 2019 2 次提交

update slice and reshape op and test on one op fake model test=develop (#2377) · e74609b7

由 Wilber 提交于 11月 06, 2019

update reshape op to support multiple input types of shape.
priority: input(ShapeTensor) > input(Shape) > attr(shape)

update slice op to support multiple iput types of starts and ends.
priority: input(StartsTensor) > input(StartsTensorList) > attr(starts)

e74609b7

fix fill_constant kernel bug test=develop (#2376) · 9d97d56e

由 Wilber 提交于 11月 06, 2019

fill_constant kernel only registered float type, only the float data type is produced, which is obviously a bug.

Now, produce data based on the data type attr.

By the way, fix the cast kernel bug.

9d97d56e

28 10月, 2019 1 次提交

[LITE][XPU] initial support for XPU (#2202) · 06d058fe

由 hong19860320 提交于 10月 28, 2019

* Initial support for XPU
* Fix compiling errors of XPU
* Move XPU op kernel bridges from backends to kernels to fix deps order
* Change the namespace and directory of XPU bridges
* Add XPU SDK
* Fix header files and namespace of XPU SDK
* Add unit tests for relu and conv2d ops
* Restore the modification of paddle_api_test
* Supports simple model which contains only a relu layer
* Add compiling scripts for XPU
* Fix compiling errors of XPU
* Add comments for XPU LoadModel and BuildModel

06d058fe

11 10月, 2019 1 次提交
- J
  
  add rsqrt op, test=develop (#2176) · dfce4621
  由 juncaipeng 提交于 10月 11, 2019
  
  dfce4621
23 9月, 2019 1 次提交
- J
  add cast from uint8 to float, test=develop (#2080) · 9941d746
  由 juncaipeng 提交于 9月 23, 2019
```
* add cast from uint8 to float, test=develop
```
  9941d746
19 9月, 2019 1 次提交

石

add full_api_static target and fix building errors, test=develop (#2064) · eef7ea0f

由石晓伟提交于 9月 19, 2019

* add full_api_static target and fix building errors, test=develop

* fix build errors, test=develop

* fix code style, test=develop

* fix lite/model_parser/pb/var_desc.cc, test=develop

* fix building errors, test=develop

* modify lite/tools/debug/CMakeLists.txt, test=develop

eef7ea0f

18 9月, 2019 1 次提交

fix bias quantize error && fix clang build error (#2049) · 81dffbe8

由 Xiaoyang LI 提交于 9月 18, 2019

* fix gemm_int8, gemv-int8 and conv-int8 math function, add float bias

* change conv impl

* neon int8 kernel support float bias

* arm compute kernel support float bias

* add math_test target

* add tensor utils for testing, fix sgemm ut error

* add gemm_int8 unit test, support float bias

* fix build script

* add conv compute unit test for arm

* fix build script, test=develop

* fix fp32 dw conv3x3s1, test=develop

* add fp32 dw conv3x3s1, test=develop

* add armv7 fp32 dw conv3x3s1, test=develop

* add fp32 depthwise conv3x3s2, test=develop

* fix fp32 conv3x3 depthwise build error, test=develop

* fix gemm_like conv trans weights error, test=develop

* fix int8 depthwise conv3x3 error, test=develop

* turn on all test for arm fp32 conv, test=develop

* fix int8 conv1x1 error

* fix int8 direct conv3x3s1 error, test=develop

* fix int8 direct conv3x3s2, test=develop

* turn on all test for arm int8 conv, test=develop

* fix int8 fc error, change mobilenetv1-int8 ground-truth result to fluid, test=develop

* remove debug info, strip ut binary, test=develop

* fix conv compute error, test=develop

* change Init() to ReInitWhenNeeded(), test=develop

* fix code style, test=develop

* remote engine_test, test=develop

* fix building server tests error, test=develop

* fix sdot clang build error, test=develop

* fix sgemm ut timeout error, test=develop

* fix clang build error, test=develop

* turn off math basic test due to ci time out, test=develop

* fix conv_int8 ut error, test=develop

81dffbe8

17 9月, 2019 2 次提交
- W
  modify norm kernel to run caffe_facedetection model (#2008) · 71bb3188
  由 Wilber 提交于 9月 17, 2019
```
* modify norm kernel to run caffe_facedetection model

* reserve bind norm, remove calc of norm output
```
  71bb3188
- J
  fix yolo_box bug (#2034) · 1b1d7a83
  由 juncaipeng 提交于 9月 17, 2019
```
* fix yolo_box bug, test=develop

* fix test bug for yolo_box, test=develop
```
  1b1d7a83
12 9月, 2019 2 次提交
- Z
  fix bilinear-interp arm compute (#2029) · ca424e73
  由 zhupengyang 提交于 9月 12, 2019
```
fix bilinear-interp unit test for more cases

test=develop
```
  ca424e73
- W
  add unsqueeze and range op (x2paddle) (#1988) · 3c08f676
  由 Wilber 提交于 9月 12, 2019
```
* add unsqueeze and range op. modify concat op test=develop

* modify exception in range_test_x86
```
  3c08f676
10 9月, 2019 1 次提交
- W
  
  add elementwise_sub and modify argmax (#1964) · 62ea82d0
  由 Wilber 提交于 9月 10, 2019
  
  62ea82d0