Commits · 59d079e8a69cffaefa1867e9d07cc33fc176a5b7 · PaddlePaddle / Paddle-Lite

03 Jan, 2020 1 commit
- [NPU] enhance unittest for bn, transpose (#2716) · 59d079e8
  Committed by zhupengyang on Jan 03, 2020
```
test=develop
```
31 Dec, 2019 3 commits

X86 and cuda compile simultaneously cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo... · f1cedb8f

Committed by Wilber on Dec 31, 2019

X86 and cuda compile simultaneously cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo -DWITH_MKL=ON -DLITE_WITH_CUDA=ON -DWITH_MKLDNN=OFF -DLITE_WITH_X86=ON -DLITE_WITH_PROFILE=OFF -DWITH_LITE=OFF -DLITE_WITH_LIGHT_WEIGHT_FRAMEWORK=OFF -DWITH_PYTHON=OFF -DWITH_TESTING=ON -DLITE_WITH_ARM=OFF -DLITE_ON_TINY_PUBLISH=OFF -DCUDNN_ROOT=/usr/local/cudnn/ -DLITE_BUILD_EXTRA=ON (#2708)

x86 and cuda compile simultaneously

[XPU] bn unit test (#2706) · bc6d5adc
Committed by zhupengyang on Dec 31, 2019
```
test=develop
```

[LITE][NPU][XPU] Refine the registration and implementation of op bridges (#2700) · a29c84a2

Committed by hong19860320 on Dec 31, 2019

* Fix the compile error that occurs when specifying the ddk_root path and building for Huawei NPU.

* Refine the registration of op bridges and make it similar to the registration of op and kernel.

* Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor.

* Add the unit test of the scale and softmax op bridge for NPU.

26 Dec, 2019 1 commit
- [XPU] mul unittest (#2676) · 6bce0133
  Committed by zhupengyang on Dec 26, 2019
```
test=develop
```
25 Dec, 2019 2 commits

fix op inputs and outputs type (#2647) · 168ce9a9
Committed by juncaipeng on Dec 25, 2019
```
* fix op inputs and outputs type, test=develop
```

[X86] Polish the implementation of fc and improve the unittest (#2656) · 28481458

Committed by Yiqun Liu on Dec 25, 2019

* Remove GEMM padding in fc_compute.
test=develop

* Write a common ParallelFor function to run the for loop in parallel.

* Add the GEMM padding code back in fc.

* Refine the code of fc when padding_weight is false to avoid defining a temporary Tensor.

* Refine the unit test of fc and add test cases for padding and parallel.
test=develop

* Enable more test cases in common fc unittest, including padding and parallel for x86 target.

* Remove the fc test under kernels/x86.
test=develop

* Disable relu in test of fc for non-x86 target.
test=develop

* Change the eps of arm.
test=develop
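
The "common ParallelFor function" mentioned in the list above splits an index range into chunks and runs a loop body over each chunk concurrently. The following is a minimal Python sketch of that idea only; the actual Paddle-Lite helper is C++ and its signature is not shown in this log, so the name `parallel_for` and its parameters are assumptions made for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def parallel_for(begin, end, body, num_threads=4):
    """Call body(i) for every i in [begin, end) using a small thread pool.

    Hypothetical helper: the range is cut into contiguous chunks, one task per chunk.
    """
    count = end - begin
    if count <= 0:
        return
    num_threads = max(1, min(num_threads, count))
    chunk = (count + num_threads - 1) // num_threads  # ceiling division

    def run_chunk(chunk_begin):
        # Each task handles one contiguous slice of the index range.
        for i in range(chunk_begin, min(chunk_begin + chunk, end)):
            body(i)

    with ThreadPoolExecutor(max_workers=num_threads) as pool:
        # list() waits for all chunks and re-raises any worker exception.
        list(pool.map(run_chunk, range(begin, end, chunk)))


# Usage: every index writes to its own slot, so no locking is needed.
out = [0] * 10


def square_into_out(i):
    out[i] = i * i


parallel_for(0, 10, square_into_out, num_threads=3)
```

Chunking by contiguous ranges keeps each worker on neighbouring elements, which is the usual choice when the loop body touches adjacent memory.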

24 Dec, 2019 5 commits
- [XPU] matmul bridge and unit test (#2666) · d345a7fc
  Committed by zhupengyang on Dec 24, 2019
- [LITE][XPU] Fix dropout op bridge and unit test for BERT (#2665) · d444ecbf
  Committed by hong19860320 on Dec 24, 2019
- [LITE][NPU][XPU] Support multiple types for XPU and NPU op bridges (#2646) · 05da0c72
  Committed by hong19860320 on Dec 24, 2019
```
* Support multiple types for XPU and NPU op bridges

* Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT

* Fix the definition of lookup_table kernel for X86
```
- [XPU] add dropout bridge and unit test (#2650) · d904c9dd
  Committed by zhupengyang on Dec 24, 2019
```
test=develop
```
- [XPU] elementwise_add, softmax unit test (#2653) · 64d01cb9
  Committed by zhupengyang on Dec 24, 2019
```
* [XPU] elementwise_add unit test

* [XPU] softmax unit test

test=develop
```
23 Dec, 2019 1 commit
- [ARM] add grid_sampler op and ut, test=develop (#2598) · 3723451b
  Committed by yiicy on Dec 23, 2019
21 Dec, 2019 1 commit
- [XPU] add layer_norm bridge and unit test (#2640) · 4dd6a4b8
  Committed by zhupengyang on Dec 21, 2019
```
test=develop
```
20 Dec, 2019 2 commits
- [XPU] add reshape bridge and unit test (#2621) · a13c592d
  Committed by zhupengyang on Dec 20, 2019
```
test=develop
```
- [XPU] add transpose bridge and unit test (#2630) · b53ece7a
  Committed by zhupengyang on Dec 20, 2019
```
* [XPU] add transpose bridge and unit test

test=develop
```
19 Dec, 2019 1 commit
- [ARM] change global pooling choose kernel policy, test=develop (#2602) · 49f03648
  Committed by yiicy on Dec 19, 2019
```
* [ARM] change global pooling choose kernel policy, test=develop
```
18 Dec, 2019 1 commit
- Support Mask RCNN2 (#2588) · d1b7aec5
  Committed by juncaipeng on Dec 18, 2019
```
* Support Mask RCNN2 (#2588)
```
17 Dec, 2019 1 commit
- [lite][arm] add conv+relu6/leakyRelu fusion (#2599) · 3455ab0a
  Committed by HappyAngel on Dec 17, 2019
13 Dec, 2019 1 commit
- [LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at... · d5434aa2
  Committed by hong19860320 on Dec 13, 2019
```
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
```
10 Dec, 2019 1 commit
- [ARM] add instance norm op and ut, test=develop (#2578) · 9a3552db
  Committed by yiicy on Dec 10, 2019
07 Dec, 2019 1 commit

Support mask_rcnn (#2484) · c2f72cb3

Committed by juncaipeng on Dec 07, 2019

* add arm split lod tensor, test=develop

* add arm merge lod tensor, test=develop

* update split merge lod tensor, test=develop

* add reduce_prob op, test=develop

* support mask_rcnn succeed, test=develop

04 Dec, 2019 1 commit
- refactor profile tools, test=develop (#2536) · 8a634b71
  Committed by 石晓伟 on Dec 04, 2019
28 Nov, 2019 1 commit
- [cherry-pick][ARM] conv_transpose operator support padding_algorithm, test=develop (#2500) · 5fac0949
  Committed by yiicy on Nov 28, 2019
27 Nov, 2019 1 commit
- fill_constant op support param shape can be tensor or tensorlist, test=develop (#2459) · 89df8f01
  Committed by myq406450149 on Nov 27, 2019
```
* fill_constant can support shape is tensor or tensorlist
```
22 Nov, 2019 4 commits

update conv 2-pad to 4-pad (#2404) · 820eb6d4

Committed by HappyAngel on Nov 22, 2019

* fix conv 2-pad to 4-pad

* fix compute conv shape

* fix pad, test=develop

* change conv_depthwise_3x3s1_fp.cc name to conv3x3s1p01_depthwise_fp32.cc to distinguish between conv3x3s1_depthwise_fp32.cc

* delete printf note in conv3x3s1, test=develop

* delete printf note, test=develop

* delete gem_sdot.h, test=develop

it is copied from __gemm_sdot_meta_.h

* update compute padding, test=develop

* fix padding size, must be 2 or 4. test=develop

* fix format in operators/conv_op.cc, test=develop

* change #if 0 to #if 1, test=develop

* put 2-pad to 4-pad in AttachImpl, test=develop

* fix clang-format error in tests/math/conv_compute_test, test=develop

* fix x86 test result error, test=develop

* add asymmetric padding test case in lite/tests/math/conv_compute.cc, test=develop

* change paddings type to support dynamic modification, test=develop

* fix x86 build error in conv_compute_test, test=develop

* fix opencl build error, test=develop

* fix opencl build error, test=develop

* fix opencl/conv_compute build error, test=develop

* fix opencl/conv_compute build error, test=develop

* fix format in kernels/opencl/conv_compute_test, test=develop

* fix build error, test=develop

fix build error in kernels/x86/conv_compute.h
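
The "2-pad to 4-pad" change in the list above can be pictured as expanding a symmetric `[pad_h, pad_w]` pair into four per-side values, which is what makes asymmetric padding expressible. The sketch below is a Python illustration only, not the code from `operators/conv_op.cc`; the helper names and the `[top, bottom, left, right]` ordering are assumptions.

```python
def expand_paddings(paddings):
    """Return a 4-element padding list from a 2- or 4-element one.

    Assumed layout of the result: [top, bottom, left, right].
    """
    if len(paddings) == 4:
        return list(paddings)
    if len(paddings) == 2:
        pad_h, pad_w = paddings
        # Symmetric padding: both sides of each axis get the same value.
        return [pad_h, pad_h, pad_w, pad_w]
    raise ValueError("paddings size must be 2 or 4, got %d" % len(paddings))


def conv_out_size(in_size, kernel, pad_before, pad_after, stride=1, dilation=1):
    """Standard convolution output size along one axis, with per-side padding."""
    effective_kernel = dilation * (kernel - 1) + 1
    return (in_size + pad_before + pad_after - effective_kernel) // stride + 1


# Usage: a 3x3 kernel on a 5x5 input with symmetric padding of 1 keeps the size.
top, bottom, left, right = expand_paddings([1, 1])
out_h = conv_out_size(5, 3, top, bottom)
out_w = conv_out_size(5, 3, left, right)
```

Keeping the two sides of each axis separate is what lets the output-shape computation handle cases such as padding only the top and left edges.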


add NHWC NCHW transform, test=develop (#2381) · 6b3c341f

Committed by HappyAngel on Nov 22, 2019

* add nhwc to nchw

* add layout in funcs

* change layout as extra, test=develop

* change make, test=develop

* use template class method to update layout NCHW and NHWC transform, test=develop

* fix cmake error, set layout to extra, test=develop

* fix test_layout_compute_arm test, it's extra

* layout is extra, test=develop

* fix error in kernels/arm/layout_comput.cc when register kernel, DataLayout must be NCHW, test=develop

* delete extra note, test=develop

* delete extra test

* delete layout_test, it's in tests/math/layout_comput_test, test=develop

* delete extra test, test=develop
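
For readers unfamiliar with the NHWC and NCHW layouts named in this commit, the transform is a pure index permutation over a flat buffer. The following Python sketch shows the index arithmetic only; it is not the ARM kernel from this commit, and the function names are hypothetical.

```python
def nhwc_to_nchw(data, n, h, w, c):
    """Reorder a flat NHWC buffer into a flat NCHW buffer."""
    out = [0] * (n * c * h * w)
    for ni in range(n):
        for hi in range(h):
            for wi in range(w):
                for ci in range(c):
                    src = ((ni * h + hi) * w + wi) * c + ci  # NHWC offset
                    dst = ((ni * c + ci) * h + hi) * w + wi  # NCHW offset
                    out[dst] = data[src]
    return out


def nchw_to_nhwc(data, n, c, h, w):
    """Reorder a flat NCHW buffer into a flat NHWC buffer (inverse of the above)."""
    out = [0] * (n * c * h * w)
    for ni in range(n):
        for ci in range(c):
            for hi in range(h):
                for wi in range(w):
                    src = ((ni * c + ci) * h + hi) * w + wi  # NCHW offset
                    dst = ((ni * h + hi) * w + wi) * c + ci  # NHWC offset
                    out[dst] = data[src]
    return out


# Usage: one 2x2 image with 3 interleaved channels.
nhwc = list(range(12))
nchw = nhwc_to_nchw(nhwc, n=1, h=2, w=2, c=3)
```

In NCHW each channel becomes a contiguous plane, so the values of channel 0 end up first, followed by channel 1 and channel 2.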

[ARM] add sgemmc4 common and small kernel, support for winograd, test=develop (#2471) · 66d2ae25
Committed by yiicy on Nov 22, 2019
```
* unfinish sgemmc4

* finish armv8 sgemmc4

* arm add sgemmc4 with deal with remain

* [ARM] add sgemmc4 small kernel, test=develop
```

update pooling 2-padding to 4-padding (#2410) · a7f7d49b

Committed by HappyAngel on Nov 22, 2019

* fix pooling bug and speed

* fix build error

* delete VLOG in pool, test=develop

* add openmp, test=develop

* fix lite/kernels/arm/pool_compute_test basic_pooling compute error bug, test=develop

* update pooling 2-pad to 4-pad, test=develop

* fix 2-pad to 4-pad in operators/pool_op.h, AttachKernel will set param, so 2-pad to 4-pad funcs should be put in AttachKernel. test=develop

* put 2-pad to 4-pad in AttachImpl, test=develop

* according to reviews, fix some format error. test=develop

* fix format error, add (). test=develop

* change paddings type to support dynamic modification, test=develop

* update padding type in other devices, test=develop

* fix x86 build error on shared_ptr, test=develop

* fix format in operators pool_op.cc, test=develop

20 Nov, 2019 1 commit
- [ARM] sgemv support transA, test=develop (#2453) · dde12f0d
  Committed by yiicy on Nov 20, 2019
```
* [ARM] sgemv support transA, test=develop

* add sgemv ut, test=develop
```
19 Nov, 2019 1 commit
- fix lrn param, align to fluid, test=develop (#2452) · 94255f6c
  Committed by yiicy on Nov 19, 2019
16 Nov, 2019 1 commit
- [LITE][X86] Add search_aligned_mat_mul and search_seq_fc op for X86 (#2428) · 78f76834
  Committed by hong19860320 on Nov 16, 2019
13 Nov, 2019 2 commits
- Update the ops to fluid (#2406) · 518a87ef
  Committed by liu zhengxi on Nov 13, 2019
```
align the lite nearest, bilinear op to fluid on arm and cuda
```
- fix error for AxesTensorList in unsqueeze op, test=develop (#2411) · f4e06650
  Committed by juncaipeng on Nov 13, 2019
```
* fix error for AxesTensorList in unsqueeze op
```
12 Nov, 2019 2 commits

[LITE][ARM]add cv image process (#2402) · 0ade1bc5

Committed by HappyAngel on Nov 12, 2019

* add cv image process

* fix arm linux build error

* add LITE_WITH_CV define to make cv, test=develop

* fix cv format, and add description in utils/cv

* delete some meaningless comments, test=develop

* set LITE_WITH_CV=OFF in build.sh, test=develop

* delete cv_enum.h in utils/cv, move the contents of cv_enum.h to paddle_image_preprocess.h, test=develop

* according to reviews to redefine paddle_image_preprocess.h, test=develop

* add detailed note of flipParam, test=develop

* fix format in paddle_image_preprocess.h, test=develop

* fix error when build x86. test=develop

* lite_with_X86 does not contain lite_with_cv

Upgrade concat and unsqueeze, test=develop (#2378) · 26470600
Committed by juncaipeng on Nov 12, 2019
```
* update concat and unsqueeze, test=develop
```
11 Nov, 2019 1 commit

fix pool bug and speed, test=develop (#2385) · d197de00

Committed by HappyAngel on Nov 11, 2019

* fix pooling bug and speed

* fix build error

* delete VLOG in pool, test=develop

* add openmp, test=develop

* fix lite/kernels/arm/pool_compute_test basic_pooling compute error bug, test=develop

07 Nov, 2019 1 commit

check arm kernels type to make sure all_library_links work normally (#2386) · 916e80c2

Committed by huzhiqiang on Nov 06, 2019

We changed 11 arm_kernels to the extra type in #2347, which caused test_compiling to fail. In this PR, we move their 11 related arm_kernel_test under build_extra=ON.

06 Nov, 2019 2 commits

update slice and reshape op and test on one op fake model test=develop (#2377) · e74609b7

Committed by Wilber on Nov 06, 2019

update reshape op to support multiple input types of shape.
priority: input(ShapeTensor) > input(Shape) > attr(shape)

update slice op to support multiple input types of starts and ends.
priority: input(StartsTensor) > input(StartsTensorList) > attr(starts)
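
The two priority rules above amount to "take the first source that was actually provided". A minimal Python sketch of that lookup follows; it is not the Paddle-Lite C++ code, and the names `pick_by_priority`, `resolve_reshape_shape`, and `resolve_slice_starts` are hypothetical.

```python
def pick_by_priority(candidates):
    """Return (source_name, value) for the first candidate whose value is set.

    `candidates` is a list of (source_name, value) pairs, highest priority first.
    """
    for name, value in candidates:
        if value is not None:
            return name, value
    raise ValueError("no source was provided")


def resolve_reshape_shape(shape_tensor=None, shape_input=None, shape_attr=None):
    # Priority from the commit message: ShapeTensor > Shape > attr(shape).
    return pick_by_priority([
        ("input(ShapeTensor)", shape_tensor),
        ("input(Shape)", shape_input),
        ("attr(shape)", shape_attr),
    ])


def resolve_slice_starts(starts_tensor=None, starts_tensor_list=None, starts_attr=None):
    # Priority from the commit message: StartsTensor > StartsTensorList > attr(starts).
    return pick_by_priority([
        ("input(StartsTensor)", starts_tensor),
        ("input(StartsTensorList)", starts_tensor_list),
        ("attr(starts)", starts_attr),
    ])


# Usage: the attribute is ignored because a higher-priority input is present.
source, shape = resolve_reshape_shape(shape_input=[2, 3], shape_attr=[6])
```

Checking against `None` rather than truthiness means an explicitly provided empty list still counts as "provided"; whether the real op treats empty inputs that way is not stated in the commit message.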


fix fill_constant kernel bug test=develop (#2376) · 9d97d56e

Committed by Wilber on Nov 06, 2019

The fill_constant kernel was registered only for the float type, so only float data was produced, which is obviously a bug.

Now, data is produced based on the data type attr.

By the way, fix the cast kernel bug.
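
The fix described above, producing data according to the data type attribute instead of always emitting float, can be illustrated with a small dtype dispatch. This is a Python sketch using the standard `array` module, not the Paddle-Lite kernel; the string dtype names and the `fill_constant` signature here are assumptions made for the example.

```python
from array import array

# Assumed mapping from dtype names to `array` typecodes.
# 'f' is a C float, 'i' a C int (usually 32-bit), 'q' a 64-bit integer.
_TYPECODES = {"float32": "f", "int32": "i", "int64": "q"}


def fill_constant(shape, value, dtype="float32"):
    """Return a flat typed buffer of prod(shape) elements, all equal to value."""
    if dtype not in _TYPECODES:
        raise ValueError("unsupported dtype: %s" % dtype)
    typecode = _TYPECODES[dtype]
    numel = 1
    for dim in shape:
        numel *= dim
    # Cast the fill value to match the requested element type.
    cast = float if typecode == "f" else int
    return array(typecode, [cast(value)] * numel)


# Usage: the same call produces integer or float storage depending on dtype.
ints = fill_constant([2, 3], 1, dtype="int64")
floats = fill_constant([2], 0.5)
```

The bug described in the commit corresponds to ignoring `dtype` and always taking the float branch.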
