Commits · 40eb11c1c6512fd5464c19f730b2dde07ccaddac · PaddlePaddle / Paddle-Lite

22 Sep, 2020 · 1 commit
- [Framework] Add method for specifying initial size of `workspace_` (#4399) · 40eb11c1
  Committed by huzhiqiang on Sep 22, 2020
15 Sep, 2020 · 1 commit
- [cherry-pick][Core] Fix the exceptions handling for android+armv8+gcc (#4285) (#4318) · d3dcaa75
  Committed by hong19860320 on Sep 15, 2020
30 Jul, 2020 · 2 commits
- [BugFix][OPENCL] Fix initialization sequence of opencl backend valid API.... · 0e5e49c0
  Committed by ysh329 on Jul 30, 2020
  ```
  [BugFix][OPENCL] Fix initialization sequence of opencl backend valid API. test=develop (#4003) (#4021)

  * fix opencl backend. test=develop
  ```
- fix conflict and cherry pick 1d0f70ae : add opencl tune api. test=develop (#4020) · dd3150a4
  Committed by ysh329 on Jul 30, 2020
23 Jul, 2020 · 1 commit

[Cherry-pick][Core] Add the graph optimization of subblocks for transformer model (#3947) (#3979) · c56bf0d8

Committed by hong19860320 on Jul 23, 2020

* [Cherry-pick][Core] Add the graph optimization of subblocks for transformer model (#3947)
test=develop
* [Core][ARM] Fix beam_search, eltwise_mul supports broadcast and int64_t data type, add print op and kernel, add exception
test=develop

* Fix the dims of parent idx of the arm kernel of beam_search op

* elementwise_mul supports int64_t data type with broadcasting

* Add print op and kernel for debugging

* Support throwing the exception when the internal error occurs

* Refine while and conditional_block op kernel

* Support the graph optimization on subblocks

* Pass program_desc and block_idx into the kernel of the control flow ops (while/conditional_block/subgraph) and create the RuntimeProgram online, which makes it possible to call the control flow ops recursively

* Add unit test for masked transformer model

c56bf0d8

19 Jul, 2020 · 1 commit
- [cherry-pick][OPENCL][API] add opencl valid api for device. test=develop (#3951) (#3960) · 37a01383
  Committed by ysh329 on Jul 19, 2020
  ```
  * [OPENCL][API] add opencl valid api for device. test=develop (#3951)
  ```
17 Jul, 2020 · 1 commit
- [cherry-pick][OPENCL] remove conv redundant's for opencl kernel. test… (#3938) · 273b1fa2
  Committed by ysh329 on Jul 17, 2020
  ```
  * [cherry-pick][OPENCL] remove conv redundant's for opencl kernel. test=develop
  Co-authored-by: xiebaiyuan <xiebaiyuan@qq.com>
  ```
13 Jul, 2020 · 1 commit
- [NPU] enhance cache offline model, test=develop (#3805) (#3931) · ac897177
  Committed by Qi Li on Jul 13, 2020
  ```
  * [NPU] enhance cache offline model, test=develop
  ```
27 May, 2020 · 1 commit
- [Opt] Expand the `precisions\data_layout\targets` types supported by current opt (#3715) · ff57450d
  Committed by huzhiqiang on May 27, 2020
22 May, 2020 · 1 commit
- [cherry-pick] [BUG FIX] Fix the issue that paddle-lite python lib can not... · de017be4
  Committed by huzhiqiang on May 22, 2020
  ```
  [cherry-pick] [BUG FIX] Fix the issue that paddle-lite python lib can not inference on mac env (#3684)
  ```
14 May, 2020 · 1 commit
- [Framework][Internal] Add set_passes_internal inference for CxxConfig (#3623) · 1caba6ff
  Committed by huzhiqiang on May 14, 2020
13 May, 2020 · 1 commit
- [Opt][Python][Framework] Add opt scripts for python installing package (#3615) · 583ad531
  Committed by huzhiqiang on May 13, 2020
08 May, 2020 · 1 commit
- [BUGFIX][Hash][cherry-pick] Improve the implementation of hash_combine (#3582) · b7e00750
  Committed by huzhiqiang on May 08, 2020
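
For readers unfamiliar with the term in the commit above, `hash_combine` usually means folding several hash values into one running seed. The sketch below shows the widely used Boost-style mixing step. It is an illustration of the general technique only, not the code changed in b7e00750; the names `HashCombine` and `HashNameAndIndex` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <string>

// Boost-style mixing step: folds one already-computed hash into a seed.
// The shifts make the result depend on the order in which values are combined.
inline std::size_t HashCombine(std::size_t seed, std::size_t value_hash) {
  return seed ^ (value_hash + static_cast<std::size_t>(0x9e3779b9) +
                 (seed << 6) + (seed >> 2));
}

// Hypothetical helper: builds one hash from a name and a non-negative index.
inline std::size_t HashNameAndIndex(const std::string& name, int index) {
  assert(index >= 0);  // assumption made for this sketch only
  std::size_t seed = 0;
  seed = HashCombine(seed, std::hash<std::string>()(name));
  seed = HashCombine(seed, std::hash<int>()(index));
  return seed;
}
```

Because each step mixes the previous seed back in, combining the same values in a different order generally yields a different result, which is what distinguishes this from a plain XOR of the individual hashes.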
06 May, 2020 · 1 commit
- fix python whl of armlinux (#3546) (#3547) · 6ce8921b
  Committed by zhupengyang on May 06, 2020
01 May, 2020 · 1 commit
- [Compile] change compiling option `SHUTDOWN_LOG` into `WITH_LOG` · aaccd2f6
  Committed by huzhiqiang on May 01, 2020
27 Apr, 2020 · 2 commits
- [cherry-pick] cherry-pick x86/python code from develop to release/v2.6 (#3510) · 1b384f75
  Committed by huzhiqiang on Apr 27, 2020
- [Core] Fix stack overflow in STL::ostream (#3503) (#3504) · 547a8be5
  Committed by hong19860320 on Apr 27, 2020
24 Apr, 2020 · 2 commits
- [BUG FIX][PYTHON] fix the issue that python lib can not compile properly #3470 (#3484) · 16a70825
  Committed by huzhiqiang on Apr 24, 2020
- update cuda demo. test=develop test=release/v2.6.0 (#3475) · 8c1afb7c
  Committed by Wilber on Apr 24, 2020
  ```
  Modify cxx&python demo of Cuda backend
  ```
22 Apr, 2020 · 3 commits
- [LITE][OPENCL] Fix Places of CXX Config for OpenCL. test=develop (#3462) · b8234efb
  Committed by Yuan Shuai on Apr 22, 2020
  ```
  * Fix Places of CXX Config for OpenCL. test=develop

  * fix shared ptr as unique ptr. test=develop
  ```
- Remove the dependence of paddle_lite_factory_helper.h (#3458) · 8df9f69a
  Committed by silingtong123 on Apr 22, 2020
- [XPU] Add more XPU op kernels (#3457) · d5a6a1e5
  Committed by Cwndmiao on Apr 22, 2020
15 Apr, 2020 · 4 commits
- [OPT] Add RK and MTK support for model optimize tool (#3410) · 139808e9
  Committed by hong19860320 on Apr 15, 2020
- refactor(*): reduce Wsign-compare warning (#3391) · 2997b937
  Committed by MaxwellDing on Apr 15, 2020
  ```
  refactor(*): reduce Wsign-compare warning
  ```
- [APU] Add MTK APU backend (#3407) · 355d080b
  Committed by hong19860320 on Apr 15, 2020
- [x86 test_model_bin] make test_model_bin executable on x86 platform (#3405) · 910b1cea
  Committed by huzhiqiang on Apr 15, 2020
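
The `Wsign-compare` refactor (2997b937) in the list above targets the GCC/Clang warning raised when a signed integer is compared with an unsigned one, most often a loop counter against `size()`. The following is a minimal sketch of the usual fixes, not code taken from that commit; the helper names `Product`, `IndexInRange` and `DimAt` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Warning-prone form: `for (int i = 0; i < dims.size(); ++i)` compares a
// signed int with the unsigned value returned by size().
// Fix 1: give the loop index the same unsigned type as size().
inline long long Product(const std::vector<int>& dims) {
  long long result = 1;
  for (std::size_t i = 0; i < dims.size(); ++i) {
    result *= dims[i];
  }
  return result;
}

// Fix 2: when the index must stay signed, reject negatives first and then
// cast explicitly, so the comparison is unsigned on both sides.
inline bool IndexInRange(int index, const std::vector<int>& dims) {
  return index >= 0 && static_cast<std::size_t>(index) < dims.size();
}

// Example use of the range check before indexing.
inline int DimAt(const std::vector<int>& dims, int index) {
  assert(IndexInRange(index, dims));
  return dims[static_cast<std::size_t>(index)];
}
```

Checking `index >= 0` before the cast matters: casting a negative value to `std::size_t` produces a very large number rather than an error.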
14 Apr, 2020 · 4 commits
- [windows compile]support inference library compiling on windows (#3403) · 58b2d7dd
  Committed by silingtong123 on Apr 14, 2020
- [RKNPU] Add Rockchip NPU backend (#3382) · fbe0799e
  Committed by airockchip on Apr 14, 2020
- [mac python] mac env supports outputs of python lib and installer (#3394) · 3190c354
  Committed by huzhiqiang on Apr 14, 2020
- Optimize matmul for size(x_dims)=2 size(y_dims)>2 (#3400) · 48f09caa
  Committed by cc on Apr 14, 2020
  ```
  * Optimize matmul for size(x_dims)=2  size(y_dims)>2
  ```
13 Apr, 2020 · 3 commits
- [NPU] add shape, gather, lookup_table bridge (#3197) · dcf6acce
  Committed by zhupengyang on Apr 13, 2020
  ```
  * [NPU] add shape bridge

  move shape arm kernel to host

  * enhance compare arm kernel

  * [NPU] add gather op bridge

  * enable reshape arm ut

  * [NPU] add lookup_table bridge
  ```
- Fix gather and concat, add abs op, test=develop (#3395) · ae3ebea5
  Committed by cc on Apr 13, 2020
- lite cuda support exec multi-stream. (#2949) · 4a7284f9
  Committed by Wilber on Apr 13, 2020
  ```
  lite cuda support exec multi-stream
  ```
11 Apr, 2020 · 2 commits

[LITE][BM] fix reshape infer shape issue, test=develop (#3384) · e55542dc
Committed by Santa An on Apr 11, 2020
```
* [LITE][BM] fix reshape infer shape issue, test=develop

* [LITE][BM] with testing=on, test=develop
```
e55542dc

[LITE][OPENCL] Fix OpenCL API/Backend misc. test=develop (#3376) · b0b60f4f

Committed by Yuan Shuai on Apr 11, 2020

1. clean code;
2. change `cl::Kernel` from unique to shared ptr;
3. `reset` `cl::Program` and `clear` `device_info_` in the destructor of CLRuntime;
4. remove clFlush from the destructor of CLRuntime.

b0b60f4f

10 Apr, 2020 · 2 commits
- Optimize weight quantization (#3374) · 23231af8
  Committed by cc on Apr 10, 2020
  ```
  * Optimize weight quantization, test=develop
  ```
- [LITE][OPENCL] Fix OpenCL global static resources of CXX API and Light API (#3373) · 84b08a9b
  Committed by Yuan Shuai on Apr 09, 2020
  ```
  * [LITE][OPENCL] fix OpenCL global static resources. test=develop

  * Fix Cxx and light api. test=develop
  ```
09 Apr, 2020 · 1 commit

Committed by jackzhang235 on Apr 09, 2020

[MLU] add some basic support for MLU, including related passes, kernels, gtests and some APIs in paddle_api.h
Passes: mlu_subgraph_pass, mlu_postprocess_pass
Kernels: act, batch_norm, concat, conv, elementwise, fc, interpolate, pool, scale, softmax

dc481d49

08 Apr, 2020 · 2 commits

Add hard_swish, ctc_align and reciprocal op (#3354) · 47869a59

Committed by cc on Apr 08, 2020

* Add hard_swish, ctc_align and reciprocal op, test=develop
* Move some activation ops to extra, test=develop

47869a59
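
As a reference for two of the ops named in this commit, here is a minimal scalar sketch of `hard_swish` and `reciprocal`. It assumes the Paddle operator definition `out = x * min(max(x + offset, 0), threshold) / scale` with defaults `threshold = 6`, `scale = 6`, `offset = 3`. It is not the kernel code added in 47869a59, and the function names are hypothetical.

```cpp
#include <algorithm>
#include <cassert>

// hard_swish under the assumed Paddle definition:
//   out = x * min(max(x + offset, 0), threshold) / scale
inline float HardSwish(float x, float threshold = 6.0f, float scale = 6.0f,
                       float offset = 3.0f) {
  assert(scale != 0.0f);  // scale is a divisor
  return x * std::min(std::max(x + offset, 0.0f), threshold) / scale;
}

// reciprocal: element-wise 1 / x (no special handling of x == 0 here).
inline float Reciprocal(float x) { return 1.0f / x; }
```

With the default attributes, `HardSwish` is zero for `x <= -3`, equals `x` for `x >= 3`, and follows `x * (x + 3) / 6` in between.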

[Core][XPU] Add XPU op kernels (#3274) · 99deb7d9

Committed by hong19860320 on Apr 08, 2020

* [LITE][XPU] bind xpu resnet50 kernels

* [LITE][XPU] fuse resnet50 and encoder

* [LITE][XPU] bind xpu bert kernels

* [LITE][XPU] refine xpu_resnet_fuse_pass.cc

* [LITE][XPU] add xpu stack kernel

* [LITE][XPU] add xpu slice/tanh kernel

* [LITE][XPU] refine resnet50 and encoder fusor

* [LITE][XPU] split resnet50 and multi_encoder op from subgraph_op.h

* [LITE][XPU] clean workspace

* [LITE][XPU] add build script

* [LITE][XPU] fix compilation errors

* [LITE][XPU] fix kernel matmul

* [LITE][XPU] fix kernel ewadd ewsub

* [LITE][XPU] add xpu cast kernel

* [LITE][XPU] fix kernel slice

* [LITE][XPU] switch dev by LITE_XPU_DEV env

* [LITE][XPU] eliminate useless cast op

* [LITE][XPU] add PerThread Ops

* [LITE][X86] add SequenceUnpad op and kernel

* [LITE][XPU] add LITE_WITH_XTCL option

* [LITE][X86] add SequenceConv kernel

* [LITE][XPU] fix cmake dependency

* [LITE][XPU] add xpu sigmoid kernel

* [XPU] Remove the dependencies of framework.pb.h
test=develop

Change-Id: Icfb44efb0482a6369b365b5c09017765328fc10d

* [XPU] Fix the precision of cast kernel
test=develop

Change-Id: Icb18be47d7ab490de9fb9c92eae1165f49dbf492

* [Core] Fix the compiling error when build for the target that disable XPU
test=develop

Change-Id: I38ec53f222391d3bf06b70512e6c3ad1282e4683

* [XPU] Add io_copy kernel for xpu<->arm
test=develop

Change-Id: Iec7ea066f040534285557f9948b73e6a1970aed7

* fix
test=develop

Change-Id: I4db1c93df48e22afbba904ce6c3b0babd9fda4c3

* fix target matching of type_target_cast_pass and remove the unnecessary registration of io_copy kernel
test=develop

Change-Id: I432c10c9d1064e778d43fd0d12d8cf0599252f7a

* [X86] Add the keyword 'template' to avoid the compiling errors
test=develop

Change-Id: I015d5d323adafb3884029c8287ced66c90ad931e

* Fix the build.sh for XPU and x86
test=develop

Change-Id: I7d9575243669ce02af69a8ddbd6421db31902bd6

* [XPU] Add the keyword 'template' to avoid the compiling errors
test=develop

Change-Id: I46d0b3b6861286a73ee2999934b8e185e453e749

* [XPU] Add XTCL compiling option in build.sh
test=develop

Change-Id: I8b3fd998ca5f898d5bd2e665646e3874b3b73c80

* fix namespace conflicts, test=develop

* [API][XPU] Move the XPU related APIs into CxxConfig
test=develop

Change-Id: I75ac35e8bae96bcb835683f413f01b9db45afbf9

* [API][XPU] Remove the LITE_WITH_XPU in paddle_api.h
test=develop

Change-Id: Idbd64013bdf331ad876919511c1c349332d46f93

* [API][XPU] Remove XPUSetWorkspaceL3SizePerThread and XPUSetDevPerThread
test=develop

Change-Id: I515958f56f8e129280bae61c923513cc91fb9728

* [API][Core][XPU] Refine the test case and remove the necessary modifications
test=develop

Change-Id: I1e0e2957a2f9d5f4207b06c0bc98a5ab611fee56

* [Core] Remove useless code
test=develop

Change-Id: I6293faa10424aea2836d09d85ddb6a30f7811678

* [XPU] Refine the test cases
test=develop

Change-Id: I6818fc3addf1bca5b96a7d66ee99263242e3374f

* [XPU] Remove useless scripts and code
test=develop

Change-Id: I965ba6712d3cf881d0038f0473fec27d4c1bc684

* [XPU] Use InferShapeImpl in sequence_unpad, resnet50 and multi_encoder op
test=develop

Change-Id: I5375f524d36836a394d426b4b2bc9fb44be0b59c

* test=develop

Change-Id: I42ee68c8a5e891dd0f3e95d6cfbc498be7cf1519

* test=develop

Change-Id: If679e5aa73e1368e0ee5bd5f286d2e1b4c2f354e

* [XPU] Add __xpu__ prefix to the op and graph pass name of resnet50 and multi_encoder
test=develop

Change-Id: Idb61c99b4b8429cb87665bfd6835ab4d7d263be2

* [XPU] Fix and refine the xpu fuse pass
test=develop

Change-Id: If1c5b6788d994e2809c1a00d9384685a89440907

* test=develop

Change-Id: Icfa333e322fc4351700103692c46cfcb3d4f9a89

* [XPU] Remove the dependency on xpu api for xpu fuse passes
test=develop

Change-Id: I6094b5536f58ae18bab068284b32f9bd10a2ab92

* [XPU] Move unit tests from lite/api to lite/tests/api
test=develop

Change-Id: I7ba27abb23abeffb0c95fdbbefec7ac16cdbd250

* test=develop

Change-Id: I33230c84d6c4e61bf19f46668bae2baa3ef68794

* [XPU] Refine code
test=develop

Change-Id: I37bc5b948b4927e44cd3ea2594ebe3fd7671be06

* [XPU] Add env XPU_ENABLE_XTCL to enable xpu_subgraph_pass
test=develop

Change-Id: Ifb8e07e86f307f562adaca3ce792015a6f2a2204

* [XPU] refine code
test=develop

Change-Id: I1380654b930d51ae704dbc0cd855464d9c3b5b79

* [XPU] Refine code
test=develop

Change-Id: I73285c2718ccd3612490eb2635bef4fd608c9bde

* [XPU] Add comments for the XPU APIs
test=develop

Change-Id: Ieb5015f37984f8869b90c4c625c5894bb26164fd
Co-authored-by: miaotianxiang <miaotianxiang@baidu.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>

99deb7d9