提交 · f781ab08166c1f7958084260058564fc1196b0cc · Crayon鑫 / Paddle

25 12月, 2020 1 次提交

由 tangwei12 提交于 12月 25, 2020

* add ps table (#29463)

* add ps table

Change-Id: I468a04bd071d21ff52654926fcf4d5f3da19e178

* add service (#29560)

* add service, remove ut on mac

* fix heter_profiler & add heter stop method

* fix code style

* merge pscore

Change-Id: Ie7f60d1cdde6755a0c29db26863c6283e9843d57

* fix cmake

Change-Id: I6773509a7b4ca79139ecc40b7bf3eb318ceff8bb

* fix conflit

Change-Id: I35575be0c96a8520f9d756ea7f1ff0b904a165ba

* fix conflit

Change-Id: Ic926ea0b0d67803226d51241397ba3b510226bfa

f781ab08

17 12月, 2020 1 次提交

update activation op on kunlun (#29577) (#29717) · e82efc0c

由 TTerror 提交于 12月 17, 2020

* fix expand && concat/transpose to new api

* update xpu_header

* update activation op on kunlun

* update activation op on kunlun

* update activation op on kunlun

* update activation op on kunlun

* update activation op on kunlun

* add nearest_interp on kunlun

* update error message

e82efc0c

16 12月, 2020 1 次提交
- Q
  support roi_align & affine_channel for kunlun (#29561) (#29657) · d82b0300
  由 QingshuChen 提交于 12月 16, 2020
```
* support roi_align & affine_channel for kunlun

* minor
```
  d82b0300
15 12月, 2020 1 次提交

cherry-pick kunlun PR: 29458, 29539 (#29583) · 03ddf690

由 QingshuChen 提交于 12月 15, 2020

* support mobilenet for kunlun (#29458)

* add xpu ops for training transformer in kunlun (#29539)

* 1.fix matmul bug 2. add one hot

* add xpu error msg
Co-authored-by: Nprocr <procrboo@gmail.com>
Co-authored-by: Ntaixiurong <taixiurong@126.com>

03ddf690

08 12月, 2020 1 次提交

[2.0 rc1/cherrypick] cherry-pick kunlun PR:29234/29229/29293/29367/29280/29448 (#29466) · 6bfc5721

由 liuyuhui 提交于 12月 08, 2020

* add deformable_conv op on xpu (#29234)

* rebase develop

* update deformable_conv op on xpu

* update deformable_conv op on xpu

* update kunlun conv2d/softmax/elementwise implemetation (#29229)

* update conv2d & softmax to new xpu api
* test=kunlun

* remove useless comments
* test=kunlun

* remote softmax xpu op
* test=kunlun

* update kunlun softmax
* test=kunlun

* update xpu unitest
* test=kunlun

* fix elementwise_grad bug for kunlun
*test=kunlun

* support global pooling for kunlun (#29293)

* test=kunlun

* update reduce_sum op on xpu (#29367)

* update reduce_sum op on xpu

* update reduce_sum op on xpu

* support running on xpu

* fix expand/uniform_random && concat/transpose to new api on xpu (#29280)

* fix expand && concat/transpose to new api

* update uniform_random_op

* update xpu_header

* 1. fix elementwise ops'bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op (#29448)
Co-authored-by: Nroot <root@bjhw-sys-rpm0223.bjhw.baidu.com>
Co-authored-by: N卖鱼的哲学 <tangzhiyi11@users.noreply.github.com>
Co-authored-by: NQingshuChen <qingshu.chen714@gmail.com>
Co-authored-by: Ntaixiurong <taixiurong@126.com>
Co-authored-by: Nroot <root@bjhw-sys-rpm0223.bjhw.baidu.com>

6bfc5721

07 12月, 2020 2 次提交
- W
  [Release/2.0 rc1] fix cmake error message. (#29420) · 401cc1e0
  由 Wilber 提交于 12月 07, 2020
```
* update  lite tag.

* fix cmake error log.
```
  401cc1e0
- W
  
  update lite tag. (#29394) · 07a7cd4b
  由 Wilber 提交于 12月 07, 2020
  
  07a7cd4b
05 12月, 2020 1 次提交
- W
  
  update cmake for FT openbals version (#29383) · 4a8aef49
  由 Wilber 提交于 12月 05, 2020
  
  4a8aef49
02 12月, 2020 1 次提交

add compile option WITH_TENSORRT (#29208) (#29264) · f5afeef1

由 Shang Zhizhou 提交于 12月 02, 2020

* add compile option WITH_TENSORRT

* add WITH_TENSORRT to ci paddle_buils.sh

* add WITH_TENSORRT to paddle_build.sh

* change FATAL to WARNING when TensorRT is not found and WITN_TENSORRT=ON, just to pass ci-py3 temporarily

f5afeef1

01 12月, 2020 1 次提交
- W
  
  revert python file coverage, delete coverage run --include, test=develop (#29230) · 2b2cd186
  由 wanghuancoder 提交于 12月 01, 2020
  
  2b2cd186
30 11月, 2020 2 次提交

W

[Lite-Subgraph] Fix compile error for lite subgraph. (#29146) · 4fec182d
由 Wilber 提交于 11月 30, 2020

4fec182d

Generate code coverage reports only for incremental files (#28508) · 0239f796

由 wanghuancoder 提交于 11月 30, 2020

* Generate code coverage reports only for incremental files, test=develop

* Generate code coverage reports only for incremental files, test=develop

* Generate code coverage reports only for incremental files, test=develop

* test for diff python file, test=develop

* fix no python diff report, test=develop

* add cc test file, test=develop

* fix bug in generic.cmake, test=develop

* for debug no cc report, test=develp

* modify compire branch form test_pr to test, test=develop

* fix bug, test=develop

* test for h file changed, test=develop

* debug for redefinition of argument optimize error, test=develop

* close -o3 for test, test=develop

* remove -o3 for test, test=develop

* remove coverage option for nvcc, test=develop

* use CMAKE_CXX_FLAGS open coverage option when header file changed, test=develop

* reopen -o3, test=develop

* remove debug code, test=develop

* remove unused code, test=develop

0239f796

27 11月, 2020 2 次提交

Z

fix CUDA 11 error on windows (#29101) · e668cb07
由 Zhou Wei 提交于 11月 27, 2020

e668cb07

detect tensorRT plugin fp16 in runtime (#27933) · b9e76a01

由 Shang Zhizhou 提交于 11月 27, 2020

* remove -DSUPPORTS_CUDA_FP16 in cuda.cmake

* comile with cuda9

* add some unittest

* notest;test=coverage

* add unittest for trt plugin swish && split

* update ernie unittest

* fix some error message

* remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter

* fix comile errror when CUDA_ARCH_NAME < Pascal"

* fix comile error

* update unittest timeout

* compile with cuda9

* update error msg

* fix code style

* add some comments

* add define IF_CUDA_ARCH_SUPPORT_FP16

* rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED

b9e76a01

24 11月, 2020 1 次提交
- Y
  
  restore timeout value (#29027) · 5cb8e17a
  由 YUNSHEN XIE 提交于 11月 24, 2020
  
  5cb8e17a
20 11月, 2020 1 次提交

add kunlun kernel: slice, slice_grad, top_k, cast. *test=kunlun (#28542) · d3d1a6b6

由 taixiurong 提交于 11月 20, 2020

* 1.add xpu slice op 2. add xpu top_k op 3.modify xpu cast to new api

* 1.add xpu slice op 2. add xpu top_k op 3.modify xpu cast to new api

d3d1a6b6

16 11月, 2020 1 次提交
- Z
  open a part of GPU unittest for windows (#28378) · 93c39779
  由 Zhou Wei 提交于 11月 16, 2020
```
* open a part of GPU unittest for windows

* open a part of GPU unittest for windows
```
  93c39779
12 11月, 2020 1 次提交
- S
  裁剪transformer模型trt支持；修复tensorRT不支持DeletePass的bug (#28517) · 8699f38d
  由 Shang Zhizhou 提交于 11月 12, 2020
```
* skip_layernorm_op done

* add unittest

* slice op convertor support trt < 6

* skip_layernorm only work in ernie
```
  8699f38d
09 11月, 2020 2 次提交
- Y
  modified timeout value on windows (#28499) · d3b2d07d
  由 YUNSHEN XIE 提交于 11月 09, 2020
```
* modified timeout value on windows

* fix some error
```
  d3b2d07d
- Y
  exec ut no more than 15s 2 (#28441) · 72c78e4d
  由 YUNSHEN XIE 提交于 11月 09, 2020
```
* exec ut no more than 15s 2

* fix for ut test_inplace_addto_strategy timeout
```
  72c78e4d
06 11月, 2020 1 次提交
- Q
  fix batch_norm_xpu bug & remove xpusimulator dependence (#28430) · 6bba8e57
  由 QingshuChen 提交于 11月 06, 2020
```
*test=kunlun
```
  6bba8e57
04 11月, 2020 3 次提交
- W
  
  [sw] Update compile error for sw (#28419) · 648b92c0
  由 Wilber 提交于 11月 04, 2020
  
  648b92c0
- 石
  
  update the cmake cmd, test=develop (#28344) · 0d25d55a
  由石晓伟提交于 11月 04, 2020
  
  0d25d55a
- W
  
  refine (#28366) · 337d3832
  由 wangchaochaohu 提交于 11月 04, 2020
  
  337d3832
03 11月, 2020 2 次提交
- W
  
  Paddle support compile on sw (#27858) · 09fd2b2a
  由 Wilber 提交于 11月 03, 2020
  
  09fd2b2a
- Z
  
  fix compile out of memory temporary (#28346) · f41104ef
  由 Zhou Wei 提交于 11月 03, 2020
  
  f41104ef
30 10月, 2020 1 次提交
- 石
  update the version of pybind, test=develop (#28284) · d9b5f126
  由石晓伟提交于 10月 30, 2020
```
* update version pybind to v2.4.3, test=develop

* update unittests, test=develop
```
  d9b5f126
26 10月, 2020 1 次提交
- X
  
  add git mirror url to speed up clone (#28241) · d2522197
  由 XiaoguangHu 提交于 10月 26, 2020
  
  d2522197
23 10月, 2020 1 次提交
- Z
  
  fix CUDA9 error due to BuildCustomizations (#28222) · 4877bd59
  由 Zhou Wei 提交于 10月 23, 2020
  
  4877bd59
21 10月, 2020 3 次提交
- W
  
  [lite-xpu-subgraph] Fix xpu compile and test xpu ci. (#27932) · f935ca8a
  由 Wilber 提交于 10月 21, 2020
  
  f935ca8a
- Z
  
  fix Automatic GPU detection failed on windows (#28148) · 68c473e3
  由 Zhou Wei 提交于 10月 21, 2020
  
  68c473e3
- Z
  
  fix dynamic_loader more safe and error message on windows (#28117) · 5d700021
  由 Zhou Wei 提交于 10月 21, 2020
  
  5d700021
16 10月, 2020 2 次提交
- L
  
  add a comment, test=document_fix (#28008) · ff02173d
  由 lilong12 提交于 10月 16, 2020
  
  ff02173d
- L
  build gloo from source code instead of using the pre-compiled library (#27930) · afce32f3
  由 lilong12 提交于 10月 16, 2020
```
* build gloo from source code , test=develop
```
  afce32f3
12 10月, 2020 2 次提交
- W
  
  Lite subgraph support arm cpu. (#27827) · 9005c5a2
  由 Wilber 提交于 10月 12, 2020
  
  9005c5a2
- add musl option (#27798) · 6335e6a0
  由 chen.zhiyu 提交于 10月 12, 2020
  
  6335e6a0
11 10月, 2020 1 次提交
- W
  
  update for windows compile. (#27813) · a2d08aa9
  由 Wilber 提交于 10月 11, 2020
  
  a2d08aa9
01 10月, 2020 1 次提交
- W
  
  Added support for quantization of fusion_gru (#27518) · 966447e3
  由 Wojciech Uss 提交于 10月 01, 2020
  
  966447e3
27 9月, 2020 2 次提交

add support to float64 input of warpctc op. (#27399) · 1501a80f

由 Li Fuchen 提交于 9月 27, 2020

* add float64 input to ctc_loss

* modified error message of  warpctc

* update repo and tag of warpctc

* add test for warpctc with float64 input

* modified warpctc.cmake to make sure build always

* resolved sample code bug of warpctc

* add core.ops in warpctc dygraph

* fix a bug of test

1501a80f

support elementwise add, activation, matmul on Baidu Kunlun (#27143) · 6b727e08

由 QingshuChen 提交于 9月 27, 2020

* support elementwise add, activation, matmul on Baidu Kunlun
* test=kunlun

* minor
* test=kunlun

* reconstuct the xpu directory
* test=kunlun

* minor
* test=kunlun

* minor
* test=kunlun

* minor
* test=kunlun

* minor
* test=kunlun

* minor
* test=kunlun

6b727e08

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致