提交 · d82d59e6e731a3de4057249239c90b81360a0e01 · 机器未来 / Paddle

09 12月, 2020 2 次提交
- P
  
  support clip op trt converter (#29411) (#29496) · 4d51cd73
  由 Pei Yang 提交于 12月 09, 2020
  
  4d51cd73
- P
  
  conflict (#29498) · d5ff367b
  由 Pei Yang 提交于 12月 09, 2020
  
  d5ff367b
08 12月, 2020 3 次提交

[2.0 rc1/cherrypick] cherry-pick kunlun PR:29234/29229/29293/29367/29280/29448 (#29466) · 6bfc5721

由 liuyuhui 提交于 12月 08, 2020

* add deformable_conv op on xpu (#29234)

* rebase develop

* update deformable_conv op on xpu

* update deformable_conv op on xpu

* update kunlun conv2d/softmax/elementwise implemetation (#29229)

* update conv2d & softmax to new xpu api
* test=kunlun

* remove useless comments
* test=kunlun

* remote softmax xpu op
* test=kunlun

* update kunlun softmax
* test=kunlun

* update xpu unitest
* test=kunlun

* fix elementwise_grad bug for kunlun
*test=kunlun

* support global pooling for kunlun (#29293)

* test=kunlun

* update reduce_sum op on xpu (#29367)

* update reduce_sum op on xpu

* update reduce_sum op on xpu

* support running on xpu

* fix expand/uniform_random && concat/transpose to new api on xpu (#29280)

* fix expand && concat/transpose to new api

* update uniform_random_op

* update xpu_header

* 1. fix elementwise ops'bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op (#29448)
Co-authored-by: Nroot <root@bjhw-sys-rpm0223.bjhw.baidu.com>
Co-authored-by: N卖鱼的哲学 <tangzhiyi11@users.noreply.github.com>
Co-authored-by: NQingshuChen <qingshu.chen714@gmail.com>
Co-authored-by: Ntaixiurong <taixiurong@126.com>
Co-authored-by: Nroot <root@bjhw-sys-rpm0223.bjhw.baidu.com>

6bfc5721

L

refine reshape grad and double grad kernel, use tensor copy async (#29128) (#29446) · 08ee7485
由 Leo Chen 提交于 12月 08, 2020

08ee7485
Z

revert cast eigen kernel (#29445) · 14cf420e
由 Zhang Ting 提交于 12月 08, 2020

14cf420e

07 12月, 2020 5 次提交
- W
  
  polish the code of cumsum and remove some unused code (#29303) (#29423) · d77566b3
  由 wangchaochaohu 提交于 12月 07, 2020
  
  d77566b3
- Z
  
  fix that parameters'grad has grad var (#29440) · 492d43ef
  由 Zhou Wei 提交于 12月 07, 2020
  
  492d43ef
- S
  Fix unittest (#29412) (#29437) · c14d2c6a
  由 Shang Zhizhou 提交于 12月 07, 2020
```
* fix tensorrt unittest precision error

* fix unittest precision error. test_trt_subgraph_pass && test_trt_dynamic_shape_transformer_prune
```
  c14d2c6a
- C
  
  Use different name_scope for different conv type, test=develop (#29355) (#29410) · f223c786
  由 cc 提交于 12月 07, 2020
  
  f223c786
- T
  fix gpu outofrange (#29238) (#29348) · de3c067a
  由 tangwei12 提交于 12月 07, 2020
```
* fix gpu emb out of range

Change-Id: I5794ac73bd634d5ea069a6fbbd914274b6d6b7bf

* fix doc

Change-Id: I5a3350b2930a9ab2f52116c192b087307faf8fdf
```
  de3c067a
05 12月, 2020 2 次提交

update unbind norm add CUDAPlace api doc information (#29322) (#29391) · 7e322b3c

由 myq406450149 提交于 12月 05, 2020

* enhance array_to_lod_tensor_op lod_tensor_to_array_op errors information. test=develop

* fix format. test=develop

* format fix. test=develop

* add lod_rank_table. test=develop

* fix format. test=develop

* fix doc info. test=develop

* fix np error

* add unbind dygraph api. test=develop

* fix unbind doc.test=develop

7e322b3c

Release/2.0 rc1 (#29388) · fbb6cd70

由 chentianyu03 提交于 12月 05, 2020

* fix random failed of complex matmul

* Make transpose, trace, kron, reshape, sum op support complex type (#29321)

* add complex64 and complex128 type; add +-*/@ and slice opreator for complex types

* add test cases for complex elementwise, matmul and getitem unittest

* add test cases for complex types

* add test cases for complex matmul unittest

* kron, reshape, transpose support complex types

* sum and trace op support complex types

* add test case of sum and trace op

* fix the bug of imag part of complex not initialized

* format file

* format code style

* kron support type promotion; modify test cases

fbb6cd70

04 12月, 2020 5 次提交

L

update, test=develop (#29331) (#29370) · 11980774
由 lilong12 提交于 12月 04, 2020

11980774
S
fix tensorrt output shape error (#29308) (#29344) · 7a0602c8
由 Shang Zhizhou 提交于 12月 04, 2020
```
* fix tensorrt output shape error

* fix unittest tensorrt_engine_op_test

* fix code style for unitest
```
7a0602c8

[cherry-pick 2.0rc1][inplace] Add ShareHolderWith for class Variable and... · efb5ad62

由 liym27 提交于 12月 04, 2020

[cherry-pick 2.0rc1][inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows (#29267) (#29359)

efb5ad62

Support type promote for basic math ops (quantum required) (#29265) (#29354) · 0e7539e7

由 Chen Weihang 提交于 12月 04, 2020

* basic impl of type promote

* add comment & another testcase

* fix complex bugs & support python op promote type

* fix failed unittests & polish code

* add unittest for coverage

* change to only promote complex type

* polish code details

* polish several comments

0e7539e7

L
use has_grad instead of train_mode (#29309) (#29346) · 0a7c7c1c
由 Leo Chen 提交于 12月 04, 2020
```
* use has_grad instead of train_mode

* add vlog for debug

* fix ut

* fix ut
```
0a7c7c1c

03 12月, 2020 4 次提交

S
[Cherry-Pick]Fix reducer warning & fix doc of fleet (#29333) · afa50f45
由 ShenLiang 提交于 12月 03, 2020
```
* fix the warning of reducer (#29323)

* fix warning of fleet (#29317)

* Fix doc of fleet api (#29282)
```
afa50f45
L

fix shape of tile_grad op (#29289) (#29324) · 8cd8cd53
由 Leo Chen 提交于 12月 03, 2020

8cd8cd53
S
[cherry-pick]Change the api of DataParallel and Fleet (#29288) · ec57656e
由 ShenLiang 提交于 12月 03, 2020
```
* Change the api of DataParallel and Fleet (#29224)
```
ec57656e

[Cherry-pick] Add pure fp16 training with master weights. (#29301) · d8ea8a06

由 Zhen Wang 提交于 12月 03, 2020

* Add pure fp16 training with master weights. (#27712)

* add the weight decay func for the momentum op

* Add the multi_precision function in Momentum Optimizer.

* Make sure that the initial value of master weights are same with the fp16 weights.

* add static loss scaling.

* add the rescale_grad function in the pure fp16 training.

* use the original momentum updating method.

* Polish some codes, such as variable names.

* add docstring for apis.

* update the var creation details of _create_master_weight.

* not modify codes about imperative momentum updating.

* Fix the error of test_dist_sparse_tensor_load_momentum UT.

* add unit test for multi precision fp16 training.

* add more unit tests for CI.

* Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.

d8ea8a06

02 12月, 2020 3 次提交
- W
  
  cherry-pick 29304. (#29305) · 3809fca6
  由 Wilber 提交于 12月 02, 2020
  
  3809fca6
- S
  add compile option WITH_TENSORRT (#29208) (#29264) · f5afeef1
  由 Shang Zhizhou 提交于 12月 02, 2020
```
* add compile option WITH_TENSORRT

* add WITH_TENSORRT to ci paddle_buils.sh

* add WITH_TENSORRT to paddle_build.sh

* change FATAL to WARNING when TensorRT is not found and WITN_TENSORRT=ON, just to pass ci-py3 temporarily
```
  f5afeef1
- C
  Hot fix complle failed in gcc4.8 caused by complex impl (#29254) (#29274) · 40bad648
  由 Chen Weihang 提交于 12月 02, 2020
```
* hot fix complle failed in gcc4.8

* fix failed unittest
```
  40bad648
01 12月, 2020 3 次提交

add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199) · 8f45d142

由 chentianyu03 提交于 12月 01, 2020

* add complex64 and complex128 type; add +-*/@ and slice opreator for complex types

* add test cases for complex elementwise, matmul and getitem unittest

* add test cases for complex types

* add test cases for complex matmul unittest

8f45d142

accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept (#28429) · c0a991c8

由 Zhou Wei 提交于 12月 01, 2020

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* fix coverage

* fix api doc

* fix CI unittest

* fix CI unittest

* fix unitest

* empty tensor does’t need inner_var_

* fix some error message

c0a991c8

W

fix lite unit test. (#29233) · 74c43ac6
由 Wilber 提交于 12月 01, 2020

74c43ac6

30 11月, 2020 13 次提交

A
Small optimizations for conv2d kernel subroutines. (#29188) · 4096ff94
由 Adam Osewski 提交于 11月 30, 2020
```
- Make sure that oneDNN memory descriptors are created only once at
first iteration.
```
4096ff94
J

Enable all image classification models (#29155) · 5c61eeef
由 joanna.wozna.intel 提交于 11月 30, 2020

5c61eeef
W

[Lite-Subgraph] Fix compile error for lite subgraph. (#29146) · 4fec182d
由 Wilber 提交于 11月 30, 2020

4fec182d

Update ps gpu (#29209) · b5c63423

由 123malin 提交于 11月 30, 2020

* fix paramete prefetch & device guard
Co-authored-by: NMrChengmo <cmchengmo@163.com>
Co-authored-by: Nchengmo <chengmo@baidu.com>

b5c63423

Check whether there is any inplace operation affecting gradient calculation. (#27901) · 865a4598

由 liym27 提交于 11月 30, 2020

* Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable.

* Add a new attribute `_inplace_version` for VarBase.

* Raise exception if an inplace operation can result in incorrect gradient computation.

* Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation.

* For api assign, call _bump_inplace_version() when it's an inplace operation inn dynamic mode.

* Use original var_wrapper if the inplace_version is not changed.

* Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performane.

865a4598

Add unittest in musl build (#29099) · 4056c4f1

由 chen.zhiyu 提交于 11月 30, 2020

* add musl docker build script

* rm space test=document_fix

* fix some docs and types errors test=document_fix

* move install of python requirement to docker build

* add copyright to docker file.

* add extr opts

* format docs

* add ut test add pip cache

* add more args description in readme

* add stack backtrace in ctest

* fix readme bugs

4056c4f1

1
prefetch optimize (#29095) · 03d4665f
由 123malin 提交于 11月 30, 2020
```
* test=develop, optimize async prefetch
```
03d4665f
W

optimizer amp, all use fp16 communication, overlap last comm and compute (#28957) · 0c2a51d2
由 WangXi 提交于 11月 30, 2020

0c2a51d2

Polish unittests details and execution conditions to adapt to MUSL (#29044) · 0b032fae

由 Chen Weihang 提交于 11月 30, 2020

* fix failed tests in yingchun gived list

* add unittests into static_mode_white_list

* add enable static

* fix dist unittest

* skip test_sigmoid_focal_loss_op & add gym

* revert no need skip unittests

* remove gym

0b032fae

1
test=develop, rm pathlib (#28658) · 92817f80
由 123malin 提交于 11月 30, 2020
```
* test=develop, rm pathlib
```
92817f80
W

Add quantization of multi_gru op and tests (#28615) · 4fd4095d
由 Wojciech Uss 提交于 11月 30, 2020

4fd4095d
J
fix gru gcc7.4 bug for the gru compile · bc6033f8
由 Jack Zhou 提交于 11月 30, 2020
```
fix gru gcc7.4 bug for the gru compile
```
bc6033f8

Generate code coverage reports only for incremental files (#28508) · 0239f796

由 wanghuancoder 提交于 11月 30, 2020

* Generate code coverage reports only for incremental files, test=develop

* Generate code coverage reports only for incremental files, test=develop

* Generate code coverage reports only for incremental files, test=develop

* test for diff python file, test=develop

* fix no python diff report, test=develop

* add cc test file, test=develop

* fix bug in generic.cmake, test=develop

* for debug no cc report, test=develp

* modify compire branch form test_pr to test, test=develop

* fix bug, test=develop

* test for h file changed, test=develop

* debug for redefinition of argument optimize error, test=develop

* close -o3 for test, test=develop

* remove -o3 for test, test=develop

* remove coverage option for nvcc, test=develop

* use CMAKE_CXX_FLAGS open coverage option when header file changed, test=develop

* reopen -o3, test=develop

* remove debug code, test=develop

* remove unused code, test=develop

0239f796

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致