提交 · 6bfc57215fbe0f9f876dfce45ca67ec10c4f7e2b · BaiXuePrincess / Paddle

08 12月, 2020 4 次提交

[2.0 rc1/cherrypick] cherry-pick kunlun PR:29234/29229/29293/29367/29280/29448 (#29466) · 6bfc5721

由 liuyuhui 提交于 12月 08, 2020

* add deformable_conv op on xpu (#29234)

* rebase develop

* update deformable_conv op on xpu

* update deformable_conv op on xpu

* update kunlun conv2d/softmax/elementwise implemetation (#29229)

* update conv2d & softmax to new xpu api
* test=kunlun

* remove useless comments
* test=kunlun

* remote softmax xpu op
* test=kunlun

* update kunlun softmax
* test=kunlun

* update xpu unitest
* test=kunlun

* fix elementwise_grad bug for kunlun
*test=kunlun

* support global pooling for kunlun (#29293)

* test=kunlun

* update reduce_sum op on xpu (#29367)

* update reduce_sum op on xpu

* update reduce_sum op on xpu

* support running on xpu

* fix expand/uniform_random && concat/transpose to new api on xpu (#29280)

* fix expand && concat/transpose to new api

* update uniform_random_op

* update xpu_header

* 1. fix elementwise ops'bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op (#29448)
Co-authored-by: Nroot <root@bjhw-sys-rpm0223.bjhw.baidu.com>
Co-authored-by: N卖鱼的哲学 <tangzhiyi11@users.noreply.github.com>
Co-authored-by: NQingshuChen <qingshu.chen714@gmail.com>
Co-authored-by: Ntaixiurong <taixiurong@126.com>
Co-authored-by: Nroot <root@bjhw-sys-rpm0223.bjhw.baidu.com>

6bfc5721

S
[Cherry-Pick]Fix bug where embedding can‘t be processed correctly in reducer (#29490) · 6b9302a2
由 ShenLiang 提交于 12月 08, 2020
```
* fix the bug of reducer in embedding
```
6b9302a2
L
[Cherry-pick] Fix bug in gloo that gloo initialization hangs (#29449) · d8e1e50a
由 lilong12 提交于 12月 08, 2020
```
* update, test=develop (#29331)
```
d8e1e50a
Z

revert cast eigen kernel (#29445) · 14cf420e
由 Zhang Ting 提交于 12月 08, 2020

14cf420e

07 12月, 2020 4 次提交
- S
  Fix unittest (#29412) (#29437) · c14d2c6a
  由 Shang Zhizhou 提交于 12月 07, 2020
```
* fix tensorrt unittest precision error

* fix unittest precision error. test_trt_subgraph_pass && test_trt_dynamic_shape_transformer_prune
```
  c14d2c6a
- B
  Add deform_conv2d,DeformConv2D (#29364) (#29425) · b776434c
  由 Bai Yifan 提交于 12月 07, 2020
```
* add deform_conv2d,DeformConv2D
```
  b776434c
- C
  
  change shape of output in cross_entropy (#29414) · d094cd02
  由 chajchaj 提交于 12月 07, 2020
  
  d094cd02
- C
  remove complexvariable (#29390) (#29417) · 9fec4bce
  由 chentianyu03 提交于 12月 07, 2020
```
* rm complexvariable

* modify test_var_base unittest

* remove duplicated codes
```
  9fec4bce
05 12月, 2020 2 次提交

L
[cherri-pick] Fix bug: delete wrong check_type of paddle.concat and support... · 2816f590
由 liym27 提交于 12月 05, 2020
```
[cherri-pick] Fix bug: delete wrong check_type of paddle.concat and support LoDTensorArray (#29306) (#29368)
```
2816f590

Release/2.0 rc1 (#29388) · fbb6cd70

由 chentianyu03 提交于 12月 05, 2020

* fix random failed of complex matmul

* Make transpose, trace, kron, reshape, sum op support complex type (#29321)

* add complex64 and complex128 type; add +-*/@ and slice opreator for complex types

* add test cases for complex elementwise, matmul and getitem unittest

* add test cases for complex types

* add test cases for complex matmul unittest

* kron, reshape, transpose support complex types

* sum and trace op support complex types

* add test case of sum and trace op

* fix the bug of imag part of complex not initialized

* format file

* format code style

* kron support type promotion; modify test cases

fbb6cd70

04 12月, 2020 6 次提交

H
[Dy2stat] Reduce Exception Type for Better Error Message (#29268) (#29363) · 981244cf
由 Huihuang Zheng 提交于 12月 04, 2020
```
Reduce exception type so that if covert_to_static failed, it reports right error message.
```
981244cf

[cherry-pick 2.0rc1][inplace] Add ShareHolderWith for class Variable and... · efb5ad62

由 liym27 提交于 12月 04, 2020

[cherry-pick 2.0rc1][inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows (#29267) (#29359)

efb5ad62

[cherry-pick 2.0rc1][Dy2Stat] Fix bug: Do not use gast.Subscript to replace... · d10eb700

由 liym27 提交于 12月 04, 2020

[cherry-pick 2.0rc1][Dy2Stat] Fix bug: Do not use gast.Subscript to replace gast.Name in when transforming for_enumerate_loop (#29310) (#29361)

d10eb700

Support type promote for basic math ops (quantum required) (#29265) (#29354) · 0e7539e7

由 Chen Weihang 提交于 12月 04, 2020

* basic impl of type promote

* add comment & another testcase

* fix complex bugs & support python op promote type

* fix failed unittests & polish code

* add unittest for coverage

* change to only promote complex type

* polish code details

* polish several comments

0e7539e7

[Cheery-Pick 2.0.0-rc1][Dy2stat] Add a decorator paddle.jit.not_to_static to... · 8e0d688a

由 liym27 提交于 12月 04, 2020

[Cheery-Pick 2.0.0-rc1][Dy2stat] Add a decorator paddle.jit.not_to_static to support that not to convert a function in Dynamic-to-Static. (#29253) (#29340)

Usage scenarios：A function could have run successfully in static mode,  you can use it to decorate a function in the following cases:
  1. An unknown error occurs in the dynamic-to-static conversion process of the function;
  2. In the internal implementation of the function, it has two branches: dynamic branch and static branch;
  3. Users don't want to convert the function in the process of dynamic to static.

8e0d688a

L
use has_grad instead of train_mode (#29309) (#29346) · 0a7c7c1c
由 Leo Chen 提交于 12月 04, 2020
```
* use has_grad instead of train_mode

* add vlog for debug

* fix ut

* fix ut
```
0a7c7c1c

03 12月, 2020 4 次提交

L
Move temporal_shift to paddle.nn.functional (#29261) (#29315) · f616daaa
由 LielinJiang 提交于 12月 03, 2020
```
* move temporal_shift to functional
```
f616daaa
S
[cherry-pick]Change the api of DataParallel and Fleet (#29288) · ec57656e
由 ShenLiang 提交于 12月 03, 2020
```
* Change the api of DataParallel and Fleet (#29224)
```
ec57656e

[Cherry-pick] Add pure fp16 training with master weights. (#29301) · d8ea8a06

由 Zhen Wang 提交于 12月 03, 2020

* Add pure fp16 training with master weights. (#27712)

* add the weight decay func for the momentum op

* Add the multi_precision function in Momentum Optimizer.

* Make sure that the initial value of master weights are same with the fp16 weights.

* add static loss scaling.

* add the rescale_grad function in the pure fp16 training.

* use the original momentum updating method.

* Polish some codes, such as variable names.

* add docstring for apis.

* update the var creation details of _create_master_weight.

* not modify codes about imperative momentum updating.

* Fix the error of test_dist_sparse_tensor_load_momentum UT.

* add unit test for multi precision fp16 training.

* add more unit tests for CI.

* Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.

d8ea8a06

H
[Dy2stat] Fix PaddleGan Deoldify Model Dy2stat Problems (#29226) (#29281) · 32c139d3
由 Huihuang Zheng 提交于 12月 03, 2020
```
Cherry-pick of PR #29226
```
32c139d3

02 12月, 2020 2 次提交
- C
  
  fix random failed of complex matmul (#29299) · 4b553ece
  由 chentianyu03 提交于 12月 02, 2020
  
  4b553ece
- C
  Hot fix complle failed in gcc4.8 caused by complex impl (#29254) (#29274) · 40bad648
  由 Chen Weihang 提交于 12月 02, 2020
```
* hot fix complle failed in gcc4.8

* fix failed unittest
```
  40bad648
01 12月, 2020 4 次提交

J
Momentum Velocity init in Momentum.__init__() (#29223) · a5d13d59
由 Jiawei Wang 提交于 12月 01, 2020
```
* add lamb optimizer and unittest

* fix momentum resume training

* fix momentum acc
```
a5d13d59
W

revert python file coverage, delete coverage run --include, test=develop (#29230) · 2b2cd186
由 wanghuancoder 提交于 12月 01, 2020

2b2cd186

add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199) · 8f45d142

由 chentianyu03 提交于 12月 01, 2020

* add complex64 and complex128 type; add +-*/@ and slice opreator for complex types

* add test cases for complex elementwise, matmul and getitem unittest

* add test cases for complex types

* add test cases for complex matmul unittest

8f45d142

accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept (#28429) · c0a991c8

由 Zhou Wei 提交于 12月 01, 2020

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* fix coverage

* fix api doc

* fix CI unittest

* fix CI unittest

* fix unitest

* empty tensor does’t need inner_var_

* fix some error message

c0a991c8

30 11月, 2020 11 次提交

C

diable test_yolov3 in musl (#29216) · 786e69e9
由 Chen Weihang 提交于 11月 30, 2020

786e69e9
H

Refine the doc and unit test for Sigmoid and stanh (#29198) · f23665e5
由 hong19860320 提交于 11月 30, 2020

f23665e5

Update ps gpu (#29209) · b5c63423

由 123malin 提交于 11月 30, 2020

* fix paramete prefetch & device guard
Co-authored-by: NMrChengmo <cmchengmo@163.com>
Co-authored-by: Nchengmo <chengmo@baidu.com>

b5c63423

Check whether there is any inplace operation affecting gradient calculation. (#27901) · 865a4598

由 liym27 提交于 11月 30, 2020

* Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable.

* Add a new attribute `_inplace_version` for VarBase.

* Raise exception if an inplace operation can result in incorrect gradient computation.

* Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation.

* For api assign, call _bump_inplace_version() when it's an inplace operation inn dynamic mode.

* Use original var_wrapper if the inplace_version is not changed.

* Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performane.

865a4598

J
Remove cast from paddle.pow api (#29134) · dc070ecf
由 joejiong 提交于 11月 30, 2020
```
As the title
```
dc070ecf
W

optimizer amp, all use fp16 communication, overlap last comm and compute (#28957) · 0c2a51d2
由 WangXi 提交于 11月 30, 2020

0c2a51d2

Polish unittests details and execution conditions to adapt to MUSL (#29044) · 0b032fae

由 Chen Weihang 提交于 11月 30, 2020

* fix failed tests in yingchun gived list

* add unittests into static_mode_white_list

* add enable static

* fix dist unittest

* skip test_sigmoid_focal_loss_op & add gym

* revert no need skip unittests

* remove gym

0b032fae

T

add set_trainer_num api in dataset (#29133) · 4adddcc8
由 Thunderbrook 提交于 11月 30, 2020

4adddcc8
L

fix code: if y is True -> if y (#29184) · e0344081
由 liym27 提交于 11月 30, 2020

e0344081

save model after jit.load (#28748) · 1476e1f9

由 WeiXin 提交于 11月 30, 2020

* Changed a variable name error

* Add comments

* Move member functions of TranslatedLayer out of function

* edit code according to review

* Edit input argument of '_run_static_graph'

* reset due to Segmentation fault

* rename variables when stitching graph

* modify code according CI

* Add comments to '__i_m_p_l__'

* remove blanks befor 'Get...'

* edit code according to review

* Add a comment to '_execution_method_creator'

* Edit a comment to '_execution_method_creator'

1476e1f9

Generate code coverage reports only for incremental files (#28508) · 0239f796

由 wanghuancoder 提交于 11月 30, 2020

* Generate code coverage reports only for incremental files, test=develop

* Generate code coverage reports only for incremental files, test=develop

* Generate code coverage reports only for incremental files, test=develop

* test for diff python file, test=develop

* fix no python diff report, test=develop

* add cc test file, test=develop

* fix bug in generic.cmake, test=develop

* for debug no cc report, test=develp

* modify compire branch form test_pr to test, test=develop

* fix bug, test=develop

* test for h file changed, test=develop

* debug for redefinition of argument optimize error, test=develop

* close -o3 for test, test=develop

* remove -o3 for test, test=develop

* remove coverage option for nvcc, test=develop

* use CMAKE_CXX_FLAGS open coverage option when header file changed, test=develop

* reopen -o3, test=develop

* remove debug code, test=develop

* remove unused code, test=develop

0239f796

28 11月, 2020 3 次提交
- H
  [Dy2stat] Disable PaddleInference IR Optimization in test_mnist for CUDA11 (#29105) · 27b42183
  由 Huihuang Zheng 提交于 11月 28, 2020
```
test_mnist failed on CUDA11. We found that it is due to PaddleInference IR Optimization after debugging. We disable it in this PR and we will re-enable it after PaddleInference fixes it.
```
  27b42183
- L
  
  [Dy2Stat] Don't conver the function from third library logging (#29161) · 01bdea7c
  由 liym27 提交于 11月 28, 2020
  
  01bdea7c
- L
  [Dy2Stat] Fix bug: the return statement should be transformed to an equivalent... · a7433cc3
  由 liym27 提交于 11月 28, 2020
```
[Dy2Stat] Fix bug: the return statement should be transformed to an equivalent Paddle/Python if statement, which depends on if conditions of the return stmt. (#29165)
```
  a7433cc3

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致