提交 · c0a991c8740b413559bfc894aa5ae1d5ed3704b5 · 机器未来 / Paddle

01 12月, 2020 1 次提交

accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept (#28429) · c0a991c8

由 Zhou Wei 提交于 12月 01, 2020

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* fix coverage

* fix api doc

* fix CI unittest

* fix CI unittest

* fix unitest

* empty tensor does’t need inner_var_

* fix some error message

c0a991c8

30 11月, 2020 11 次提交

C

diable test_yolov3 in musl (#29216) · 786e69e9
由 Chen Weihang 提交于 11月 30, 2020

786e69e9
H

Refine the doc and unit test for Sigmoid and stanh (#29198) · f23665e5
由 hong19860320 提交于 11月 30, 2020

f23665e5

Update ps gpu (#29209) · b5c63423

由 123malin 提交于 11月 30, 2020

* fix paramete prefetch & device guard
Co-authored-by: NMrChengmo <cmchengmo@163.com>
Co-authored-by: Nchengmo <chengmo@baidu.com>

b5c63423

Check whether there is any inplace operation affecting gradient calculation. (#27901) · 865a4598

由 liym27 提交于 11月 30, 2020

* Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable.

* Add a new attribute `_inplace_version` for VarBase.

* Raise exception if an inplace operation can result in incorrect gradient computation.

* Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation.

* For api assign, call _bump_inplace_version() when it's an inplace operation inn dynamic mode.

* Use original var_wrapper if the inplace_version is not changed.

* Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performane.

865a4598

J
Remove cast from paddle.pow api (#29134) · dc070ecf
由 joejiong 提交于 11月 30, 2020
```
As the title
```
dc070ecf
W

optimizer amp, all use fp16 communication, overlap last comm and compute (#28957) · 0c2a51d2
由 WangXi 提交于 11月 30, 2020

0c2a51d2

Polish unittests details and execution conditions to adapt to MUSL (#29044) · 0b032fae

由 Chen Weihang 提交于 11月 30, 2020

* fix failed tests in yingchun gived list

* add unittests into static_mode_white_list

* add enable static

* fix dist unittest

* skip test_sigmoid_focal_loss_op & add gym

* revert no need skip unittests

* remove gym

0b032fae

T

add set_trainer_num api in dataset (#29133) · 4adddcc8
由 Thunderbrook 提交于 11月 30, 2020

4adddcc8
L

fix code: if y is True -> if y (#29184) · e0344081
由 liym27 提交于 11月 30, 2020

e0344081

save model after jit.load (#28748) · 1476e1f9

由 WeiXin 提交于 11月 30, 2020

* Changed a variable name error

* Add comments

* Move member functions of TranslatedLayer out of function

* edit code according to review

* Edit input argument of '_run_static_graph'

* reset due to Segmentation fault

* rename variables when stitching graph

* modify code according CI

* Add comments to '__i_m_p_l__'

* remove blanks befor 'Get...'

* edit code according to review

* Add a comment to '_execution_method_creator'

* Edit a comment to '_execution_method_creator'

1476e1f9

Generate code coverage reports only for incremental files (#28508) · 0239f796

由 wanghuancoder 提交于 11月 30, 2020

* Generate code coverage reports only for incremental files, test=develop

* Generate code coverage reports only for incremental files, test=develop

* Generate code coverage reports only for incremental files, test=develop

* test for diff python file, test=develop

* fix no python diff report, test=develop

* add cc test file, test=develop

* fix bug in generic.cmake, test=develop

* for debug no cc report, test=develp

* modify compire branch form test_pr to test, test=develop

* fix bug, test=develop

* test for h file changed, test=develop

* debug for redefinition of argument optimize error, test=develop

* close -o3 for test, test=develop

* remove -o3 for test, test=develop

* remove coverage option for nvcc, test=develop

* use CMAKE_CXX_FLAGS open coverage option when header file changed, test=develop

* reopen -o3, test=develop

* remove debug code, test=develop

* remove unused code, test=develop

0239f796

28 11月, 2020 4 次提交

[Dy2stat] Disable PaddleInference IR Optimization in test_mnist for CUDA11 (#29105) · 27b42183

由 Huihuang Zheng 提交于 11月 28, 2020

test_mnist failed on CUDA11. We found that it is due to PaddleInference IR Optimization after debugging. We disable it in this PR and we will re-enable it after PaddleInference fixes it.

27b42183

L

[Dy2Stat] Don't conver the function from third library logging (#29161) · 01bdea7c
由 liym27 提交于 11月 28, 2020

01bdea7c

[Dy2Stat] Fix bug: the return statement should be transformed to an equivalent... · a7433cc3

由 liym27 提交于 11月 28, 2020

[Dy2Stat] Fix bug: the return statement should be transformed to an equivalent Paddle/Python if statement, which depends on if conditions of the return stmt. (#29165)

a7433cc3

[dy2stat] Set shape for linspace to Fix dy2stat for GridGenerator Model (#29173) · 4a0a8701

由 Huihuang Zheng 提交于 11月 28, 2020

GridGenerator model failed because the output shape of `linspace` is (-1). The reason is that C++ InferShape fixes the shape to (-1):

https://github.com/PaddlePaddle/Paddle/blob/5da3d514ebaa6fffd48c4a2e6bb5b16268dae92e/paddle/fluid/operators/linspace_op.cc#L49

We cannot set the shape in C++ infer shape because this Tensor may not be initialized during compile time, but when input `num` of `linspace` is an integer, we know the shape at compiler time. This PR simply set the shape in Python and add GridGenerator as unittest.

4a0a8701

27 11月, 2020 12 次提交

A

[Dy2Stat]Refine code of test_lac unittest (#29087) · cb680c80
由 Aurelius84 提交于 11月 27, 2020

cb680c80

Support dynamic graph distributed (#28997) · e2d01eb6

由 ShenLiang 提交于 11月 27, 2020

* add reducer

* refine envent for memorycopy

* add concat&split for allreduce

* apply concat & split for fuse tensor

* fix nccl dep

* fix the untest, compile problem and ddp initialize problem

* fix untest for mac & add some comments & solve the repeated param in sublayers

* fix untest for windows & fix document

e2d01eb6

L
update expand as op to use the shape of the target tensor instead of the... · 7e5e9934
由 lilong12 提交于 11月 27, 2020
```
update expand as op to use the shape of the target tensor instead of the target tensor itself. (#29020)

* update, test=develop
```
7e5e9934
K
alias yolo_loss & yolo_box to paddle.vision. (#28520) · f4c894a6
由 Kaipeng Deng 提交于 11月 27, 2020
```
* alias yolo_loss & decode_yolo_box to paddle.vision. test=develop
```
f4c894a6
L
add paddle.subtract, optimize paddle.maximum and paddle.minimum · 28280647
由 LutaoChu 提交于 11月 27, 2020
```
add paddle.subtract, optimize paddle.maximum and paddle.minimum 
```
28280647
J
Add eigen gru and fix the dropout bug in the rnn · 085260f3
由 Jack Zhou 提交于 11月 27, 2020
```
Add eigen gru and fix the dropout bug in the rnn 
```
085260f3
L
[Dynamic-to-Static] Support **kwargs as input of the function which is... · 5fe44571
由 liym27 提交于 11月 27, 2020
```
[Dynamic-to-Static] Support **kwargs as input of the function which is decorated by `jit.save.to_static` (#29098)
```
5fe44571
Y

fix error with ut timeout and failed (#29148) · 0fca8cdf
由 YUNSHEN XIE 提交于 11月 27, 2020

0fca8cdf
C

add debug msg for test_buffer_shared_memory_reuse_pass (#29151) · 0d1900d3
由 Chen Weihang 提交于 11月 27, 2020

0d1900d3

detect tensorRT plugin fp16 in runtime (#27933) · b9e76a01

由 Shang Zhizhou 提交于 11月 27, 2020

* remove -DSUPPORTS_CUDA_FP16 in cuda.cmake

* comile with cuda9

* add some unittest

* notest;test=coverage

* add unittest for trt plugin swish && split

* update ernie unittest

* fix some error message

* remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter

* fix comile errror when CUDA_ARCH_NAME < Pascal"

* fix comile error

* update unittest timeout

* compile with cuda9

* update error msg

* fix code style

* add some comments

* add define IF_CUDA_ARCH_SUPPORT_FP16

* rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED

b9e76a01

C
Add symlink force for unittest test_static_save_load (#29137) · c39da29d
由 Chen Weihang 提交于 11月 27, 2020
```
* add symlink force for unittest

* open unittest
```
c39da29d
C

support jit.save datra parallel (#29135) · 95a0f87b
由 Chen Weihang 提交于 11月 27, 2020

95a0f87b

26 11月, 2020 11 次提交

L
add paddle.broadcast_to api which is a alias of paddle.expand (#28706) · 449903de
由 lilong12 提交于 11月 26, 2020
```
* update, test=develop
```
449903de
L
Split train_mode and has_grad for tracer (#29064) · 770395cb
由 Leo Chen 提交于 11月 26, 2020
```
* split train_mode and has_grad

* fix format

* fix ci problems

* fix sample code
```
770395cb
Y

disable ut test_static_save_load (#29119) · 27d04a3b
由 YUNSHEN XIE 提交于 11月 26, 2020

27d04a3b

[sharding] doc, api, bug fixed (#28983) · 0dadacc4

由 JZ-LIANG 提交于 11月 26, 2020

* add lars to fleet meta optimizer

* add lamb to proto

* add lamb to fleet meta optimizer

* fixed syntax bug

* fixed syntax bug

* fixed syntax error in lamb, add config setter of lamb in distributed_strategy

* trigger unitest to rerun

* add new unitest func for lamb

* revise unitest for lars and lamb

* revise dgc meta unitest

* revise lars document in distribute_strategy

* revise lars lamb document in distributed_strategy.py

* revise lars lamb document in distributed_strategy.py

* add weight decay exclude logic to lars

* restore optimzier.py

* restore optimizer.py as develop except lars

* add epsilon and exclude fn to distributed_sttrategy

* add lars epsilon

* revise unitest for fleet lars and lamb

* revise lars lamb unitest for CI coverage

* revise lars argument api

* revise lars argument api

* revise lars argument api

* revise api doc of lars

* fix op role

* add sharding save and add_sync_comm_for_test function

* add comm_analyse to utlis

* revise sharding_utils

* add sharding saving unittest

* revise sharding utils for unittest

* revise sharding en doc

* update sharding utils api

* add doc for sharding

* fixed bug in sharding var size count

* update varsize count in sharding

* fix sharding num_nccl_comm

* Revert "fix sharding num_nccl_comm"

This reverts commit d51587c15e9323acf226ddd36154275f0d1daf76.

0dadacc4

Y

fix crypto ut test error for windows ci (#29090) · dd417750
由 Yanghello 提交于 11月 26, 2020

dd417750
W

Fix multi nccl comm & wait server ready (#28663) · e931c7ba
由 WangXi 提交于 11月 26, 2020

e931c7ba

add API serialize_program, serialize_persistables, save_to_file,... · db412585

由 Shibo Tao 提交于 11月 26, 2020

add API serialize_program, serialize_persistables, save_to_file, deserialize_program, deserialize_persistables, load_from_file. (#29034)

db412585

K
remove BatchSampler type check (#29114) · b052149d
由 Kaipeng Deng 提交于 11月 26, 2020
```
* remove BatchSampler type check. test=develop
```
b052149d
H

Add dygraph implementation for multiplex op (#29049) · db85f4cf
由 hutuxian 提交于 11月 26, 2020

db85f4cf
J
Add bf16 pool2d and unify bf16 unit tests (#29039) · b0d1ac16
由 joanna.wozna.intel 提交于 11月 26, 2020
```
* Add bf16 pool2d and unify bf16 unit tests

* Add change default ops test
```
b0d1ac16
G

Clean up the redundant files and unify the launch interface. (#28928) · 1358397e
由 gongweibao 提交于 11月 26, 2020

1358397e

25 11月, 2020 1 次提交
- C
  Hide the C++ stack by default and add hints (#29042) · fea0e294
  由 Chen Weihang 提交于 11月 25, 2020
```
* default not show cpp statck & add hint

* fix failed unittest

* fix failed unittests
```
  fea0e294

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致