提交 · 1b9a3bfc9bd1e7e16e842e3f34cb9ca5a9249c93 · 机器未来 / Paddle

26 4月, 2021 1 次提交

[Dy2stat] Support paddle.to_tensor with int, float, bool. (#32420) · 1b9a3bfc

由 Huihuang Zheng 提交于 4月 26, 2021

paddle.to_tensor will be translated to paddle.assign in Dy2stat, however paddle.assign doesn't support int, float, bool. This PR added the supports.

1b9a3bfc

25 4月, 2021 17 次提交
- L
  add pipeline for dynamic graph (#32511) · 561dc719
  由 lilong12 提交于 4月 25, 2021
```
* add pp dygraph, test=develop
```
  561dc719
- J
  Dygraph Recompute (#32516) · 583ebab7
  由 JZ-LIANG 提交于 4月 25, 2021
```
* Dygraph reocmpute

* unitest for Dygraph reocmpute

* dy recompute remove unitest for win and mac
```
  583ebab7
- H
  Make range API set its out shape when possible (#32472) · f16981b1
  由 Huihuang Zheng 提交于 4月 25, 2021
```
`range` API set its output shape in dygraph but not in static graph, which can cause Dy2stat error. This PR set the shape of `range` API when possible.
```
  f16981b1
- W
  
  fix a bug, test=develop (#32488) · 29e081bb
  由 wanghuancoder 提交于 4月 25, 2021
  
  29e081bb
- L
  
  [slice] Support index is Tensor for slice in dynamic mode (#32435) · aceec7fb
  由 liym27 提交于 4月 25, 2021
  
  aceec7fb
- L
  
  [Setitem] Support grad computation of op set_value (#32431) · 25e723e7
  由 liym27 提交于 4月 25, 2021
  
  25e723e7
- B
  
  add copy_cross_scope (#32432) · 5943ff7b
  由 Baibaifan 提交于 4月 25, 2021
  
  5943ff7b
- W
  paddle.save/load support nested structure and layer (#32446) · 727b28d7
  由 WeiXin 提交于 4月 25, 2021
```
* support save/load binary format tensor

* Fix error when create cudaplace

* Fix error when create cudaplace

* Fix error when create cudaplace

* get devive context from pool.

* move define of 'SerializeToStream' and 'DeserializeFromStream' to 'lod_tensor.cc' and 'selected_rows.cc'.

* support complex object

* improve coverage.

* improve coverage

* improve coverage.

* fix a bug.

* polish API

* save/load program

* paddle.save/load: layer

* deal with conflict

* if PY2, block test_paddle_save_load.TestSaveLoadLayer

* polish code.

* polish code

* edit unnittest

* The condition for object to be identified as state_dict becomes strict

* use 'core._cuda_synchronize'
```
  727b28d7
- M
  
  add silu op, test=develop (#32384) · 2f351ed5
  由 minghaoBD 提交于 4月 25, 2021
  
  2f351ed5
- S
  [HybridParallel] Add pipeline layer in dygraph (#32449) · 7ef1de67
  由 ShenLiang 提交于 4月 25, 2021
```
* add pipeline layer
```
  7ef1de67
- L
  Fix the bug in mp (#31996) · 976fe6f9
  由 lilong12 提交于 4月 25, 2021
```
* update
```
  976fe6f9
- W
  [BUG FIX] when x.dim < y.dim, the result of compare_op is inverse (#32470) · 78eff521
  由 wawltor 提交于 4月 25, 2021
```
* fix bug: when x.dim < y.dim, the result of compare_op is inverse to expected result

* support the cuda for fix the compare broadcast bug
```
  78eff521
- S
  fix tc trt shape (#32458) · f272e59a
  由 Shang Zhizhou 提交于 4月 25, 2021
```
* fix tc trt shape

* fix fc dynamic shape

* add fc shape assert

* update
```
  f272e59a
- P
  let paddle.utils.install_check support CPU package with GPU device (#32428) · 06276f46
  由 pangyoki 提交于 4月 25, 2021
```
* let paddle.utils.install_check support CPU package with GPU device

* use use_cuda in dygraph checking

* add unittest for install_check
```
  06276f46
- L
  
  fix tensor to_string when shape contains zero (#32501) · 3b61d066
  由 Leo Chen 提交于 4月 25, 2021
  
  3b61d066
- Z
  
  add detail for gpu_id, document_fix (#32444) · 136ef09d
  由 Zhang Ting 提交于 4月 25, 2021
  
  136ef09d
- W
  
  use 'paddle.framework.set_grad_enabled' in pylayer (#32355) · 83580ee6
  由 WeiXin 提交于 4月 25, 2021
  
  83580ee6
24 4月, 2021 2 次提交
- H
  Fix test_yolov3 Random Failure (#32496) · 9bf90922
  由 Huihuang Zheng 提交于 4月 24, 2021
```
Reduce max iter size to fix windows openblas test_yolov3 random failure.
Decrease batch size to fix pe related unittest random failure.
```
  9bf90922
- Z
  
  add tensor.tolist() support (#32366) · 8beb1707
  由 zhiboniu 提交于 4月 24, 2021
  
  8beb1707
23 4月, 2021 7 次提交

L
add the c_identity op (#32485) · 8fa8a37f
由 lilong12 提交于 4月 23, 2021
```
* add c_identity op, test=develop
```
8fa8a37f

[NPU] refactor check_finite_and_scale npu kernel (#32407) · 39a59dcf

由 Leo Chen 提交于 4月 23, 2021

* refactor_check_finite_and_scale_npu_kernel

* fix compile

* add alloc_float_status op

* add alloc_float_status op

* add FloatStatus for check_finite_and_unscale

* refine code

* remove unneccessary logic

* refine for fleet

39a59dcf

B
solve hccl communicate conflict (#32447) · 0e74eea2
由 Baibaifan 提交于 4月 23, 2021
```
solve hccl communicate conflict (#32447)
```
0e74eea2
L
add c_concat and c_split ops (#32486) · 2b108a04
由 lilong12 提交于 4月 23, 2021
```
* add c_concat op
```
2b108a04
S

add lstm support on xpu test=kunlun (#32436) · b6f8ccd2
由 shanliang1992 提交于 4月 23, 2021

b6f8ccd2
S

disable utest (#32474) · 1dc83932
由 ShenLiang 提交于 4月 23, 2021

1dc83932

Fix seven error message (#32397) · 203ac4f3

由 Kqnonrime 提交于 4月 23, 2021

* fix two error message

* fix two error message

* fix error

* fix error

* fix error

* fix error

* fix some error message

* fix some error

* fix error

* fix some error

* fix some error

* fix some error

* fix one error

* fix some error

* fix seven error message

* fix error

* fix error

* fix error

* fix error

203ac4f3

22 4月, 2021 9 次提交
- Y
  
  Add `paddle.set_grad_enabled` (#31794) · f8ca5a9d
  由 Yang Zhang 提交于 4月 22, 2021
  
  f8ca5a9d
- W
  support int32 and int64 kernel for clip operator (#32373) · c3328288
  由 wuyefeilin 提交于 4月 22, 2021
```
support int32 and int64 kernel for clip operator 
```
  c3328288
- Y
  
  Add fleet get_loss_scaling doc and update alert message (#32419) · d03b0b16
  由 Yuang Liu 提交于 4月 22, 2021
  
  d03b0b16
- F
  import sequence_* API to new namespace (#32089) · f12c943a
  由 Feiyu Chan 提交于 4月 22, 2021
```
* import sequence_* API to new namespace

* fix typos, remove alias marking

* update sample code

* fix sample code

* fix docstring for sequence_mask
```
  f12c943a
- Z
  
  fix type(x)=paddle.VarBase to paddle.Tensor (#32364) · bec4b167
  由 zhiboniu 提交于 4月 22, 2021
  
  bec4b167
- S
  [HybridParallel] Add ClipGradByGlobalNorm & check_finite_and_unscale in Dygraph (#32354) · 7ea999fd
  由 ShenLiang 提交于 4月 22, 2021
```
* add clip/check

* add amp & clip grad in dygraph

* add logging
```
  7ea999fd
- F
  add glu in nn.functional (#32096) · b2ee8380
  由 Feiyu Chan 提交于 4月 22, 2021
```
add glu in nn.functional
```
  b2ee8380
- W
  support save/load binary format tensor. (#32211) · f4d9adc7
  由 WeiXin 提交于 4月 22, 2021
```
* support save/load binary format tensor

* Fix error when create cudaplace

* Fix error when create cudaplace

* Fix error when create cudaplace

* get devive context from pool.

* move define of 'SerializeToStream' and 'DeserializeFromStream' to 'lod_tensor.cc' and 'selected_rows.cc'.

* improve coverage.

* improve coverage.

* polish API

* deal with conflict

* disable save/load large file in unnittest

* split unnittest.
```
  f4d9adc7
- T
  
  Delete WITH_GRPC flag and Distributed old code (#32383) · e58c705b
  由 tianshuo78520a 提交于 4月 22, 2021
  
  e58c705b
21 4月, 2021 4 次提交

C
[HotFix] Add support for optimizer with varbase input (#32362) · b47dd158
由 Chen Weihang 提交于 4月 21, 2021
```
* add support for optimizer with varbase input

* refine cond

* fix failed unittest

* add test for coverage
```
b47dd158

【NPU】Merge NPU ccl code (#32381) · c3158527

由 zhang wenhui 提交于 4月 21, 2021

* add allreduce and broadcast without test (#31024)

add allreduce and broadcast without test

* Refactor HCCLCommContext to be compatible with Paddle (#31359)

Refactor HCCLCommContext to be compatible with Paddle (#31359)

* [NPU] add npu kernel for communication op (#31437)

* add allreduce and broadcast without test

* add c_broadcast_test case

* build c_comm_init and c_create_group operators

* make the whole thing compile

* add broadcast and init op test case but run failed

* make unit test compile

* fix broadcast test bug and change into hcom for ccl

* change c_comm_init and c_create_group ops accordingly

* make tests compile

* transfer code to 27

* compiled successfully in 28, but run failed

* test broadcast in 28, but failed

* make hcom primitives work

* change hccl data type for base.h

* fix broadcast bug

* make attributes work

* fix group name bug

* add allreduce but test failed

* allreduce bug for qiuliang

* allreduce finished

* add allgather and reducescatter

* merge all op code

* add allgather test

* finish run all ccl op test exclude send/recv

* all all op and test exclude send/recv

* send_v2_npu.cc recv_v2_npiu.cc compiled

* fix ccl core dump bug and test allgather, reducescatter, broadcast op

* fix allreduce bug just for test

* hcom send&recv test pass, without hcom_destroy

* for qiuliang test

* Ascend Send&Recv Test Pass

* all op (ex send/recv) ok

* fix bug

* merge all ccl op

* style merge to PaddlePaddle

* merge style

* new merge style

* merge style 2

* insert an empty at the end

* disable ctest for hcom to pass ci
Co-authored-by: Nvoid-main <voidmain1313113@gmail.com>
Co-authored-by: Nf2hkop <f2huestc@outlook.com>

* Add auto-increasing tag id for Hcom OPs (#31702)

* add c_reduce_sum op (#31793)

add c_reduce_sum op

* update Ascendrc hccl to 20.3 (#32126)

update Ascendrc hccl to 20.3 (#32126)

* fix merge code

* change cmake.txt1

* [NPU] Support npu kernel for c sync stream op (#31386)

* sync stream npu op

* add with_ascend_acl

* update c++ unittest

* compile all failed

* try to pre commit

* after pre commit

* merge&compile&test hccl successfully!

* fix code style

* fix code style

* fix bugs about hccl

* fix some bugs

* fix code style

* fix style

* fix style

* fix

* fixed

* merge develop
Co-authored-by: Nlw921014 <liuwei921014@yeah.net>
Co-authored-by: NVoid Main <voidmain1313113@gmail.com>
Co-authored-by: Nf2hkop <f2huestc@outlook.com>
Co-authored-by: Nxiayanming <41795079@qq.com>

c3158527

H

fix bug in amp O2 (#32343) · 4be3b057
由 huangxu96 提交于 4月 21, 2021

4be3b057
A

[CustomOp]Fix MAC3-CI random failed with XXX_setup.py(#32369) · 7bae5e9a
由 Aurelius84 提交于 4月 21, 2021

7bae5e9a

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致