提交 · f16981b1114f1abea94688a7861a44acb59ba717 · 机器未来 / Paddle

25 4月, 2021 11 次提交
- L
  
  [slice] Support index is Tensor for slice in dynamic mode (#32435) · aceec7fb
  由 liym27 提交于 4月 25, 2021
  
  aceec7fb
- L
  
  [Setitem] Support grad computation of op set_value (#32431) · 25e723e7
  由 liym27 提交于 4月 25, 2021
  
  25e723e7
- W
  paddle.save/load support nested structure and layer (#32446) · 727b28d7
  由 WeiXin 提交于 4月 25, 2021
```
* support save/load binary format tensor

* Fix error when create cudaplace

* Fix error when create cudaplace

* Fix error when create cudaplace

* get devive context from pool.

* move define of 'SerializeToStream' and 'DeserializeFromStream' to 'lod_tensor.cc' and 'selected_rows.cc'.

* support complex object

* improve coverage.

* improve coverage

* improve coverage.

* fix a bug.

* polish API

* save/load program

* paddle.save/load: layer

* deal with conflict

* if PY2, block test_paddle_save_load.TestSaveLoadLayer

* polish code.

* polish code

* edit unnittest

* The condition for object to be identified as state_dict becomes strict

* use 'core._cuda_synchronize'
```
  727b28d7
- M
  
  add silu op, test=develop (#32384) · 2f351ed5
  由 minghaoBD 提交于 4月 25, 2021
  
  2f351ed5
- S
  [HybridParallel] Add pipeline layer in dygraph (#32449) · 7ef1de67
  由 ShenLiang 提交于 4月 25, 2021
```
* add pipeline layer
```
  7ef1de67
- L
  Fix the bug in mp (#31996) · 976fe6f9
  由 lilong12 提交于 4月 25, 2021
```
* update
```
  976fe6f9
- W
  [BUG FIX] when x.dim < y.dim, the result of compare_op is inverse (#32470) · 78eff521
  由 wawltor 提交于 4月 25, 2021
```
* fix bug: when x.dim < y.dim, the result of compare_op is inverse to expected result

* support the cuda for fix the compare broadcast bug
```
  78eff521
- S
  fix tc trt shape (#32458) · f272e59a
  由 Shang Zhizhou 提交于 4月 25, 2021
```
* fix tc trt shape

* fix fc dynamic shape

* add fc shape assert

* update
```
  f272e59a
- P
  let paddle.utils.install_check support CPU package with GPU device (#32428) · 06276f46
  由 pangyoki 提交于 4月 25, 2021
```
* let paddle.utils.install_check support CPU package with GPU device

* use use_cuda in dygraph checking

* add unittest for install_check
```
  06276f46
- L
  
  fix tensor to_string when shape contains zero (#32501) · 3b61d066
  由 Leo Chen 提交于 4月 25, 2021
  
  3b61d066
- W
  
  use 'paddle.framework.set_grad_enabled' in pylayer (#32355) · 83580ee6
  由 WeiXin 提交于 4月 25, 2021
  
  83580ee6
24 4月, 2021 2 次提交
- H
  Fix test_yolov3 Random Failure (#32496) · 9bf90922
  由 Huihuang Zheng 提交于 4月 24, 2021
```
Reduce max iter size to fix windows openblas test_yolov3 random failure.
Decrease batch size to fix pe related unittest random failure.
```
  9bf90922
- Z
  
  add tensor.tolist() support (#32366) · 8beb1707
  由 zhiboniu 提交于 4月 24, 2021
  
  8beb1707
23 4月, 2021 6 次提交

L
add the c_identity op (#32485) · 8fa8a37f
由 lilong12 提交于 4月 23, 2021
```
* add c_identity op, test=develop
```
8fa8a37f

[NPU] refactor check_finite_and_scale npu kernel (#32407) · 39a59dcf

由 Leo Chen 提交于 4月 23, 2021

* refactor_check_finite_and_scale_npu_kernel

* fix compile

* add alloc_float_status op

* add alloc_float_status op

* add FloatStatus for check_finite_and_unscale

* refine code

* remove unneccessary logic

* refine for fleet

39a59dcf

L
add c_concat and c_split ops (#32486) · 2b108a04
由 lilong12 提交于 4月 23, 2021
```
* add c_concat op
```
2b108a04
S

add lstm support on xpu test=kunlun (#32436) · b6f8ccd2
由 shanliang1992 提交于 4月 23, 2021

b6f8ccd2
S

disable utest (#32474) · 1dc83932
由 ShenLiang 提交于 4月 23, 2021

1dc83932

Fix seven error message (#32397) · 203ac4f3

由 Kqnonrime 提交于 4月 23, 2021

* fix two error message

* fix two error message

* fix error

* fix error

* fix error

* fix error

* fix some error message

* fix some error

* fix error

* fix some error

* fix some error

* fix some error

* fix one error

* fix some error

* fix seven error message

* fix error

* fix error

* fix error

* fix error

203ac4f3

22 4月, 2021 7 次提交
- Y
  
  Add `paddle.set_grad_enabled` (#31794) · f8ca5a9d
  由 Yang Zhang 提交于 4月 22, 2021
  
  f8ca5a9d
- W
  support int32 and int64 kernel for clip operator (#32373) · c3328288
  由 wuyefeilin 提交于 4月 22, 2021
```
support int32 and int64 kernel for clip operator 
```
  c3328288
- Z
  
  fix type(x)=paddle.VarBase to paddle.Tensor (#32364) · bec4b167
  由 zhiboniu 提交于 4月 22, 2021
  
  bec4b167
- S
  [HybridParallel] Add ClipGradByGlobalNorm & check_finite_and_unscale in Dygraph (#32354) · 7ea999fd
  由 ShenLiang 提交于 4月 22, 2021
```
* add clip/check

* add amp & clip grad in dygraph

* add logging
```
  7ea999fd
- F
  add glu in nn.functional (#32096) · b2ee8380
  由 Feiyu Chan 提交于 4月 22, 2021
```
add glu in nn.functional
```
  b2ee8380
- W
  support save/load binary format tensor. (#32211) · f4d9adc7
  由 WeiXin 提交于 4月 22, 2021
```
* support save/load binary format tensor

* Fix error when create cudaplace

* Fix error when create cudaplace

* Fix error when create cudaplace

* get devive context from pool.

* move define of 'SerializeToStream' and 'DeserializeFromStream' to 'lod_tensor.cc' and 'selected_rows.cc'.

* improve coverage.

* improve coverage.

* polish API

* deal with conflict

* disable save/load large file in unnittest

* split unnittest.
```
  f4d9adc7
- T
  
  Delete WITH_GRPC flag and Distributed old code (#32383) · e58c705b
  由 tianshuo78520a 提交于 4月 22, 2021
  
  e58c705b
21 4月, 2021 5 次提交
- C
  [HotFix] Add support for optimizer with varbase input (#32362) · b47dd158
  由 Chen Weihang 提交于 4月 21, 2021
```
* add support for optimizer with varbase input

* refine cond

* fix failed unittest

* add test for coverage
```
  b47dd158
- Y
  
  add get_loss_scaling to fleet (#32401) · 37bb3342
  由 Yuang Liu 提交于 4月 21, 2021
  
  37bb3342
- L
  
  [Kunlun]add collective ops for multi XPU cards training and add Kunlun multi XPU cards CI (#32302) · 2194ad15
  由 liuyuhui 提交于 4月 21, 2021
  
  2194ad15
- J
  
  Added bilinear and nearest interp v2 oneDNN FP32 kernels (#32312) · 5d19f8d8
  由 jakpiase 提交于 4月 21, 2021
  
  5d19f8d8
- J
  
  Added oneDNN reduce_op GRAD kernel (#32280) · ead83422
  由 jakpiase 提交于 4月 21, 2021
  
  ead83422
20 4月, 2021 3 次提交
- F
  add paddle.nn.unfold #32297 (#32298) · 186682fe
  由 FNRE 提交于 4月 20, 2021
```
* add paddle.nn.unfold
* update Parameters of Unfold
```
  186682fe
- W
  
  save/load program (#32336) · e0a52fd7
  由 WeiXin 提交于 4月 20, 2021
  
  e0a52fd7
- W
  
  support `numpy.array/asarray(tensor) -> ndarray`, test=develop (#32300) · 43926c80
  由 Wenyu 提交于 4月 20, 2021
  
  43926c80
19 4月, 2021 4 次提交

[NPU] cherry-pick gc/dataloader/save&load/optimization from ascendrc to develop (#32294) · cbe5c9f8

由 Leo Chen 提交于 4月 19, 2021

* [NPU] support GarbageCollector for npu (#31874)

* support GarbageCollector for npu

* fix typo

* fix gather_grad

* disable NPUDefaultStreamGarbageCollector on NPU

* [NPU] support npu for memcpy op (#31808)

* support npu for memcpy op

* add ut

* fix ut

* fix typo

* 【NPU】fix bug of using temp vector (#31963)

* fix bug when beta1_pow on cpu (#31995)

* [NPU] support npu profiler (#31684)

* support npu profiler

* add python api

* fix bugs

* add wrapper for incomplete type

* update profile proto

* record npu wait

* add xpu placeholder

* fix adam (#32016)

* [NPU] enable async copy and  add wait before sync operation (#31956)

* enable async copy and  add wait before sync operation

* remove unneccessary wait

* add FillNpuTensorWithConstant

* refine

* fix fill_constant

* make TensorFromVector/TensorToVector sync

* [NPU] Support dataloader on npu place. (#31867)

* [NPU] Wait on NPUPlace (#32086)

* [NPU] fix cast op (#32121)

* fix npu kernel of cast op to handle casting to same dtype

* add comments

* [NPU] support cann 20.3 (#32044)

* fix compile problem on cann 20.3

* fix ut

* fix test_mul

* fix check_finite_and_scale

* fix lookup_table_v2_grad

* fix cmake

* support print op

* [NPU] Support npu save load (#31893)

* support save load for NPU

* add save load npu unittest

* support np.array transform in NPU

* fix errors

* delete dygraph in unittest

* add Wait

* fix unittest

* fix review comment

* fix unittest problem

* fix little problem

* change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performance (#32196)

* change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performace

* refine code

* fix NPUDeviceContext in all c++ unittest (#32198)

* fix NPUDeviceContext in all c++ unittest

* refine log
Co-authored-by: Npangyoki <pangyoki@126.com>

* [NPU] Remove TensorFromVector and avoid sync copy in npu op kernel for better performance (#31994)

* enable async copy and  add wait before sync operation

* remove unneccessary wait

* add FillNpuTensorWithConstant

* refine

* fix fill_constant

* change TensorFromVector to FillNpuTensorWithConstant

* fix ignored api

* delete extra unittest

* fix little error

* fix update_loss_scaling_op_npu and check_finite_and_unscale_op_npu

* change TensorCopySync to TensorCopy

* delete useless Wait and add StreamWait

* fix npu_stream error

* fix check_finite_and_unscale_op_npu TensorCopy

* only save stream wait

* fix NPUDeviceContext in all c++ unittest

* delete wait
Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>

* delete useless unittest file (#32206)

* Fix op test (#32231)

* fix conditional block (#32243)

* fix adam bug again (#32246)

* fix compile

* fix ut

* fix ut
Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
Co-authored-by: Npangyoki <pangyoki@126.com>

cbe5c9f8

S
[Hybrid Parallel] Support dp & mp in dygraph (#32323) · ffd40860
由 ShenLiang 提交于 4月 19, 2021
```
* support dp & mp
```
ffd40860

Fix sublayer (#31824) · 4d69eeaa

由 Jiabin Yang 提交于 4月 19, 2021

* fix sublayer error with include_sublayers=False

* add ut

* refactor include_sublayers related api

* fix ut

* fix ut of transformer

* fix ut of transformer

* remove useless code

* change sublayer api

* polish code

* add test for include_self=True

4d69eeaa

J

Add BF16 Constant Initializer and support for other initializer (#31935) · 76cb83e8
由 joanna.wozna.intel 提交于 4月 19, 2021

76cb83e8

17 4月, 2021 1 次提交
- S
  [Hybrid Parallel] Add model parallel support in dygraph (#32248) · 66d46221
  由 ShenLiang 提交于 4月 17, 2021
```
* add model parallel support in dygraph
```
  66d46221
15 4月, 2021 1 次提交
- 1
  tree-based-model (#31696) · a8c3a902
  由 123malin 提交于 4月 15, 2021
```
* add index_dataset and index_sampler for tree-based model
```
  a8c3a902

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致