提交 · e69cc215ad0bafaeb47b577a709d793ca264f355 · PaddlePaddle / Paddle

31 8月, 2021 8 次提交
- Y
  [cherry-pick][hybrid performance] optim the grad fuse for pipeline mode by... · e69cc215
  由 Yuang Liu 提交于 8月 31, 2021
```
[cherry-pick][hybrid performance] optim the grad fuse for pipeline mode by sorting the grad by dtype (#35070) (#35300)
```
  e69cc215
- Y
  [cherry-pick][hybrid performance] Grad fuse for gradient merge under pipeline... · e931cd12
  由 Yuang Liu 提交于 8月 31, 2021
```
[cherry-pick][hybrid performance] Grad fuse for gradient merge under pipeline mode (#35004) (#35299)
```
  e931cd12
- R
  Add flags to control whether to check Nan value of hccl_allreduce_sum. (#35093) (#35298) · d4948bc1
  由 Roc 提交于 8月 31, 2021
```
Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
```
  d4948bc1
- R
  [hybrid] Fix row parallel linear bias (#35186) (#35297) · b36fb036
  由 Roc 提交于 8月 31, 2021
```
Co-authored-by: NWangXi <wangxi16@baidu.com>
```
  b36fb036
- R
  [hybrid][npu] fix npu clear float status in pipeline (#35165) (#35295) · 167685e5
  由 Roc 提交于 8月 31, 2021
```
Co-authored-by: NWangXi <wangxi16@baidu.com>
```
  167685e5
- R
  [hybrid npu] fix npu found_finite in hybrid (#35134) (#35291) · e64105f6
  由 Roc 提交于 8月 31, 2021
```
Co-authored-by: NWangXi <wangxi16@baidu.com>
```
  e64105f6
- Y
  [cherry-pick][Hybrid Performance] Move the cast op of AMP which cast fp32... · 6fb58aef
  由 Yuang Liu 提交于 8月 31, 2021
```
[cherry-pick][Hybrid Performance] Move the cast op of AMP which cast fp32 param to fp16 param to the optimizer (#34965) (#35296)
Co-authored-by: NWangXi <wangxi16@baidu.com>
```
  6fb58aef
- Y
  [cherry-pick] NPU use squared_l2_norm in GradientClipByGlobalNorm (#34836) (#35289) · 38c27d55
  由 Yuang Liu 提交于 8月 31, 2021
```
Co-authored-by: NWangXi <wangxi16@baidu.com>
```
  38c27d55
18 8月, 2021 2 次提交
- G
  support class center sample of PartialFC (#34106) · 100db44f
  由 Guoxia Wang 提交于 8月 18, 2021
```
* support class center sample of PartialFC
```
  100db44f
- W
  [Paddle-TRT] unitest_quant_dequant (#34929) · c7070cb8
  由 Wangzheee 提交于 8月 18, 2021
```
* unitest_quant_dequant

* fix

* fix

* deleted: test_trt_quant_conv2d_dequant_fuse_pass.py

* fix
```
  c7070cb8
17 8月, 2021 8 次提交

R

[NPU]Adamw skip update for npu (#34897) · b4474fb4
由 Roc 提交于 8月 17, 2021

b4474fb4
A

[NPU] add where_index op and tests (#34951) · 1ef21855
由 Aganlengzi 提交于 8月 17, 2021

1ef21855
W
Modify the name of class in unittest with the same name (#34952) · 01a3a2e0
由 WeiXin 提交于 8月 17, 2021
```
* polish unittest.

* polish code

* polish code
```
01a3a2e0
S
[bug fix] fix unfold negative_size_param (#34943) · 8ef1bf87
由 shangliang Xu 提交于 8月 17, 2021
```
* [bug fix] fix unfold negative_size_param
```
8ef1bf87

Align CTC grad scale same with ESPNet (#34729) · 10f9644c

由 Hui Zhang 提交于 8月 16, 2021

* dygraph support more ctc grad scale

* scale for 1.x

* fix unitest

* fix unitest

* format code

* fix unittest

* fix log info

* unittest cov

* fix format;notest,test=cpu,coverage

* skip ctc_loss egs;test=cpu

* warpctc grad cov;test=coverage

* add dygraph test;test=coverage

* format;test=cpu,coverage

* format;test=cpu

* add api compat;test=cpu

* add cpu test

* rename

* rename

* fix

* fix test

* format

* eigen cpu

* eigen gpu grad pass

* cuda gpu pass

* format

* fix ci

10f9644c

Add some passes which can be applied to Program (#34730) · 8046e33d

由 Zeng Jinle 提交于 8月 17, 2021

* add inplace passes and tests

* update

* fix use_cuda undefined
fix compile error of op compat

* add more ut

* fix CPU CI error

* check adam unique

* fix mac/windows ci, improve coverage

* fix ci error

* follow weihang's comment

* fix BlockDesc::MoveFrom

* follow qiuliang's comment

* update

* follow huihuang's comments

8046e33d

Z

add api fill_diagonal_inplace (#34460) · 5de576b0
由 zhiboniu 提交于 8月 17, 2021

5de576b0
K
fix drop_last not work on IterableDataset (#34801) · 16146088
由 Kaipeng Deng 提交于 8月 17, 2021
```
* fix drop_last not work in IterableDataset. test=develop
```
16146088

16 8月, 2021 14 次提交

L
Fix typos in English docs for diag and diagflat. (#34869) · 35ef4180
由 Li Min 提交于 8月 16, 2021
```
* Fix typos in english docs for diag and diagflat.
```
35ef4180

[NPU] Support npu op:(1)arg_min (2)arg_max (#34867) · b1cc4a46

由 veyron95 提交于 8月 16, 2021

* [NPU] Support npu op:(1)arg_min (2)arg_max

* Modify and add unit test cases

* Modify unit test cases

b1cc4a46

[NPU] Add size npu op (#34636) · 49818943

由 0x45f 提交于 8月 16, 2021

* add size npu op

* modify support data type

* no longer use NPU size OP

* remove useless comments, add test case

* fix copyright, remove useless include

49818943

F

[CPU-PSLIB] Add config for scale_sparse_grad in config_fleet.py,test=develop (#34893) · d028214d
由 Fan Zhang 提交于 8月 16, 2021

d028214d
Z

fix iscan bug in test file (#34912) · f6d8ab54
由 zhangchunle 提交于 8月 16, 2021

f6d8ab54
Q

[NPU] add nearest_interp_v2 and nearest_interp_v2_grad, test=develop (#34769) · 3b9f040d
由 Qi Li 提交于 8月 16, 2021

3b9f040d

[NPU] Support NPU kernel for nearest_interp and nearest_interp_grad op (#34881) · e4e8cc9b

由 From00 提交于 8月 16, 2021

* Add NPU kernel for nearest_interp op

* Add grad op

* Modify codes according to the review comments

* Modify codes according to the review comments

e4e8cc9b

add unique_consecutive_op (#34334) · 875cfd57

由 duanboqiang 提交于 8月 16, 2021

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* remove unity build

* add unique_consecutive op

* add unique_consecutive op

* add enable static

* add noqa

* add space line

* add default case.

* add comma

* add space line

* modify unique_consecutive unittest

* optimize ut coverage

* rebase develop

* improve coverage

* update en docs

* update en docs

* update en docs

* update en docs

* update en docs

* update en doc

875cfd57

L
[amp] dygraph amp support param_group (#34899) · e29c2d12
由 Leo Chen 提交于 8月 16, 2021
```
* dygraph amp support param_group

* remove unused code

* fix doc
```
e29c2d12
G
support margin loss (arcface, cosface, sphereface) for single GPU and cross GPUs (#34247) · b0cb4148
由 Guoxia Wang 提交于 8月 16, 2021
```
* support margin loss (arcface, cosface, sphereface)
```
b0cb4148

Support npu op hard_swish and hard_swish_grad (#34608) · fd92d949

由 zyfncg 提交于 8月 16, 2021

* Support NPU OP hard_swish and hard_swish_grad

* Support NPU OP hard_swish and hard_swish_grad

* add the unittest to compare the result between npu ans cpu

* format the prompt of exception

* replace Min and Max op by ClipByValue op

* fix the precision problem for fp16

* Using HardtanhGrad to improve performace

fd92d949

S
[dev] fix dice_loss bug (#34757) · ad6c3b92
由 shangliang Xu 提交于 8月 16, 2021
```
* fix dice_loss bug
```
ad6c3b92
Z

Add bcast semantics checks at C++ level to BroadcastTensorsOp (#34874) · e84b2e9b
由 Zhanlue Yang 提交于 8月 16, 2021

e84b2e9b
R
[NPU] add p_norm_op_npu (#34695) · 7316018d
由 ronnywang 提交于 8月 15, 2021
```
* add p_norm_op_npu

* remove p_norm_grad op

* update
```
7316018d

14 8月, 2021 1 次提交
- W
  
  [hybrid] refine pipeline stage and mp send/recv check (#34870) · 2cd05d5d
  由 WangXi 提交于 8月 14, 2021
  
  2cd05d5d
13 8月, 2021 7 次提交

New Einsum API (#33821) · 8c8667f0

由 Tongxin Bai 提交于 8月 13, 2021

* OP dot: refactor CPU kernels and get better loop performance.

* Minor fix on code format.

* Fixed minor errors.

* Add new API: einsum

* Update the Einsum unit test.

One case failed with matmul_v2, where the dtype is int64:

a = np.arange(2 * 3 * 1).reshape(2, 3, 1)
b = np.arange(1)
paddle.einsum("...i, ...i", a, b)

* Test cases in test_einsum test floating point dtypes only.

As of now Paddle only supports float/double dtypes in matmul, which is
one of building blocks of this Einsum implementation. We decide not to
test einsum against other dtypes.

* Polish format.

* More formatting.

* Format...

* Einsum: improve test coverage.

* Einsum: bug fixes and more testcases for testing error messages

* Einsum: fix format..

* Einsum: fixed typo and format.

* Einsum: format again...

* Einsum: applied suggested changes.

* Einsum API: improve API documentation.

* Einsum API: apply suggested changes.

* Einsum API: Add dygraph only note.

* Einsum API: Add dygraph only note.

* Einsum API: fixed unittest.

8c8667f0

Z

fix a bug of slice by none index (#34877) · ff4bdac3
由 zyfncg 提交于 8月 13, 2021

ff4bdac3

Bug fix : Can't load multiple modules of custom c++ op (#34505) · fc6b4a50

由 zyfncg 提交于 8月 13, 2021

* Fix a bug : can't load more than one custom op module

* Fix a bug : can't load more than one custom op module

* add test for load multiple modules of custom c++ op

* add config for Coverage CI

fc6b4a50

Q

[NPU] fix bce_loss_npu, test=develop (#34876) · 5b86b999
由 Qi Li 提交于 8月 13, 2021

5b86b999
S
[Bug-Fix]fix bug of py36 import utils (#34873) · 507ea06f
由 ShenLiang 提交于 8月 13, 2021
```
* fix bug of py36 import
```
507ea06f
B

add retry for gethostbyname (#34855) · e92f0388
由 Baibaifan 提交于 8月 13, 2021

e92f0388
A

[npu]add unsqueeze2_grad,test=develop (#34733) · 2164ad61
由 andyjpaddle 提交于 8月 13, 2021

2164ad61

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功