Commits · e29c2d12cd8fb67354f052d0d81b8c9f20699e35 · PaddlePaddle / Paddle

Aug 16, 2021 · 10 commits
- [amp] dygraph amp support param_group (#34899) · e29c2d12
  Committed by Leo Chen on Aug 16, 2021
```
* dygraph amp support param_group

* remove unused code

* fix doc
```
  e29c2d12
- support margin loss (arcface, cosface, sphereface) for single GPU and cross GPUs (#34247) · b0cb4148
  Committed by Guoxia Wang on Aug 16, 2021
```
* support margin loss (arcface, cosface, sphereface)
```
  b0cb4148
- Enhance tensor shape check for dist op. (#34915) · dc439a12
  Committed by Zhong Hui on Aug 16, 2021
  
  dc439a12
- Support npu op hard_swish and hard_swish_grad (#34608) · fd92d949
  Committed by zyfncg on Aug 16, 2021
```
* Support NPU OP hard_swish and hard_swish_grad

* Support NPU OP hard_swish and hard_swish_grad

* add the unittest to compare the result between npu and cpu

* format the prompt of exception

* replace Min and Max op by ClipByValue op

* fix the precision problem for fp16

* Using HardtanhGrad to improve performance
```
  fd92d949
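
The commit message above mentions replacing separate Min and Max ops with a single ClipByValue op. The NPU kernel itself is not shown on this page; the following is only a plain-Python sketch of why the two forms agree, assuming the commonly used hard_swish definition `x * clip(x + 3, 0, 6) / 6`. The constants and the function names `clip`, `hard_swish_min_max`, and `hard_swish_clip` are assumptions chosen for illustration.

```python
def clip(value, low, high):
    """Clamp value into the closed interval [low, high]."""
    return max(low, min(value, high))


def hard_swish_min_max(x, threshold=6.0, scale=6.0, offset=3.0):
    # Two-step form: a Max against 0 followed by a Min against the threshold.
    return x * min(max(x + offset, 0.0), threshold) / scale


def hard_swish_clip(x, threshold=6.0, scale=6.0, offset=3.0):
    # One-step form: a single clip replaces the Min/Max pair.
    return x * clip(x + offset, 0.0, threshold) / scale


samples = [-4.0, -3.0, -1.0, 0.0, 1.0, 3.0, 10.0]
results = [hard_swish_clip(x) for x in samples]
```

Both functions return the same values for every sample, which is the property that makes the substitution safe in this sketch.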
- [dev] fix dice_loss bug (#34757) · ad6c3b92
  Committed by shangliang Xu on Aug 16, 2021
```
* fix dice_loss bug
```
  ad6c3b92
- Add bcast semantics checks at C++ level to BroadcastTensorsOp (#34874) · e84b2e9b
  Committed by Zhanlue Yang on Aug 16, 2021
  
  e84b2e9b
- [NPU] remove npu int64 kernel for increment op (#34909) · 28279f6f
  Committed by Leo Chen on Aug 16, 2021
  
  28279f6f
- Check whl size (#34767) · 34d188bf
  Committed by tianshuo78520a on Aug 16, 2021
  
  34d188bf
- Op-benchmark CI cpu and gpu (#34631) · 8fb17fc7
  Committed by tianshuo78520a on Aug 16, 2021
```
* notest;pm-op-benchmark

* notest;pm-op-benchmark

* notest;pm-op-benchmark

* notest;pm-op-benchmark

* notest;pm-op-benchmark

* notest;pm-op-benchmark

* notest;test=op_benchmark

* notest;test=op_benchmark

* notest;op_benchmark

* notest;op_benchmark

* notest;op_benchmark

* notest;op_benchmark

* notest;op_benchmark

* notest;op_benchmark

* notest;test=op_benchmark

* notest;op_benchmark

* notest;op_benchmark

* notest;op_benchmark

* fix

* fix
```
  8fb17fc7
- [NPU] add p_norm_op_npu (#34695) · 7316018d
  Committed by ronnywang on Aug 15, 2021
```
* add p_norm_op_npu

* remove p_norm_grad op

* update
```
  7316018d
Aug 14, 2021 · 1 commit
- [hybrid] refine pipeline stage and mp send/recv check (#34870) · 2cd05d5d
  Committed by WangXi on Aug 14, 2021
  
  2cd05d5d
Aug 13, 2021 · 11 commits

Committed by Tongxin Bai on Aug 13, 2021

* OP dot: refactor CPU kernels and get better loop performance.

* Minor fix on code format.

* Fixed minor errors.

* Add new API: einsum

* Update the Einsum unit test.

One case failed with matmul_v2, where the dtype is int64:

a = np.arange(2 * 3 * 1).reshape(2, 3, 1)
b = np.arange(1)
paddle.einsum("...i, ...i", a, b)

* Test cases in test_einsum test floating point dtypes only.

As of now, Paddle only supports float/double dtypes in matmul, which is
one of the building blocks of this Einsum implementation. We decided not to
test einsum against other dtypes.

* Polish format.

* More formatting.

* Format...

* Einsum: improve test coverage.

* Einsum: bug fixes and more testcases for testing error messages

* Einsum: fix format..

* Einsum: fixed typo and format.

* Einsum: format again...

* Einsum: applied suggested changes.

* Einsum API: improve API documentation.

* Einsum API: apply suggested changes.

* Einsum API: Add dygraph only note.

* Einsum API: Add dygraph only note.

* Einsum API: fixed unittest.

8c8667f0
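
The failing case quoted in the commit message above contracts the last axis of both operands and broadcasts over the leading axes. As a rough reference for what the subscripts `...i, ...i` ask for, here is a plain-Python sketch on nested lists. It is not Paddle's implementation, and `last_axis_dot` is a hypothetical helper name introduced only for this illustration.

```python
def last_axis_dot(a, b):
    """Sum a[..., i] * b[i] over i, for a nested-list `a` and a flat list `b`."""
    if a and isinstance(a[0], list):
        # Leading (batch) axes: recurse until the innermost axis is reached.
        return [last_axis_dot(sub, b) for sub in a]
    if len(a) != len(b):
        raise ValueError("last axis of `a` must match the length of `b`")
    return sum(x * y for x, y in zip(a, b))


# Same values as np.arange(2 * 3 * 1).reshape(2, 3, 1) and np.arange(1).
a = [[[0], [1], [2]], [[3], [4], [5]]]
b = [0]
result = last_axis_dot(a, b)
print(result)  # [[0, 0, 0], [0, 0, 0]]
```

The result has the shape of the leading axes of `a`, here 2 by 3, because the shared index `i` is summed away.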

- fix a bug of slice by none index (#34877) · ff4bdac3
  Committed by zyfncg on Aug 13, 2021

ff4bdac3

- Bug fix : Can't load multiple modules of custom c++ op (#34505) · fc6b4a50
  Committed by zyfncg on Aug 13, 2021

* Fix a bug : can't load more than one custom op module

* Fix a bug : can't load more than one custom op module

* add test for load multiple modules of custom c++ op

* add config for Coverage CI

fc6b4a50

- fix generator thread safety bug (#34888) · f421741c
  Committed by Zeng Jinle on Aug 13, 2021

f421741c
- Add EmptyGradOpMaker CI Approval (#34810) · ac56d54e
  Committed by Hao Lin on Aug 13, 2021
```
* Add EmptyGradOpMaker CI Approval, test=develop

* Fix typo in echo_line
```
ac56d54e
- Support sccache distributed storage on windows (#34879) · 8bc4d854
  Committed by zhouweiwei2014 on Aug 13, 2021

8bc4d854
- [NPU] fix bce_loss_npu, test=develop (#34876) · 5b86b999
  Committed by Qi Li on Aug 13, 2021

5b86b999
- fix npu_finalize (#34857) · 17a99760
  Committed by ronnywang on Aug 13, 2021

17a99760
- [Bug-Fix]fix bug of py36 import utils (#34873) · 507ea06f
  Committed by ShenLiang on Aug 13, 2021
```
* fix bug of py36 import
```
507ea06f
- add retry for gethostbyname (#34855) · e92f0388
  Committed by Baibaifan on Aug 13, 2021

e92f0388
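
The commit title above only says that a retry was added around `gethostbyname`; the actual change is not shown on this page. A minimal sketch of the general idea, assuming a fixed number of attempts and a fixed delay between them. The function name `resolve_with_retry`, its parameters, and the injectable `resolver` argument are all hypothetical.

```python
import socket
import time


def resolve_with_retry(host, retries=3, delay=0.5, resolver=socket.gethostbyname):
    """Resolve `host`, retrying on lookup errors up to `retries` attempts."""
    if retries < 1:
        raise ValueError("retries must be at least 1")
    last_error = None
    for attempt in range(retries):
        try:
            return resolver(host)
        except OSError as err:  # socket.gaierror and socket.herror subclass OSError
            last_error = err
            if attempt < retries - 1:
                time.sleep(delay)  # wait before the next attempt
    raise last_error
```

Passing the resolver as an argument keeps the sketch testable without network access; a caller that does not need that can rely on the default, `socket.gethostbyname`.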
- [npu]add unsqueeze2_grad,test=develop (#34733) · 2164ad61
  Committed by andyjpaddle on Aug 13, 2021

2164ad61

Aug 12, 2021 · 11 commits
- [NPU] add meshgrid, test=develop (#34576) · 3f71e8d2
  Committed by Qi Li on Aug 12, 2021
  
  3f71e8d2
- Remove incorrect signal error stack trace (#34842) · 572adccd
  Committed by Chen Weihang on Aug 12, 2021
```
* remove unmatched signal error stack

* fix error writing for cond
```
  572adccd
- Revert "[oneDNN] Fix to issue #34554 (#34623)" (#34838) · dc62a227
  Committed by Chen Weihang on Aug 12, 2021
```
This reverts commit 0a5c99e8.
```
  dc62a227
- fix set_grad_ivar bug of Tensor.backward (#34819) · dffb0b22
  Committed by zhouweiwei2014 on Aug 12, 2021
  
  dffb0b22
- [Inference] Inference python api support fp16 (#34676) · 6326c3ef
  Committed by Wilber on Aug 12, 2021
  
  6326c3ef
- transformer c files (#34706) · 016cc56d
  Committed by Feng Xing on Aug 12, 2021
```
This PR adds fused-transformer-related files that define the C interface, including classes, functions, etc.
```
  016cc56d
- Fix safety-bug of functional.linear (#34696) · 0e28c8bb
  Committed by zhulei on Aug 12, 2021
```
* Fix safety-bug of functional.linear

* Fix safety-bug of functional.linear

* Fix safety-bug of functional.linear

* Fix safety-bug of functional.linear
```
  0e28c8bb
- [HybridParallel]Add Recompute for PipeLineParallel (#34607) · 589d13c5
  Committed by ShenLiang on Aug 12, 2021
```
* add recompute for pp

* add recompute offload

* add recompute partition
```
  589d13c5
- [NPU] Support npu kernel for smooth_l1_loss op (#34674) · cfa69133
  Committed by wuhuachaocoding on Aug 12, 2021
  
  cfa69133
- [NPU] Support npu op expand_v2 and expand_v2_grad (#34764) · bc543e35
  Committed by Fan Zhang on Aug 12, 2021
```
* [NPU] Support npu op expand_v2 and expand_v2_grad

* [NPU] Support npu op expand_v2 and expand_v2_grad

* [NPU] Support npu op expand_v2 and expand_v2_grad

* update test_expand_v2_op_npu.py

* update test_expand_v2_op_npu.py

* modify expand_v2_op_npu.cc

* modify expand_v2_op_npu.cc
```
  bc543e35
- add det_mv3_db & LeViT test case in pr-ci-inference (#34803) · 1c31d9d3
  Committed by Peihan on Aug 12, 2021
```
* add det_mv3_db & LeViT test case in pr-ci-inference

* fix LeViT model dir bugs

* fix grammar error
```
  1c31d9d3
Aug 11, 2021 · 7 commits

- [oneDNN] Fix to issue #34554 (#34623) · 0a5c99e8
  Committed by Jacek Czaja on Aug 11, 2021

* - Added softmax without caching

* - Binary is no longer manually cached

* - Activation onednn caching removed

* - Removed manual caching of activation

* - modified UT

* - fix

* - fix

* - fixes to building

* - fix

* - fix

* - fix to UT

* - Faulty UT workaround

* - approval workaround

* - Fixes after review

* - compilation fixes

* - more lint fixes

* - more fixes after review

* - fixes after another round of review

0a5c99e8

- [AMP] add state_dict and load_state_dict and unittest for class GradScaler (#34300) · 99f8f5c8
  Committed by zhangbo9674 on Aug 11, 2021

* add state_dict and load_state_dict and unittest for class GradScaler

* refine unittest for coverage of load_state_dict

* refine comments of code-block

* refine some comments

* refine state_dict code and unittest

* add #require gpu, xpu for GradScaler get/set example code

* add #require gpu, xpu for GradScaler get/set example code

* refine example code

* refine unittest for state_dict

* refine unittest for state_dict

* fix bug of DataLoader in TestGradScalerStateDict

* add flag FLAGS_cudnn_deterministic

99f8f5c8

- `set_value_grad` propagate gradients to `Input` and `TensorValue` (#34304) · 9d02313c
  Committed by WeiXin on Aug 11, 2021

* add set_value_grad op

* add unittest.

* polish unittest.

* polish code.

* support cuda kernel

* polish code according to CI

* polish code.

* polish code

* remove *.pyc

* polish code.

* add unittest to improve coverage.

* polish code.

9d02313c

- [Paddle TRT]fix_fc_int8_convert; fix_reshape_convert (#34787) · 3429c04b
  Committed by Wangzheee on Aug 11, 2021
```
* fix_fc_reshape_convert

* fix
```
3429c04b
- [NPU] Support npu op flatten_contiguous_range_grad (#34798) · fc537d4f
  Committed by Fan Zhang on Aug 11, 2021

fc537d4f
- [NPU] add while, read_from_array and write_to_array npu op (#34755) · 234c21ac
  Committed by pangyoki on Aug 11, 2021
```
* add while read_from_array write_to_array npu op

* optimize unittest
```
234c21ac
- split_op for npu (#34699) · d45d3112
  Committed by Roc on Aug 11, 2021

d45d3112

PaddlePaddle / Paddle: synced successfully about 1 year ago
