Commits · dc439a129c3205023b973c754a7ccdee4a5f567f · 机器未来 / Paddle

16 Aug, 2021 · 4 commits
- Support npu op hard_swish and hard_swish_grad (#34608) · fd92d949
  Committed by zyfncg on Aug 16, 2021
```
* Support NPU OP hard_swish and hard_swish_grad

* Support NPU OP hard_swish and hard_swish_grad

* add the unittest to compare the result between npu and cpu

* format the prompt of exception

* replace Min and Max op by ClipByValue op

* fix the precision problem for fp16

* Using HardtanhGrad to improve performance
```
- [dev] fix dice_loss bug (#34757) · ad6c3b92
  Committed by shangliang Xu on Aug 16, 2021
```
* fix dice_loss bug
```
- Add bcast semantics checks at C++ level to BroadcastTensorsOp (#34874) · e84b2e9b
  Committed by Zhanlue Yang on Aug 16, 2021
- [NPU] add p_norm_op_npu (#34695) · 7316018d
  Committed by ronnywang on Aug 15, 2021
```
* add p_norm_op_npu

* remove p_norm_grad op

* update
```
14 Aug, 2021 · 1 commit
- [hybrid] refine pipeline stage and mp send/recv check (#34870) · 2cd05d5d
  Committed by WangXi on Aug 14, 2021
13 Aug, 2021 · 6 commits

Committed by Tongxin Bai on Aug 13, 2021

* OP dot: refactor CPU kernels and get better loop performance.

* Minor fix on code format.

* Fixed minor errors.

* Add new API: einsum

* Update the Einsum unit test.

One case failed with matmul_v2, where the dtype is int64:

a = np.arange(2 * 3 * 1).reshape(2, 3, 1)
b = np.arange(1)
paddle.einsum("...i, ...i", a, b)

* Test cases in test_einsum test floating point dtypes only.

As of now Paddle only supports float/double dtypes in matmul, which is
one of building blocks of this Einsum implementation. We decide not to
test einsum against other dtypes.

* Polish format.

* More formatting.

* Format...

* Einsum: improve test coverage.

* Einsum: bug fixes and more testcases for testing error messages

* Einsum: fix format..

* Einsum: fixed typo and format.

* Einsum: format again...

* Einsum: applied suggested changes.

* Einsum API: improve API documentation.

* Einsum API: apply suggested changes.

* Einsum API: Add dygraph only note.

* Einsum API: Add dygraph only note.

* Einsum API: fixed unittest.

8c8667f0

fix a bug of slice by none index (#34877) · ff4bdac3
Committed by zyfncg on Aug 13, 2021

Bug fix: Can't load multiple modules of custom c++ op (#34505) · fc6b4a50

Committed by zyfncg on Aug 13, 2021

* Fix a bug : can't load more than one custom op module

* Fix a bug : can't load more than one custom op module

* add test for load multiple modules of custom c++ op

* add config for Coverage CI

[NPU] fix bce_loss_npu, test=develop (#34876) · 5b86b999
Committed by Qi Li on Aug 13, 2021

add retry for gethostbyname (#34855) · e92f0388
Committed by Baibaifan on Aug 13, 2021

[npu]add unsqueeze2_grad,test=develop (#34733) · 2164ad61
Committed by andyjpaddle on Aug 13, 2021

12 Aug, 2021 · 6 commits
- [NPU] add meshgrid, test=develop (#34576) · 3f71e8d2
  Committed by Qi Li on Aug 12, 2021
- fix set_grad_ivar bug of Tensor.backward (#34819) · dffb0b22
  Committed by zhouweiwei2014 on Aug 12, 2021
- Fix safety-bug of functional.linear (#34696) · 0e28c8bb
  Committed by zhulei on Aug 12, 2021
```
* Fix safety-bug of functional.linear

* Fix safety-bug of functional.linear

* Fix safety-bug of functional.linear

* Fix safety-bug of functional.linear
```
- [HybridParallel]Add Recompute for PipeLineParallel (#34607) · 589d13c5
  Committed by ShenLiang on Aug 12, 2021
```
* add recompute for pp

* add recompute offload

* add recompute partition
```
- [NPU] Support npu kernel for smooth_l1_loss op (#34674) · cfa69133
  Committed by wuhuachaocoding on Aug 12, 2021
- [NPU] Support npu op expand_v2 and expand_v2_grad (#34764) · bc543e35
  Committed by Fan Zhang on Aug 12, 2021
```
* [NPU] Support npu op expand_v2 and expand_v2_grad

* [NPU] Support npu op expand_v2 and expand_v2_grad

* [NPU] Support npu op expand_v2 and expand_v2_grad

* update test_expand_v2_op_npu.py

* update test_expand_v2_op_npu.py

* modify expand_v2_op_npu.cc

* modify expand_v2_op_npu.cc
```
11 Aug, 2021 · 14 commits

[AMP] add state_dict and load_state_dict and unittest for class GradScaler (#34300) · 99f8f5c8

Committed by zhangbo9674 on Aug 11, 2021

* add state_dict and load_state_dict and unittest for class GradScaler

* refine unittest for coverage of load_state_dict

* refine comments of code-block

* refine some comments

* refine state_dict code and unittest

* add #require gpu, xpu for GradScaler get/set example code

* add #require gpu, xpu for GradScaler get/set example code

* refine example code

* refine unittest for state_dict

* refine unittest for state_dict

* fix bug of DataLoader in TestGradScalerStateDict

* add flag FLAGS_cudnn_deterministic

`set_value_grad` propagate gradients to `Input` and `TensorValue` (#34304) · 9d02313c

Committed by WeiXin on Aug 11, 2021

* add set_value_grad op

* add unittest.

* polish unittest.

* polish code.

* support cuda kernel

* polish code according to CI

* polish code.

* polish code

* remove *.pyc

* polish code.

* add unittest to improve coverage.

* polish code.

[NPU] Support npu op flatten_contiguous_range_grad (#34798) · fc537d4f
Committed by Fan Zhang on Aug 11, 2021

[NPU] add while, read_from_array and write_to_array npu op (#34755) · 234c21ac
Committed by pangyoki on Aug 11, 2021
```
* add while read_from_array write_to_array npu op

* optimize unittest
```

split_op for npu (#34699) · d45d3112
Committed by Roc on Aug 11, 2021

[NPU] add momentum_op_npu and test (#34082) · 9e3e08f0
Committed by ronnywang on Aug 11, 2021
```
* add momentum_op_npu and test

* update

* fix hang
```

[NPU] add reduce_mean_op_npu and test (#34053) · f6fab559
Committed by ronnywang on Aug 11, 2021
```
* add reduce_mean_op_npu and test

* remove skip.If

* update
```

[NPU] add batch_norm_op_npu and test (#34056) · 9ed5db28
Committed by ronnywang on Aug 11, 2021
```
* add batch_norm_op_npu and tests

* remove skip.If

* fix bug
```

[hybrid] pp+dp support fp16 allreduce (#34762) · 4d7af372
Committed by WangXi on Aug 11, 2021

add the basic apis for auto_parallel (#33804) · 3f962e77
Committed by lilong12 on Aug 11, 2021
```
* add auto_parallel apis
```

[HybridParallel] Support save/load for PipeLineParallel (#34768) · 88f2f4a4
Committed by ShenLiang on Aug 11, 2021
```
* add save/load for pipelineparallel

* add save/load
```

[NPU] Add exp and exp_grad npu op (#34612) · b5ec65e1

Committed by 0x45f on Aug 11, 2021

* add exp and exp_grad npu op

* modify support register type

* remove empty line and remove exp_grad support data type int/int64

* move exp and exp_grad kernel to activation_op_npu.cc, delete attrs

* move code to activation_op_npu.cc

[NPU] add elementwise_min_grad_op_npu,test=develop (#34731) · 45af4f2a
Committed by andyjpaddle on Aug 11, 2021

[NPU] Support NPU kernel for TopKV2 op (#34599) · bb01b120

Committed by From00 on Aug 11, 2021

* Add NPU kernel for TopKV2 op

* deleted unnecessary cache file static_mode_white_list.cpython-37.pyc

* A draft for error checking

* A commit with accuracy error for float32 data

* Modify codes according to the review comments

* Modify codes according to the review comments

10 Aug, 2021 · 8 commits

[NPU] Support npu kernel for flatten_contiguous_range op, test=develop (#34642) · 79be8427

Committed by Liu-xiandong on Aug 10, 2021

* fix npu compile error, test=develop

* [NPU] Support npu kernel for flatten_contiguous_range op, test=develop

* [NPU] Support npu kernel for flatten_contiguous_range op, test=develop

* [NPU] Support npu kernel for flatten_contiguous_range op, test=develop

* [NPU] Support npu kernel for flatten_contiguous_range op, test=develop

* [NPU] Support npu kernel for flatten_contiguous_range op, test=develop

* [NPU] Support npu kernel for flatten_contiguous_range op, test=develop

* [NPU] Support npu kernel for flatten_contiguous_range op, test=develop

* Update flatten_op_npu.cc

* Update flatten_op_npu.cc
Co-authored-by: qili93 <qili93@qq.com>

[NPU] add squared_l2_norm squared_l2_norm_grad and tests (#34708) · b64312fc
Committed by Aganlengzi on Aug 10, 2021
```
* [NPU] add squared_l2_norm squared_l2_norm and tests

* [NPU] replace Square&ReduceSumD with SquareSumV1
```

Support npu op fill_any_like (#34518) · e8df3226

Committed by zyfncg on Aug 10, 2021

* Support npu kernel for fill_any_like op

* modify the description of exception

* remove useless template element

* remove useless decorator

* fix the code format error

[NPU] Support op kernel for Fill constant batch size like op (#34721) · ed2641cb

Committed by andyjpaddle on Aug 10, 2021

* fix npu compile error, test=develop

* add fill constant batch size like op npu,test=develop
Co-authored-by: qili93 <qili93@qq.com>

fix a quantization bug (#34647) · cfd49acc
Committed by XGZhang on Aug 10, 2021

Support npu kernel for tile op (#34606) · 8a6aa596

Committed by chenjian on Aug 10, 2021

* Support npu kernel for tile op

* modify according to the comments

* fix compute function

Support npu kernel for expand_as_v2 op (#34620) · 202c2402

Committed by chenjian on Aug 10, 2021

* Support npu kernel for expand_as_v2 op

* modify the registry data type name

* fix test unit

* fix npu compile error, test=develop

* fix compute function
Co-authored-by: qili93 <qili93@qq.com>

Fix error of HSigmoidLoss (#34719) · 3f32b730
Committed by Linjie Chen on Aug 10, 2021
```
* Fix error of HSigmoidLoss

* update unittest

* update unittest
```

09 Aug, 2021 · 1 commit
- [NPU] Support npu op flatten2_grad (#34669) · 7afd31bb
  Committed by YuanRisheng on Aug 09, 2021
