提交 · f59bcb1c781038b871154118f31658c0fff8b16a · PaddlePaddle / Paddle

01 6月, 2022 11 次提交

J
[AutoParallel & Science] Miscellaneous improvements (#43139) · f59bcb1c
由 JZ-LIANG 提交于 6月 01, 2022
```
* adapt for 10 loss

* partitioner support optimizer
```
f59bcb1c
B

Update creation.py (#42915) · ff1789ca
由 BrilliantYuKaimin 提交于 6月 01, 2022

ff1789ca

update xpu cmake: xdnn 0601 (#43051) · 2aea0db8

由 houj04 提交于 6月 01, 2022

* update xpu cmake: xdnn 0527. test=kunlun

* update to xdnn 0531.

* update to xdnn 0531. test=kunlun

* update to xdnn 0601. test=kunlun

2aea0db8

Z
Unittest parallel (#43042) · dc26d07b
由 zhangchunle 提交于 6月 01, 2022
```
unittest parallel
Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>
```
dc26d07b
R
Add pinned memory to host memory stats (#43096) · c4b7c485
由 Ruibiao Chen 提交于 6月 01, 2022
```
* Add pinned memory to HostMemoryStats

* Add macro for WrapStatAllocator

* Fix CI errors
```
c4b7c485
Z

fluid code transfer in nn.functional (#42808) · 0e10f247
由 zhiboniu 提交于 6月 01, 2022

0e10f247

fix the bug of adamw which set the attribute in param group not working (#43013) · 77bae9a4

由 Guoxia Wang 提交于 6月 01, 2022

* fix the bug of adamw which set the attribute in param group not working

* fix undefined variable

* fix api example typo

* add unittest

* fix unittest typo

77bae9a4

H

[revert] revert inference accelarate #43125 · 81622708
由 huzhiqiang 提交于 6月 01, 2022

81622708
C

add some comp op costs (#43114) · bd018360
由 caozhou 提交于 6月 01, 2022

bd018360

[Auto Parallel] Add miscellaneous improvements (#43108) · 010aba33

由 Yulong Ao 提交于 6月 01, 2022

* [Auto Parallel] Add the parallel tuner

* [Auto Parallel] Improve the parallel tuner and fix some bugs

* upodate cost model

* update import Resharder by dist op

* update cost model

* fix comp cost bug

* update cost model

* [Auto Parallel] Amend the dist attr for #processses=1

* update cost model and tuner

* update cost model and tuner

* update cost model and tuner

* update cluster

* update reshard

* [Auto Parallel] Add the estimation from the cost model

* [Auto Parallel] Reimplement the backup and restore functions

* [Auto Parallel] Fix the bugs of the parallel tuner

* [Auto Parallel] Update the engine api and dist context

* [Auto Parallel] Work around the high order grad problem

* [Auto Parallel] Add some miscellaneous improvements

* [Auto Parallel] Add a unittest for DistributedContext
Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>

010aba33

[Yaml]add conv3d, depthwise_conv2d yaml (#42807) · 5f2c251c

由 chentianyu03 提交于 6月 01, 2022

* add conv3d yaml

* add conv3d_grad, conv3d_double_grad

* add final_state_conv3d test case

* add conv3d double test case

* add depthwise_conv2d grad yaml

* add depthwise_conv2d double grad test case

* modify the order of args

* add depthwise_conv2d_grad_grad config

5f2c251c

31 5月, 2022 22 次提交
- S
  Remove mkldnn attributes from base ops (#42852) · 4b89120b
  由 Sławomir Siwek 提交于 5月 31, 2022
```
* remove attrs from base op

* fix typos

* remove brelu

* undo removing code related to matmul

* remove whitespaces

* undo changes in matmul

* remove empty line
```
  4b89120b
- P
  add double_grad and triple_grad inplace info in backward.yaml (#43124) · 94194275
  由 pangyoki 提交于 5月 31, 2022
```
* add double_grad and triple_grad inplace info in backward.yaml

* only generate inplace api in forward
```
  94194275
- W
  [Eager] Fix Full Zero (#43048) · 462ae005
  由 wanghuancoder 提交于 5月 31, 2022
```
* fix full zero

* fix full zero

* fix full zero

* fix full zero

* refine

* refine

* refine
```
  462ae005
- S
  
  put set error_code infront to avoid being skipped (#43014) · d70e45bc
  由 Sing_chan 提交于 5月 31, 2022
  
  d70e45bc
- C
  [Phi] Polish assign kernel copy impl (#43061) · c9e7c407
  由 Chen Weihang 提交于 5月 31, 2022
```
* fix assign kernel copy impl

* fix test failed
```
  c9e7c407
- B
  
  test=document_fix Verified (#42919) · 172739d4
  由 BrilliantYuKaimin 提交于 5月 31, 2022
  
  172739d4
- C
  
  [MLU] add mlu kernel for abs op (#43099) · cb195fa0
  由 cambriconhsq 提交于 5月 31, 2022
  
  cb195fa0
- Y
  [IPU] support paddle.distributed.launch with IPUs (#43087) · e680d581
  由 yaozhixin 提交于 5月 31, 2022
```
* [IPU] support paddle.distributed.launch with IPUs

* add device_num to env_args_mapping
```
  e680d581
- D
  update RandomCrop class code annotation; test=document_fix (#42428) · 48409529
  由 David Nicolas 提交于 5月 31, 2022
```
* update RandomCrop class code annotation; test=document_fix

* update adjust_brightness api in functional.py test=document_fix

* udpate uniform api in random.py

* update transforms.py
```
  48409529
- B
  
  test=document_fix (#42922) · 632027d7
  由 BrilliantYuKaimin 提交于 5月 31, 2022
  
  632027d7
- C
  [Eager] Polish append op using for model perf (#43102) · e9589e35
  由 Chen Weihang 提交于 5月 31, 2022
```
* polish append op using

* fix var error

* fix group norm impl
```
  e9589e35
- A
  [NPU] fix arg_max and reduce_max (#42887) · f9e55dee
  由 Aganlengzi 提交于 5月 31, 2022
```
* fix arg_max and reduce_max

* add arg_max ut
```
  f9e55dee
- T
  【PaddlePaddle Hackathon 2】16 新增 API RRelu (#41823) · 21e1d10f
  由 thunder95 提交于 5月 31, 2022
```
* rrelu逻辑部分

* unregistered op kernel (unresolved)

* commit before merge

* 丰富测试用例

* 修复rrelu-sig的bug

* 修复cpu环境测试

* 修改拼写错误

* 修改code format

* 尝试优化测试用例timeout的问题

* 优化测试用例

* 移除seed, 优化随机函数

* update en doc for rrelu

* fix rrelu en docs, test=document_fix

* add paper link for en docs, test=document_fix

* udpate en doc

* add r,test=document_fix
```
  21e1d10f
- H
  
  fix bugs (#43115) · 6319dd83
  由 Haohongxiang 提交于 5月 31, 2022
  
  6319dd83
- X
  [EinsumOp] Make EinsumOp support bfloat16. (#43085) · a4bb38cb
  由 xiongkun 提交于 5月 31, 2022
```
* change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0

* make EInsumOP support bf16

* add unittest for BF16

* add condition for test_BF16

* fix bugs

* fix
```
  a4bb38cb
- L
  Fix the underflow of fp16 fake quantize operators (#43088) · 0ae8a2d6
  由 Leo Chen 提交于 5月 31, 2022
```
Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
```
  0ae8a2d6
- J
  Support backward prune for eager intermidiate (#43111) · 4700a08e
  由 Jiabin Yang 提交于 5月 31, 2022
```
* support is empty

* fix error

* fix code error

* change to fake empty

* using fake empty first

* using fake empty first

* Support backward prune in fluid
```
  4700a08e
- L
  Rename dropout is test (#43098) · 67497119
  由 Li Min 提交于 5月 31, 2022
```
* replace dropout_is_test with is_test.
* improve atol on a100.
```
  67497119
- W
  [Eager] fix collective_global_gather (#43090) · ae45d981
  由 Weilong Wu 提交于 5月 31, 2022
```
* [Eager] fix collective_global_gather

* fix eager_ode = 1
```
  ae45d981
- Z
  add embedding yaml (#43029) · 2785f876
  由 zyfncg 提交于 5月 31, 2022
```
* add embedding yaml

* fix infermeta bug

* fix bug of selected_rows infer_meta

* fix selected_rows

* add unittest
```
  2785f876
- W
  
  fix slice plugin (#43110) · b779d2b8
  由 Wilber 提交于 5月 31, 2022
  
  b779d2b8
- J
  OneDNN md-in-tensor refactoring part 5: Memory descriptor enabled for... · 12d8a567
  由 jakpiase 提交于 5月 30, 2022
```
OneDNN md-in-tensor refactoring part 5: Memory descriptor enabled for elementwises, reductions and expand_v2 ops (#43036)

* enabled md in elementwises, reductions and expand_v2

* CI fix for invalid numpy copy

* fixed formatting

* CI rerun

* changes after review
```
  12d8a567
30 5月, 2022 7 次提交
- C
  
  [mlu] add one_hot_v2 mlu kernel (#43025) · 13a21cf7
  由 Chenxiao Niu 提交于 5月 30, 2022
  
  13a21cf7
- L
  Add fused_bias_dropout_residual_ln op and layer. (#43062) · dceccd9d
  由 Li Min 提交于 5月 30, 2022
```
* add fused_bias_dropout_residual_ln op and layer.
```
  dceccd9d
- H
  
  fix scale_matmul fuse pass (#43089) · e1e0deed
  由 heliqi 提交于 5月 30, 2022
  
  e1e0deed
- S
  [TensorRT] Fix delete fill_constant pass (#43053) · 1448520d
  由 shentanyue 提交于 5月 30, 2022
```
* update lite compile cmake

* Update delete_fill_constant_op_pass.cc

* Update analysis_config.cc
```
  1448520d
- P
  support backward inplace in eager fluid dygraph mode (#43054) · ed2886de
  由 pangyoki 提交于 5月 30, 2022
```
* support backward inplace in eager fluid mode

* fix

* fix

* optimize format

* little change
```
  ed2886de
- P
  
  add backward inplace api (#42965) · 3d56d419
  由 pangyoki 提交于 5月 30, 2022
  
  3d56d419
- C
  
  Implement fused_gate_attention operator for AlphaFold. (#42018) · fdcdbec5
  由 crystal 提交于 5月 30, 2022
  
  fdcdbec5

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功