提交 · cbda49e6fe4e1b982325211f1d9384216b189522 · PaddlePaddle / Paddle

16 9月, 2022 5 次提交

J

Correct order of passes (#45936) · cbda49e6
由 joanna.wozna.intel 提交于 9月 16, 2022

cbda49e6

support pow with scalar input, square, cast, var, size operators for deepxde (#46024) · 1711407d

由 Xiaoxu Chen 提交于 9月 16, 2022

* add reduce_mean,reduce_sum primitive ops
* add ne_p gt_p primitive operators
* add ge_p abs_p primitive oparators
* add cast primitive operators
* add pow,square prim2oirg rules
* add elementwise_div orig2prim rule

1711407d

Unify core avx and core_noavx to libpaddle (#46095) · 267d71a4

由 Chen Weihang 提交于 9月 16, 2022

* unify  core_avx and core_noavx

* fix except error

* revert mac compile logic

* revert dylib to so

* add core_noavx branch

* remove core_noavx

* replace paddle_core by lib paddle

* polish var name

* replace paddle_core by libpaddle

* update custom device commit

* polish code by comments

267d71a4

refactor mp. (#45803) · fa97e5ba

由 wuhuachaocoding 提交于 9月 16, 2022

* refactor mp.

* update setup.py.

* update mp_layers.py for compatibility.

* add documents for mp_layers.py

* update init.py

* update collective.py.

* update.

* update mp_ops.py

* update.

* update code style.

* update code style.

fa97e5ba

W

Support both use_calc_stream and sync_op in send recv APIs (#46023) · ae00f428
由 Wen Sun 提交于 9月 16, 2022

ae00f428

15 9月, 2022 21 次提交

Gloo update (#45584) · 92e1f64b

由 ziyoujiyi 提交于 9月 15, 2022

* back fl

* delete ssl cert

* .

* make warning

* .

* unittest paral degree

* solve unittest

* heter & multi cloud commm ready

* .

* .

* fix gloo compile warning

92e1f64b

H
[jit] skip forward save (#45901) · 483ba282
由 Hui Zhang 提交于 9月 15, 2022
```
* skip forward save

* fix bug

* more ci for jit skip forward
```
483ba282

[Auto Parallel] Improve the APIs (#45776) · b042a3b1

由 Yulong Ao 提交于 9月 15, 2022

* [Auto Parallel] Use c++ dist attr in the completion process

* [Auto Parallel] Add minor changes

* [Auto Parallel] Use c++ dist attr in the completion process

* [Auto Parallel] Add minor changes

* [Auto Parallel] Add the serialization process for dist attrs

* [Auto Parallel] Remove unnecessary comments

* [Auto Parallel] Fix some bugs

* [Auto Parallel] Fix the code style

* [Auto Parallel] Remove unnecessary impls

* [Auto Parallel] Fix the importing error

* [Auto Parallel] Fix the copy from bugs of op dist attr

* [Auto Parallel] Replace the use of constexpr if

* [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh

* [Auto Parallel] Change API of the completion unittest

* [Auto Parallel] Fix the bug when set_attr an int

* [Auto Parallel] Add the unittest for the serialization

* [Auto Parallel] Add some unit tests

* [Auto Paralle] Unify the strategy

* [Auto Parallel] Improve the engine api

* [Auto Parallel] Reset the changes made to the framework

* [Auto Parallel] Change the engine unittest

* [Auto Parallel] Update API of the completion and partitioner

* [Auto Parallel] Update unit tests using engine api

* update shard annotation

* [Auto Parallel] Remove the modifications of other modules

* [Auto Parallel] Add docs for APIs

* add new strategy

* [Auto Parallel] Replace the logger

* [Auto Parallel] Restore the test_program.py

* [Auto Parallel] Change the import rules

* [Auto Parallel] Add the examples for Engine

* [Auto Parallel] Do some minor changes

* [Auto Parallel] Remove yaml dependency

* [Auto Parallel] Fix the unittests

* add valid after train

* bug fix
Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com>
Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
Co-authored-by: Ncaozhou <48191911+Caozhou1995@users.noreply.github.com>

b042a3b1

H
refine PADDLE_WITH_MKLDNN code (#46053) · ea96172e
由 HongyuJia 提交于 9月 15, 2022
```
* refine PADDLE_WITH_MKLDNN code

* fix data_norm_op

* polish addmm_op
```
ea96172e
G

remove tmp fp32 var for gaussian_random (#46033) · 3671d114
由 Guoxia Wang 提交于 9月 15, 2022

3671d114
N

Revert "Fix argsort in XPU black list for XPU KP (#45975)" (#46064) · f3206b09
由 niuliling123 提交于 9月 15, 2022

f3206b09

updating mul and matmul with set_mem_desc (#45624) · 416e0de7

由 Jacek Czaja 提交于 9月 15, 2022

* - mul & matmul changes

- fix

- bs16 correction of strides

* - cosmetic fixes

* - lint

* - fix

* - fix

* - format -> mem_desc

* - fix

* - fix

* - fix

* - fix

* - fix

416e0de7

N

[CodeStyle][W291] trim trailing whitespace in NPU unittest file (#46042) · 5022dd9b
由 Nyakku Shigure 提交于 9月 15, 2022

5022dd9b
N

[CodeStyle] trailing whitespace hook for doc and cpp related files (#46067) · 710efdae
由 Nyakku Shigure 提交于 9月 15, 2022

710efdae
傅

Optimize flip kernel by eliminating H2D data transfer, test=develop (#46046) · b3283f4c
由傅剑寒提交于 9月 15, 2022

b3283f4c
W

fix_recover_remove_padding kernel (#46050) · 65bdd80b
由 Wangzheee 提交于 9月 15, 2022

65bdd80b

Clear extra attrs of elementwise op in OpMaker (#45845) · b26efe0d

由 zyfncg 提交于 9月 15, 2022

* clear extra attrs of elementwise op in opmaker

* fix op_debug_string_test

* fix bug of grad_add

* fix sort of runtime attrs

b26efe0d

W
Support 0 shapes input Tensor for MKL slice (#45930) · 1d78681d
由 WangZhen 提交于 9月 15, 2022
```
Support 0 shapes input Tensor for MKL slice kernel
```
1d78681d
L
Performance fix for broadcast kernel [Part3] (#45854) · f48b1264
由 limingshu 提交于 9月 15, 2022
```
* first commit

* fix some bugs in code

* fix bugs

* to optimize merge one dimension feature
```
f48b1264
N

[CodeStyle] trim trailing whitespace in .h, .cc, .cu, etc. (#46006) · 8dde7aea
由 Nyakku Shigure 提交于 9月 15, 2022

8dde7aea
W

General Plugin Mechanism (#45355) · bc77e6d5
由 weishengying 提交于 9月 15, 2022

bc77e6d5
W
[Eager] saved_tensors_hooks (#45763) · b294f054
由 wanghuancoder 提交于 9月 15, 2022
```
* saved_tensors_hooks
```
b294f054
L

add determine action for embed_grad and index_add. (#46040) · 0c40d889
由 Li Min 提交于 9月 15, 2022

0c40d889
J
[Eager] Optimize log (#45783) · 54a43981
由 Jiabin Yang 提交于 9月 15, 2022
```
* make eager log readable

* fix compile error

* recover test

* invoke ci again
```
54a43981
Z
Delete eigen header in data_type.h (#46036) · 34510e8f
由 zyfncg 提交于 9月 15, 2022
```
* delete eigen header in data_type.h

* fix complie bug

* refactor
```
34510e8f
S

add trailing whitespace hook (#44474) · 49f6c245
由 Sing_chan 提交于 9月 15, 2022

49f6c245

14 9月, 2022 14 次提交
- N
  [CodeStyle][W291] trim trailing whitespace in python file (#45937) · de8c0ba5
  由 Nyakku Shigure 提交于 9月 14, 2022
```
* trim trailing whitespace

* fix `.cmake-format.py`

* revert npu ut changes, avoid npu ci error
```
  de8c0ba5
- J
  Support inference compilation in training package (#46008) · cbe64cc1
  由 JingZhuangzhuang 提交于 9月 14, 2022
```
* merge python lib
* Update third_party.cmake
* Update CMakeLists.txt
```
  cbe64cc1
- P
  
  new executor support compiled_program._graph (#46025) · 9718791c
  由 pangyoki 提交于 9月 14, 2022
  
  9718791c
- C
  
  update op compat ci rule, test=document_fix (#46037) · 13f4bbe8
  由 Chen Weihang 提交于 9月 14, 2022
  
  13f4bbe8
- J
  delay tensorrt registry (#45824) · d7d35ff8
  由 JingZhuangzhuang 提交于 9月 14, 2022
```
* Delay TensorRT registry
* Add unused define
* Fix TensorRT test
* fix function to reference
* Update trt_plugin.h
```
  d7d35ff8
- C
  
  normize yaml backward op label (#46028) · 6891a4fe
  由 Chen Weihang 提交于 9月 14, 2022
  
  6891a4fe
- J
  [PHI] Support bmm and bmm_grad in xpu (#45887) · 6bd2762c
  由 Jiabin Yang 提交于 9月 14, 2022
```
* support bmm and bmm_grad in xpu

* add error removal

* test=kunlun

* refactor code for better structure

* test=kunlun

* add fp16 kernel for bmm

* test=kunlun
```
  6bd2762c
- C
  add convert rules for fill_any_like op in paddle science (#45985) · 4fac4e77
  由 Charles-hit 提交于 9月 14, 2022
```
* add convert rules for fill_any_like op in paddle science

* add unit test for fill_any_like op in paddle science

* modify fill_any_like convert rule

* modify fill_any_like convert rule dtype
```
  4fac4e77
- W
  
  CastPyArg2IntArray use int64_t (#45919) · c53e92fc
  由 wanghuancoder 提交于 9月 14, 2022
  
  c53e92fc
- Z
  
  [Sparse]Remove unused code (#46021) · 0b82fb32
  由 zhangkaihuo 提交于 9月 14, 2022
  
  0b82fb32
- L
  
  Support fp16 for index_select and index_add (#45601) · 61012a76
  由 Li Min 提交于 9月 14, 2022
  
  61012a76
- N
  [CodeStyle] trim trailing whitespace in .md and .rst (#45990) · 3404ff67
  由 Nyakku Shigure 提交于 9月 14, 2022
```
* [CodeStyle] trim trailing whitespace in .md and .rst

* empty commit, test=document_fix
```
  3404ff67
- L
  Migrate scale and scatter to phi, and modify the code style for... · 1349584e
  由 Leo Guo 提交于 9月 14, 2022
```
Migrate scale and scatter to phi, and modify the code style for roi_align_kernel. test=kunlun (#45938)
```
  1349584e
- Z
  fix trt multiclass_nms3 (#45166) · f85f2e83
  由 Zhang Jun 提交于 9月 14, 2022
```
* update

* update

* update
```
  f85f2e83

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功