提交 · 4b0f1b0cb8566f5b93a7ad412ca38a2789a1fffd · PaddlePaddle / Paddle

08 11月, 2022 29 次提交
- R
  
  [CustomDevice] fix the not ready kernel can not register. (#47758) · 4b0f1b0c
  由 ronnywang 提交于 11月 08, 2022
  
  4b0f1b0c
- J
  [Auto Parallel] Sharding Optimization：Partition Algorithm & Stage2 Parameter... · e5eb3f55
  由 JZ-LIANG 提交于 11月 08, 2022
```
[Auto Parallel] Sharding Optimization：Partition Algorithm & Stage2 Parameter Bucket communication  (#47180)

* partition param by order

* add logging

* reorder opt

* config

* stage2 bucket

* update unitest
```
  e5eb3f55
- W
  
  Fix compiler error with_trt (#47716) · 6934ae2b
  由 Wilber 提交于 11月 08, 2022
  
  6934ae2b
- L
  
  refine comm api implementation (#47713) · 84c9a0d6
  由 LiYuRio 提交于 11月 08, 2022
  
  84c9a0d6
- [Zero-Dim] support input 0D Tensor for sundary api (#47734) · 3198af20
  由 zhouweiwei2014 提交于 11月 08, 2022
```
* [Zero-Dim] support input 0D Tensor for sundary api

* fix comment
```
  3198af20
- S
  Migrate old C++ unit tests to Python framework (#47006) · 0c9f09b8
  由 Sławomir Siwek 提交于 11月 08, 2022
```
* softplus+activation

* fc + elementwise_add test refactored

* rename MKLDNN to OneDNN

* fc+activation tests refactored

* remove softplus ut

* whitespace

* whitespace

* codestyle

* codestyle

* add more cases to fc+act

* remove softplus+hard_sigmoid pass

* remove softplus + hard_sigmoid UT

* add approximate for gelu

* swish beta range

* new codestyle

* reduce number of tests
```
  0c9f09b8
- [Zero-Dim] support input 0D Tensor for distribution transform api (#47677) · dc85b393
  由 zhouweiwei2014 提交于 11月 08, 2022
```
* [Zero-Dim] support input 0D Tensor for distribution api

* fix comment
```
  dc85b393
- Z
  
  add adadelta op for xpu, test=kunlun (#47661) · 047971f0
  由 zhangyikun02 提交于 11月 08, 2022
  
  047971f0
- Z
  
  argsort support n > 16384 and add argsort_grad op for xpu, test=kunlun (#47701) · 6a6a3ff1
  由 zhangyikun02 提交于 11月 08, 2022
  
  6a6a3ff1
- S
  
  fix npu:0 stage (#47729) · 793c35ef
  由 shentanyue 提交于 11月 08, 2022
  
  793c35ef
- K
  
  add fuse_multi_transformer passes to fp16. test=develop (#47676) · caca5687
  由 Kaipeng Deng 提交于 11月 08, 2022
  
  caca5687
- L
  
  Fix bug of abs_double_grad in eager mode for kunlun, test=kunlun (#47722) · aba3c806
  由 Leo Guo 提交于 11月 08, 2022
  
  aba3c806
- N
  
  [CodeStyle][py2] remove the `next` method for python2 compatibility (PEP 3114) (#47728) · 4061b1b8
  由 Nyakku Shigure 提交于 11月 08, 2022
  
  4061b1b8
- X
  [BugFix] fix tensor_array slice bugs in _getitem_impl_ (#46447) · fccf664f
  由 xiongkun 提交于 11月 08, 2022
```
* fix tensor_array slice bugs in _getitem_impl_

* fix when var is a paddle.Tensor

* code format
```
  fccf664f
- R
  
  [CustomDevice] fix undefined symbol GetCCLComm in the cpu version (#47717) · 97004f67
  由 ronnywang 提交于 11月 08, 2022
  
  97004f67
- C
  
  Support cuda 11 with jetson (#47741) · 14c95700
  由 chalsliu 提交于 11月 08, 2022
  
  14c95700
- Z
  [Paddle Inference] allow fold fill_constant && allow nms3 into trt in int8 model (#47551) · c3a69111
  由 zhoutianzi666 提交于 11月 08, 2022
```
* allow fold fill_constant && allow nms3 into trt in int8 model
* use unordered_map
* fix CI failing
```
  c3a69111
- H
  update AUTHOR. test=kunlun (#47682) · 51507430
  由 houj04 提交于 11月 08, 2022
```
* update AUTHOR. test=kunlun

* update AUTHOR.
```
  51507430
- N
  [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition (#47642) · 888272b5
  由 Nyakku Shigure 提交于 11月 08, 2022
```
* [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition

* fix an increment
```
  888272b5
- Z
  
  fix examplce code of slice api (#47735) · e5bb8785
  由 zyfncg 提交于 11月 08, 2022
  
  e5bb8785
- P
  Split quant (#47449) · 130db92a
  由 Paulina Gacek 提交于 11月 08, 2022
```
* Split kernel registered, tests for uint/int added

* Split quantized

* Split output scales calculated only once

* NearestInterp test fix reversed

* DequantizeOutputs corrected
```
  130db92a
- J
  removing dependent to fluid/framework/eigen.h in phi (#47675) · c7cd8d98
  由 jzhang533 提交于 11月 08, 2022
```
* removing dependent to fluid/framework/eigen.h in phi

* more fix according to PR-CI-Py3 fail
```
  c7cd8d98
- T
  remove dist xpu tests for R200 (#47381) · ef21b58b
  由 tianshuo78520a 提交于 11月 08, 2022
```
* disable distributed xpu tests

* test=kunlun

* test=document_fix;test=kunlun

* test=document_fix;test=kunlun

* test=document_fix;test=kunlun

* test=document_fix;test=kunlun
```
  ef21b58b
- C
  support pow double grad op (#47691) · 6fe9dfb2
  由 Charles-hit 提交于 11月 08, 2022
```
* support pow_double_grad op

* add unit test for pow double grad

* fix pow double grad

* optimize pow double grad kernel

* fix pow double grad kernel
```
  6fe9dfb2
- Z
  [Paddle-TRT]Fix cast converter bug , use setOutputType() instaead (#46289) · 18adbbd0
  由 zhoutianzi666 提交于 11月 08, 2022
```
* fix cast bug
```
  18adbbd0
- W
  
  remove <fluid/eager/api/utils/global_utils.h> from phi (#47739) · 42d9fe2f
  由 Wang Xin 提交于 11月 08, 2022
  
  42d9fe2f
- C
  
  normalize autotune tests dir (#47726) · 6bab3343
  由 Chen Weihang 提交于 11月 08, 2022
  
  6bab3343
- T
  
  fix cinn_instruction_run_op_test when FLAGS_use_system_allocator=True (#47731) · a4a9ce0e
  由 TeFeng Chen 提交于 11月 08, 2022
  
  a4a9ce0e
- T
  Fix undefined symbol: shm_open (#47421) · 50c3632f
  由 Tomasz Socha 提交于 11月 08, 2022
```
* Fix undefined symbol: shm_open

* Fix for Windows

* Exclude APLLE
```
  50c3632f
07 11月, 2022 11 次提交
- Y
  Define ConvRunner to wrapper the call of cudnn conv functions. (#47576) · c331e2ce
  由 Yiqun Liu 提交于 11月 07, 2022
```
* Define ConvRunner to wrapper the call of cudnn conv functions.

* Use ConvKind in SearchAlgorithm.
```
  c331e2ce
- H
  suqeeze2 + transpose2 fuse onednn (#47592) · fa874a46
  由 Hui Zhang 提交于 11月 07, 2022
```
* suqeeze2 transpose2 fuse onednn

* format

* fix output shape

* fix conflict

* format

* format

* remove useless

* remove log

* simply pass

* fix comment

* fix

* fix msg

* fix error msg

* format
```
  fa874a46
- W
  
  remove hardcoded -Wunused-variable compiler flags (#47706) · 45bc4542
  由 Wang Xin 提交于 11月 07, 2022
  
  45bc4542
- L
  
  fix nlu compilation (#47707) · 75f34bb7
  由 Leo Chen 提交于 11月 07, 2022
  
  75f34bb7
- Q
  support kldiv_loss/kldiv_loss_grad for kunlun (#47638) · 5f0a8adc
  由 QingshuChen 提交于 11月 07, 2022
```
*test=kunlun
```
  5f0a8adc
- T
  Test FLAGS_enable_cudnn_frontend In CUDA117 CI (#47635) · 87753ee8
  由 tianshuo78520a 提交于 11月 07, 2022
```
* test=cuda117

* test=cuda11

* test=document_fix;test=cuda117

* test=document_fix
```
  87753ee8
- C
  
  update error msg ci check rule, test=document_fix (#47708) · dcc4b46f
  由 Chen Weihang 提交于 11月 07, 2022
  
  dcc4b46f
- Z
  [AutoParallel]fp16 pass support assign op (#47649) · 6c51e493
  由 zhaoyingli 提交于 11月 07, 2022
```
* fp16 pass support assign op

* choose assign op exec mode

* add unittest

* add cmakelist
```
  6c51e493
- P
  
  disable WITH_CUDNN_DSO (#47674) · c65f0565
  由 pangyoki 提交于 11月 07, 2022
  
  c65f0565
- Y
  add roll and roll_grad kernels and strided_slice and strided_slice_grad... · 5a4d2186
  由 ykkk2333 提交于 11月 07, 2022
```
add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun (#47368)

* add stat tool

* add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun
```
  5a4d2186
- W
  [Eager] eager tensor support pickler (#47025) · 8a7e54d5
  由 wanghuancoder 提交于 11月 07, 2022
```
* test_paddle_multiprocessing support eager tensor pickler
```
  8a7e54d5

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功