提交 · 04dcb9d78d72273b672e8807db9fa451c3f1aead · PaddlePaddle / Paddle

17 11月, 2022 20 次提交
- H
  
  fix new executor gc dep bug (#48068) · 04dcb9d7
  由 hong 提交于 11月 17, 2022
  
  04dcb9d7
- H
  
  rm "paddle/fluid/framework/convert_utils.h" in phi (#48001) · 2f34fc7a
  由 huangjiyi 提交于 11月 17, 2022
  
  2f34fc7a
- Y
  [PHI]Standardise some C++ API (Part5) (#47860) · f3650201
  由 YuanRisheng 提交于 11月 17, 2022
```
* standard api

* fix xpu bugs
```
  f3650201
- M
  
  optimizing a bit tensor_array initialization (#48066) · c374894d
  由 Mountagha 提交于 11月 17, 2022
  
  c374894d
- T
  
  xpu-paddlepaddle-41 [任务] ffn and attention test=kunlun (#46658) · 071708fa
  由 taixiurong 提交于 11月 17, 2022
  
  071708fa
- 傅
  
  remove fluid.layers.affine_grid API (#47851) · b4460eee
  由傅剑寒提交于 11月 17, 2022
  
  b4460eee
- W
  
  move "function_traits.h" from fluid to phi (#48065) · b7841a2b
  由 Wang Xin 提交于 11月 17, 2022
  
  b7841a2b
- X
  [Paddle Inference] Support cast trt converter of bool input and output . (#48043) · ff44df18
  由 xiaoxiaohehe001 提交于 11月 17, 2022
```
* add_cast_bool

* cast
```
  ff44df18
- Y
  Implement a common dimension simplifier. (#47981) · bf6af816
  由 Yiqun Liu 提交于 11月 17, 2022
```
* Implement a common dims simplifier.

* Fix the include position error.

* Reduce the cpu overhead of broadcast computing.
```
  bf6af816
- S
  
  fix bug of p2p (#48045) · cb087beb
  由 ShenLiang 提交于 11月 17, 2022
  
  cb087beb
- W
  
  support stage2 for gradient merge. (#47711) · c20eb7a6
  由 wuhuachaocoding 提交于 11月 17, 2022
  
  c20eb7a6
- K
  
  Remove reduntant numpy input in Example code, test=document_fix (#47916) · 460d5040
  由 Kevin吴嘉文提交于 11月 17, 2022
  
  460d5040
- H
  
  rm "paddle/phi/kernels/gpu/batch_norm_utils.h" in phi (#48057) · b7e120d2
  由 huangjiyi 提交于 11月 17, 2022
  
  b7e120d2
- H
  [PHI decoupling] move "paddle/fluid/operators/math.h" to phi (#48062) · f62bd3b4
  由 huangjiyi 提交于 11月 17, 2022
```
* rm "paddle/fluid/operators/math.h" in phi

* rm "paddle/fluid/operators/math.h" in fluit
```
  f62bd3b4
- Y
  Support bfloat16 for adamw and adam optimizer. Fit the lr for pure bf16... · e5ed5257
  由 Yuang Liu 提交于 11月 17, 2022
```
Support bfloat16 for adamw and adam optimizer. Fit the lr for pure bf16 training with tensor fusion. (#48041)

* add bfloat16 for adamw

* set lr not to bfloat16 for pure bf16 training

* update the logic

* update the adamw optimizer

* support bfloat for adam
```
  e5ed5257
- [Zero-Dim] temporarily revert create_scalar due to input 0D is not fully supported (#48058) · 4f57da5f
  由 zhouweiwei2014 提交于 11月 17, 2022
  
  4f57da5f
- S
  Add vectorized bfloat16 atomicAdd (#48056) · ccbd03d5
  由 sneaxiy 提交于 11月 17, 2022
```
* add vectorized bfloat16 atomicAdd

* fix compile error

* fix compile error again

* fix V100 compile error

* fix V100 compile again
```
  ccbd03d5
- N
  
  [CodeStyle][F821] add a missing import (#48006) · 33d81aa4
  由 Nyakku Shigure 提交于 11月 17, 2022
  
  33d81aa4
- H
  
  xpu.cmake: use baidu-kunlun-product. update to 1116. (#48031) · efdf75e3
  由 houj04 提交于 11月 17, 2022
  
  efdf75e3
- Z
  
  generate static graph code for some op (#48036) · 7cc0d171
  由 zyfncg 提交于 11月 17, 2022
  
  7cc0d171
16 11月, 2022 20 次提交
- 傅
  
  [fluid clear] Remove elu in nn.py (#47855) · 992b30ba
  由傅剑寒提交于 11月 16, 2022
  
  992b30ba
- H
  [Clean fluid] Clean fluid elementwise_min (part1) (#48033) · 99ec2c16
  由 HongyuJia 提交于 11月 16, 2022
```
* clean fluid elementwise_min

* fix elementwise_min op testcase
```
  99ec2c16
- H
  
  rm "paddle/fluid/framework/gpu_utils.h" in phi (#48020) · 29a0987a
  由 huangjiyi 提交于 11月 16, 2022
  
  29a0987a
- Q
  [NPU] update npu prop, test=develop (#47859) · ad8847aa
  由 Qi Li 提交于 11月 16, 2022
```
* [NPU] update npu prop, test=develop

* remove ddim.h

* remove diff

* update storage prop, test=develop
```
  ad8847aa
- H
  
  clean fluid elementwise_max (part2): remove API (#48034) · b68e0c47
  由 HongyuJia 提交于 11月 16, 2022
  
  b68e0c47
- X
  [Paddle Inference] Add fill_any_like trt converter. (#47974) · d6be9000
  由 xiaoxiaohehe001 提交于 11月 16, 2022
```
* add_fill_any_like

* add_fill_any_like
```
  d6be9000
- W
  elementwise_floordiv (#47944) · b4b78060
  由 wenbin 提交于 11月 16, 2022
```
* elementwise_op

* add teller

* modify ut

* comments

* modify ut

* return

* modify
```
  b4b78060
- W
  [remove fluid] under fleet meta_optimizers (#47864) · a2a97cbb
  由 wangzhen38 提交于 11月 16, 2022
```
* [remove fluid] under fleet meta_optimizers

* [remove fluid] under fleet meta_optimizers

* [remove fluid] under fleet meta_optimizers

* [remove fluid] under fleet meta_optimizers

* [remove fluid] under fleet meta_optimizers

* [remove fluid] under fleet meta_optimizers

* [remove fluid] under fleet meta_optimizers

* [remove fluid] under fleet meta_optimizers

* [remove fluid] under fleet meta_optimizers

* [remove fluid] under fleet meta_optimizers

* [remove fluid] under fleet meta_optimizers

* [remove fluid] under fleet meta_optimizers
```
  a2a97cbb
- C
  
  remove adaptive_pool2d and adaptive_pool3d (#48004) · 9fba1e72
  由 ccrrong 提交于 11月 16, 2022
  
  9fba1e72
- C
  remove chunk_eval in nn.py under fluid (#47948) · fd15390a
  由 ccrrong 提交于 11月 16, 2022
```
* remove chunk_eval
```
  fd15390a
- Z
  
  trt memory set change from setMaxWorkspaceSize to setMemoryPoolLimit since trt 8.3+ (#47795) · 9cf3aa61
  由 Zhang Jun 提交于 11月 16, 2022
  
  9cf3aa61
- Z
  
  [inference][trt] update trt hardswish plugin to layer (#47745) · 6c54e0e8
  由 Zhang Jun 提交于 11月 16, 2022
  
  6c54e0e8
- H
  [Opt depthwise_conv2d] Simplify depthwise_conv2d use_cudnn attribute (#48010) · 7c304580
  由 HongyuJia 提交于 11月 16, 2022
```
* simplify depthwise_conv2d phi kernel selection

* fix depthwise_conv2d
```
  7c304580
- P
  Add bf16 data type support to oneDNN bilinear_interp kernel (#46770) · 8e6315e4
  由 Piotr Paturej 提交于 11月 16, 2022
```
* Enable bf16 in oneDNN bilinear_interp kernel

* Fix bilinear_interp_v2 not enabled in models

* Remove unnecessary checks
```
  8e6315e4
- Y
  Fix paddle rec, kim, dsin models' bugs (#47792) · e23dfed9
  由 ykkk2333 提交于 11月 16, 2022
```
* add stat tool

* add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun

* embedding and embedding_grad add int32 input, test=kunlun
```
  e23dfed9
- H
  remove avx check (#48003) · a762d68e
  由 hong 提交于 11月 16, 2022
```
* remove avx check

* fix bug;
```
  a762d68e
- L
  
  increase the level of some log (#47990) · 2f8901cb
  由 Leo Chen 提交于 11月 16, 2022
  
  2f8901cb
- S
  
  fix xccl (#48018) · 0d507fc2
  由 shentanyue 提交于 11月 16, 2022
  
  0d507fc2
- W
  
  move "gpu_primitives.h" to phi (#48015) · 9adca1e7
  由 Wang Xin 提交于 11月 16, 2022
  
  9adca1e7
- W
  Update `ProcessGroupCustom` for `sync_op` compatibility (#47976) · e4ebf383
  由 Wen Sun 提交于 11月 16, 2022
```
* refactor: update pg custom

* fix: use new api in ut

* fix: typo

* revert: recover legacy apis

* fix: add GetDeviceContext
```
  e4ebf383

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功