1. 27 April 2023, 13 commits
    • db30aa1d
    • [PaddlePaddle Hackathon 4] Support float16 data type for the maxout operator (#50976) · 8bfd978f
      Committed by NetPunk
      * support fp16 for maxout op
      
      * format code
      
      * change api
      
      * add test for static float16
      
      * format code
      
      * formatting code
      
      * atol alignment
      
* experiment-1
      
      * experiment-2
      
      * experiment-3
      
      * format code
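      A minimal sketch of the float16 path this PR enables, assuming a CUDA build of Paddle (the fp16 maxout kernel is a GPU path); shapes are illustrative:

      ```python
      import paddle
      import paddle.nn.functional as F

      # fp16 maxout runs on GPU; assumes a CUDA-enabled Paddle build.
      x = paddle.rand([2, 8, 4, 4]).astype('float16')  # channels divisible by groups
      out = F.maxout(x, groups=2, axis=1)              # max over each group of 2 channels
      print(out.shape)                                 # [2, 4, 4, 4]
      ```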
    • Move fused feedforward (#53166) · 25b4ba7f
      Committed by Sonder
* move the fused_feedforward Compute function to phi
      
      * add register info
      
      * remove maxfunctor
      
* move fused feedforward to phi
      
      * remove sig file
      
* remove fluid include
      
      * add include
      
      * add include
      
      * add sig file
      
      * add output register info
      
      * fix sig file
      
      * Update fused_feedforward_sig.cc
      
      * fix grad kernel
      
      * update output register info
      
      * fix
      
      * open fused_feedforward static build
      
      * add optional and fix code style
      
      * fix output info for fused attention
      
      * add optional param
      
      * merge
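      The migration is internal and keeps the op's behavior; for reference, a sketch of the fused op being moved, assuming a CUDA build (the fused kernels are GPU-only) and illustrative sizes:

      ```python
      import paddle
      from paddle.incubate.nn import FusedFeedForward

      # Fused feedforward (layer-norm + two linears fused); CUDA-only.
      ffn = FusedFeedForward(d_model=8, dim_feedforward=32)
      x = paddle.rand([1, 4, 8])   # [batch, seq_len, d_model]
      print(ffn(x).shape)          # [1, 4, 8]
      ```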
    • [AMP] support OD level and skip dynamic loss scaling for bf16 (#53289) · 18e9dcdc
      Committed by Zhang Ting
      * support OD level and skip dynamic loss scaling for bf16
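      A minimal sketch of the new level, assuming a build with this change and a device with bfloat16 support; under 'OD' only the default white-list ops run in low precision, and bf16 skips dynamic loss scaling so no GradScaler is needed:

      ```python
      import paddle

      model = paddle.nn.Linear(4, 4)
      x = paddle.rand([2, 4])

      # level='OD': only default white-list ops (e.g. matmul) are cast to low precision.
      with paddle.amp.auto_cast(level='OD', dtype='bfloat16'):
          loss = model(x).mean()

      # bf16 skips dynamic loss scaling, so no paddle.amp.GradScaler is involved.
      loss.backward()
      ```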
    • [Fix CppExtension Unittest] Change CUDAExtension to CppExtension if necessary (#53352) · 3278dec7
      Committed by HongyuJia
      * [Fix CppExtension Unittest] Change CUDAExtension to CppExtension if necessary
      
      * Temporarily test cpp_extension under GPU
      
      * Split mixed_extension unittest
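      A sketch of the pattern the fix applies: pick the extension type from the build, falling back to CppExtension when CUDA is unavailable. The source file names are hypothetical:

      ```python
      # setup.py (sketch); custom_op.cc / custom_op.cu are hypothetical sources.
      import paddle
      from paddle.utils.cpp_extension import CppExtension, CUDAExtension, setup

      if paddle.is_compiled_with_cuda():
          ext = CUDAExtension(sources=['custom_op.cc', 'custom_op.cu'])
      else:
          ext = CppExtension(sources=['custom_op.cc'])

      setup(name='custom_op_lib', ext_modules=ext)
      ```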
    • Add jacobian and hessian (#53331) · e8d296ef
      Committed by HydrogenSulfate
      * add jacobian and hessian in paddle.autograd
      
* disable unit test 'func_multi_input' due to a bug in the high-order gradient of multiply

* add dimension checks

* add support for 0-D tensor

* change return type from Jacobian to Hessian in hessian function

* refine Jacobian _flatten function for single xs

* refine support for 0-D tensor

* 1. re-enable the 'func_multi_input' unit test now that the multiply_grad_kernel
bug is fixed.
2. support non-inplace math operations via magic method overriding.

* add a unit test for math operations and raise an error when a 0-D tensor is indexed

* add ndim checks on ys and xs according to is_batched, and add one unit test

* refine docstring of jacobian and hessian

* move paddle.incubate.autograd.Jacobian/Hessian to paddle.incubate.autograd.functional.Jacobian/Hessian

* remove single_input unit test case because the numerical differentiation result is wrong

* remove 3 unit tests whose numerical (reference) results are wrong

* 1. rename autodiff.py to autograd.py
2. increase TIMEOUT to 100

* revert modifications to functional Jacobian/Hessian

* 1. use tuple as return type instead of list
2. refine docstring

* add more unit test cases to improve coverage

* remove 2 Hessian unit tests whose numerical results are wrong

* remove 1 Hessian unit test whose numerical result is wrong

* remove 1 Hessian unit test whose numerical result is wrong

* change unit tests to shape checks

* correct docs and replace the incubate API with the stable API in _grad
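      A minimal sketch of the added API, assuming the lazy-evaluation semantics this PR introduces (indexing the returned object materializes the values):

      ```python
      import paddle

      x = paddle.randn([3])
      x.stop_gradient = False

      y = x * x                             # vector output
      J = paddle.autograd.jacobian(y, x)    # lazy; indexing triggers evaluation
      print(J[:, :])                        # diag(2 * x)

      z = (x * x).sum()                     # hessian requires a scalar output
      H = paddle.autograd.hessian(z, x)
      print(H[:, :])                        # 2 * I
      ```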
    • [prim] Concat bug (#53350) · 6768c6ec
      Committed by xiaoguoguo626807
* modify concat_grad and add the sum comp rule
      
      * modify opcompat
    • Update Adamw.py (#52984) · c0ee14f6
      Committed by hua-zi
      * update Adamw.py
      
      out.backward() -> loss.backward()
      
      * Update adamw.py
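      The fix corrects the docstring example to call backward on the loss rather than on the raw output; the corrected pattern:

      ```python
      import paddle

      linear = paddle.nn.Linear(10, 10)
      inp = paddle.rand([10, 10], dtype="float32")
      out = linear(inp)
      loss = paddle.mean(out)

      opt = paddle.optimizer.AdamW(learning_rate=0.1, parameters=linear.parameters())
      loss.backward()     # the example previously called out.backward()
      opt.step()
      opt.clear_grad()
      ```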
    • 1bd468e2
    • [XPU] remove scale_loss in parallel.py (#53337) · 2e1ac529
      Committed by houj04
      * [XPU] remove scale_loss in parallel.py
      
      * [XPU] throw Unimplemented when using Reducer
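      After this change, dynamic-graph data parallel on XPU no longer rescales the loss inside parallel.py; the user-side pattern is unchanged. A sketch, with launch and device setup omitted:

      ```python
      import paddle
      import paddle.distributed as dist

      dist.init_parallel_env()
      layer = paddle.DataParallel(paddle.nn.Linear(4, 4))

      loss = layer(paddle.rand([2, 4])).mean()
      loss.backward()   # gradients sync across ranks; no manual loss scaling
      ```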
    • [Hackathon No.55] Add fmax BF16 test (#51925) · 8a6ad6e5
      Committed by superwinner1
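      The PR only adds a bfloat16 test; for reference, fmax takes the elementwise maximum and prefers the non-NaN operand:

      ```python
      import paddle

      x = paddle.to_tensor([1.0, float('nan'), 3.0])
      y = paddle.to_tensor([2.0, 2.0, 1.0])
      print(paddle.fmax(x, y))   # [2., 2., 3.]; NaN operands are ignored
      ```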
    • [Hackathon 4] No.5 nextafter (#52544) · 82ac3913
      Committed by cyberslack_lee
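      The PR adds paddle.nextafter, which returns the next representable float after x in the direction of y; a minimal sketch:

      ```python
      import paddle

      x = paddle.to_tensor([1.0, 2.0])
      y = paddle.to_tensor([2.0, 1.0])
      print(paddle.nextafter(x, y))   # next float32 above 1.0, next below 2.0
      ```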
    • Pad grad (#53374) · bfeedd29
      Committed by mengziheng
      * add pad op
      
* add some code
      
      * modify some code
      
      * add some code
      
      * add some code
      
      * modify some code
      
      * add some code
      
      * modify some code
      
      * Update composite_backward_api.h
      
      * modify some code
      
      * add some code
      
      * add some code
      
      * add some code
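      The PR adds a composite backward rule for pad: the gradient of a constant pad is a slice of the upstream gradient back to the input shape. A minimal sketch with illustrative shapes:

      ```python
      import paddle
      import paddle.nn.functional as F

      x = paddle.ones([1, 1, 2, 3])
      x.stop_gradient = False

      # Constant-pad H and W by 1 on each side (NCHW layout).
      y = F.pad(x, pad=[1, 1, 1, 1], value=0.0)
      y.sum().backward()
      print(x.grad)   # all ones: pad's grad slices the padded gradient back to x
      ```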
  2. 26 April 2023, 12 commits
  3. 25 April 2023, 8 commits
  4. 24 April 2023, 7 commits