提交 · 7f6d222f02f6668fc95eb0d40ed8e999b4caca02 · PaddlePaddle / Paddle

14 7月, 2023 1 次提交

[AutoTuner] Distribute best cfg (#54834) · 7f6d222f

由 caozhou 提交于 7月 14, 2023

* distribute best cfg

* adapt to multi args transmission

* update metric extracting

* fix bugs of prune and reading log

* fix time default value

* remove time record

* adjust the order of searching dim

* fix prune bugs

* fix adding cfg bug

* fix multi nodes bug

* reset status

* remove alarm and set logdir

* deepcopy ctx

* change alarm

* fix restart bug

* add exit

* best no need alarm

* add warmup time

7f6d222f

13 7月, 2023 7 次提交
- N
  
  Add fused_attention, fused_feedforward, fused_gemm_epilogue to amp white_list (#55373) · cb68b58a
  由 niuliling123 提交于 7月 13, 2023
  
  cb68b58a
- R
  Support nvprof for auto parallel (#55347) · 9210b1af
  由 Ruibiao Chen 提交于 7月 13, 2023
```
* Support nvprof for auto parallel

* Fix CI errors

* Fix CI errors
```
  9210b1af
- C
  【AMP Prim OP】support instance_norm prim ops for fp16 and bf16 dtype (#55368) · 65950324
  由 Charles-hit 提交于 7月 13, 2023
```
* [prim]support fp16 for instance_norm and instance_norm_grad

* support fp16 and bfp16 dtype for instance_norm prim rules

* fix new ir test

---------
Co-authored-by: Ncxxly <chenxx_id@163.com>
```
  65950324
- add phi operator c_concat and ut (#55320) · 788be26d
  由 lil-Xing 提交于 7月 13, 2023
```
* add phi operator c_concat and ut

* update create_var use

* update copyright
```
  788be26d
- L
  Integrate QAT into distributed optimizer (#54241) · aaf021c9
  由 Leo Chen 提交于 7月 13, 2023
```
* Support AMP program for onnx QAT API

* Integrate QAT into distributed optimizer

* Reduce the size of test data and increase time limit

* Use logger and reduce time limit of unittests

* Rename and move unittest into fleet test

* Test qat_init API
```
  aaf021c9
- R
  fix protobuf problem (#55305) · 0cea7b7d
  由 risemeup1 提交于 7月 13, 2023
```
* fix protobuf problem

* fix protobuf problem
```
  0cea7b7d
- Y
  
  sharding vpp overlap bug fixer (#55365) · 1558ee02
  由 Yuang Liu 提交于 7月 13, 2023
  
  1558ee02
11 7月, 2023 7 次提交

support sharding parallel (#54634) · b7a05057

由 pangengzheng 提交于 7月 11, 2023

* support sharding parallel

* fix name

* fix

* update

* test amp for sharding

---------

Co-authored-by: pangengzheng <pangengzheng.baidu.com>

b7a05057

M
DOCS: Adding imformation about datatype in math.py (#55297) · ab73b8c6
由 Muhammad Ishaque Nizamani 提交于 7月 11, 2023
```
* DOCS: Adding imformation about datatype in math.py

* replaced uint16 with bfloat16.
```
ab73b8c6

Pipeline pass base (#55174) · 5434560a

由 Wennie396 提交于 7月 11, 2023

* format correction

* variable names adjustment

* variable names adjustment, name-->type, value-->sub_program

5434560a

replace the AdagradOptimizer... · 94365855

由 LoneRanger 提交于 7月 11, 2023

replace the AdagradOptimizer 、adamaxOptimizer、AdadeltaOptimizer、RMSPropOptimizer、LambOptimizer and Momentum (#54152)

* replace the AdadeltaOptimizer with Adadelta

* replace the RMSPropOptimizer with RMSProp

* replace the LambOptimizer with lamb

* replace the momentum in contrib/optimizer.py with Momentum in python/paddle/optimizer/momentum.py

* fix bug

* fix bug

* fix bug

* fix bug of Lamp

* fix bug of Lamp

* fix bug of import

* replace the AdamaxOptimizer with Admax and change the optimizer base for AdagradOptimizer

* fix bug

* fix bug

* Update optimizer.py

* fix bug

* fix bug

94365855

Integrate rmsnorm kernel (#54998) · 97d3d6ee

由 MarDino 提交于 7月 11, 2023

* add rmsnorm kernel
* add static graph test
* fix round type
* use alignas to avoid msvc compile error
* remove redundant headerfile to avoid rocm compile error
* fix rocm compile not found cub
* Add document

97d3d6ee

Linear compress (#55128) · f4290a92
由 FormlessUnit 提交于 7月 11, 2023
```
* rename weight_only/llm.int8
```
f4290a92

赛题七-开发grad_fn、next_functions两个API 并暴露到python端-v1 (#54838) · ab46b14c

由 qiuwenbo 提交于 7月 11, 2023

* [尝试] 给tensor增加一个属性, 这个属性是一个定值 1

* 暴露gradnode 并构建gradnode新的方法(用来测试)进行暴露给python python端可以访问

* 开发grad_fn、next_functions两个API 并暴露到python端- 做一些规范化处理

* 增加一个单元测试

* 优化 code-style

ab46b14c

10 7月, 2023 3 次提交
- R
  
  [CustomDevice] add custom device support for Variable.set_value (#55272) · df311526
  由 ronnywang 提交于 7月 10, 2023
  
  df311526
- R
  
  [CustomDevice] fix get_paddle_place (#55225) · 33730ae7
  由 ronnywang 提交于 7月 10, 2023
  
  33730ae7
- W
  
  Fix load fine tune error (#55233) · 5f00305d
  由 WangZhen 提交于 7月 10, 2023
  
  5f00305d
07 7月, 2023 2 次提交
- G
  修改COPY-FROM No. 2 jit (#54920) · a02e6dbd
  由 gouzil 提交于 7月 07, 2023
```
* [jit] add copy-from; test=document_fix

* [jit] add copy-from; test=document_fix

* fix TracedLayer
```
  a02e6dbd
- L
  remove the extend_optimizer_with_weight_decay function (#55007) · 3cca2a87
  由 LoneRanger 提交于 7月 07, 2023
```
* remove the extend_optimizer_with_weight_decay function

* Update __init__.py

* fix bug

* fix bug
```
  3cca2a87
06 7月, 2023 7 次提交
- G
  修改COPY-FROM No. 3 autograd (#54921) · d6e259bb
  由 gouzil 提交于 7月 06, 2023
```
* [autograd] add copy-from; test=document_fix

* [autograd] add copy-from; test=document_fix

* fix
```
  d6e259bb
- W
  
  add new version print_table_stat (#55092) · d166c118
  由 wangxiaoning 提交于 7月 06, 2023
  
  d166c118
- Z
  add clip_grad_value_ api (#54603) · 88402cdb
  由 zqw_1997 提交于 7月 06, 2023
```
* add clip_grad_value_ api

* add test for ClipGradByValue

* typo fix

* refine and modify clip_grad_norm_

* no_grad

* clip_

* remove g=p.grad

* bug: AssertionError: When Variable is used as the condition of if/while , Variable can only contain one element.
```
  88402cdb
- C
  [Prim] Fix none var added with op error (#55133) · 7f0ba045
  由 cyber-pioneer 提交于 7月 06, 2023
```
* fix prim add fill_any_like bug

* polish code
```
  7f0ba045
- Z
  
  [AMP] modify default value for GradScaler (#54653) · 77e289ae
  由 Zhang Ting 提交于 7月 06, 2023
  
  77e289ae
- Z
  remove allreduce before c_allgather (#55143) · c234f1f2
  由 zhaoyingli 提交于 7月 06, 2023
```
* remove allreduce before c_allgather

* update reshard insert_fill_constant_op func

* insert_fill_constant_op add shape arg
```
  c234f1f2
- X
  Revert "[XPU] fix the dataloader problem in RDMA env (#54150)" (#55150) · 86694ce3
  由 XiaociZhang 提交于 7月 06, 2023
```
This reverts commit 15c87528.
```
  86694ce3
05 7月, 2023 5 次提交
- W
  
  fix error in code-block directive, test=document_fix (#55104) · 902de74c
  由 Wang Xin 提交于 7月 05, 2023
  
  902de74c
- Z
  
  修改COPY-FROM No. 15 regularizer (#54926) · 97e87d2d
  由 zhangjingwei 提交于 7月 05, 2023
  
  97e87d2d
- C
  
  修改COPY-FROM No.18 (#54842) · 567dabeb
  由 cyberslack_lee 提交于 7月 05, 2023
  
  567dabeb
- G
  
  fix pow label, test=document_fix (#54945) · d6e90046
  由 GGBond8488 提交于 7月 05, 2023
  
  d6e90046
- L
  
  [sparse] Add backend conv2d support (#54707) · 3e3f5d90
  由 LUZY0726 提交于 7月 05, 2023
  
  3e3f5d90
03 7月, 2023 6 次提交
- relax_micro_batch_check (#54788) · 802613cc
  由 zhenhailiu 提交于 7月 03, 2023
  
  802613cc
- L
  
  fix the example code (#55053) · 0fd50551
  由 LoneRanger 提交于 7月 03, 2023
  
  0fd50551
- M
  [Fix]fix cleandoc with a first blank line (#55052) · ff7e6ec5
  由 megemini 提交于 7月 03, 2023
```
* [Fix]fix cleandoc with a first blank line

* [Fix]fix metrics.py code-block

* [Fix]fix metrics.py code-block indent
```
  ff7e6ec5
- L
  【PaddlePaddle Hackathon 4】No.63 : add lerp bf16 support (#53078) · ce31a72e
  由 LoneRanger 提交于 7月 03, 2023
```
* add lerp bf16 support

* fix bug

* Update test_lerp_op.py

modify the input dtype

* modify the test_lerp_op.py

* Update test_lerp_op.py

* fix bug of import

* add user_defined_grads

* Update test_lerp_op.py

* fix bug of grad

* fix bug of grad

* fix bug of grad

* add the check for bfloat16 dtype
```
  ce31a72e
- add linear_compress API (#54140) · c4d5ec66
  由 FormlessUnit 提交于 7月 03, 2023
```
* add linear_compress API
```
  c4d5ec66
- N
  
  Update the rope op according to the comments (#54985) · 2401d48d
  由 niuliling123 提交于 7月 03, 2023
  
  2401d48d
30 6月, 2023 2 次提交

replace the PolynomialDecay、NoamDecay、LinearLrWarmup、ReduceLROnPlateau in... · 051e55c6

由 LoneRanger 提交于 6月 30, 2023

replace the PolynomialDecay、NoamDecay、LinearLrWarmup、ReduceLROnPlateau in fluid with 2.0 version (#54806)

* remove the ReduceLROnPlateau in fluid

* fix bug

* remove the PolynomialDecay in fluid

* remove the LinearLrWarmup in fluid

* fix bug

* remove the NoamDecay in fluid

* fix bug

* fix bug

* fix bug

051e55c6

S

fix launch unorder (#55011) · 8ede4a9c
由 sneaxiy 提交于 6月 30, 2023

8ede4a9c

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功