提交 · bc153701da60fd042335177c6bf2f145f38fc90b · PaddlePaddle / Paddle

19 7月, 2023 5 次提交

陶

add sequence parallel utils to fleet utils (#55462) · bc153701
由陶泽伟提交于 7月 19, 2023

bc153701
Y

Sharding stage 1 tensor fusion (#55427) · 4c4d3185
由 Yuang Liu 提交于 7月 19, 2023

4c4d3185
J
修改COPY-FROM No.14 incubate (#55234) · cf146106
由 jjyaoao 提交于 7月 19, 2023
```
Signed-off-by: Njjyaoao <jjyaoao@126.com>
```
cf146106
J
修改COPY-FROM No.4 optimizer (#55238) · 413efdc9
由 jjyaoao 提交于 7月 19, 2023
```
Signed-off-by: Njjyaoao <jjyaoao@126.com>
```
413efdc9

disable __setitem__ in static mode & add API paddle.static.setitem with dy2st strategy (#53682) · 7849d58d

由 JYChen 提交于 7月 19, 2023

* add paddle.static.setitem

* add some help doc

* support setitem

* support machanism

* add more unittest

* remove usless code

* raise error in static setitem

* fix d2s UT

* remove static only for both-used code

* fix bool set_value in static, fix set_value_op UT

* fix unittests

* [May case some error]: remove inplace-version check

* add two test case for dy2st

* fix function in vision

* fix dy2st setitem support, refine UT case

* fix slice in static_mode

* add ParametersMap

* remove pop

* modify place

* [fix]: variable is also a tensor

* rewrite some ut & remove slicetransformer in dy2st

* solve error in static-mode

* fix ut

* return a result for set_array_write

* fix test_set_value_op_xpu

* code is different in dynamic / static mode

---------
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
Co-authored-by: NNotHaozi <zhangmenghao@baidu.com>

7849d58d

18 7月, 2023 5 次提交

Z
修改COPY-FROM add_example_for_lazygurd (#55411) · 96ff6103
由 zhangjingwei 提交于 7月 18, 2023
```
* add_example_for_lazygurd

* fix
```
96ff6103

batch add inpalce api (#55078) · 19302938

由 GGBond8488 提交于 7月 18, 2023

* batch add inpalce api

* fix inplace fn generate

* add test for  new inpalce api

* fix typro

* fix typro

* fix typro

* fix test error

* fix atan2

* remove atan2

* auto genereate inpalce api

* fix inplace generate fn error

* fix windows error

* fix test error

* fix test error

* fix windows ci error

* fix test error

* fix test_error

* fix test error

* fix eigen aliasing error in inplace

* remove elementwise_pow inplace

* fix doc error

* fix test error

19302938

[NewIR]Fix new ir concat split bug (#55419) · 5e6645d7

由 hong 提交于 7月 18, 2023

* fix new ir concat op bug

* fix bug

* using add_n_with_kernel instead of add_n impl

* fix pd_op yaml bug

* fix bug

5e6645d7

N

[Dy2St] skip compare between func and module attribute to fix NumPy 1.25 error (#55482) · 2dcb0ebf
由 Nyakku Shigure 提交于 7月 18, 2023

2dcb0ebf

[Add] Paddle 代码 CI 中引入 xdoctest 检查 (#55295) · 26fba07c

由 megemini 提交于 7月 18, 2023

* [Add]Add Xdoctester

* [Fix]fix beta docstring

* [Doctest]change dirichlet docstring

* [Doctest]change gumbel docstring

* [Doctest]change bernoulli docstring

* [Doctest]change categorical docstring

* [Doctest]change ops.py docstring

* [Doctest]change conv docstring

* [Doctest]change distance docstring, test=docs_preview

* [Change]add ref

* [Change]patch xdoctest debug

26fba07c

17 7月, 2023 1 次提交

Support more dtype for any/all API. (#55253) · 7b19efe4

由 zxcd 提交于 7月 17, 2023

* add more data type for all/any.

* remove xpu fix.

* add test unit.

* fix typename name.

* fix output data type.

7b19efe4

14 7月, 2023 1 次提交

[AutoTuner] Distribute best cfg (#54834) · 7f6d222f

由 caozhou 提交于 7月 14, 2023

* distribute best cfg

* adapt to multi args transmission

* update metric extracting

* fix bugs of prune and reading log

* fix time default value

* remove time record

* adjust the order of searching dim

* fix prune bugs

* fix adding cfg bug

* fix multi nodes bug

* reset status

* remove alarm and set logdir

* deepcopy ctx

* change alarm

* fix restart bug

* add exit

* best no need alarm

* add warmup time

7f6d222f

13 7月, 2023 7 次提交
- N
  
  Add fused_attention, fused_feedforward, fused_gemm_epilogue to amp white_list (#55373) · cb68b58a
  由 niuliling123 提交于 7月 13, 2023
  
  cb68b58a
- R
  Support nvprof for auto parallel (#55347) · 9210b1af
  由 Ruibiao Chen 提交于 7月 13, 2023
```
* Support nvprof for auto parallel

* Fix CI errors

* Fix CI errors
```
  9210b1af
- C
  【AMP Prim OP】support instance_norm prim ops for fp16 and bf16 dtype (#55368) · 65950324
  由 Charles-hit 提交于 7月 13, 2023
```
* [prim]support fp16 for instance_norm and instance_norm_grad

* support fp16 and bfp16 dtype for instance_norm prim rules

* fix new ir test

---------
Co-authored-by: Ncxxly <chenxx_id@163.com>
```
  65950324
- add phi operator c_concat and ut (#55320) · 788be26d
  由 lil-Xing 提交于 7月 13, 2023
```
* add phi operator c_concat and ut

* update create_var use

* update copyright
```
  788be26d
- L
  Integrate QAT into distributed optimizer (#54241) · aaf021c9
  由 Leo Chen 提交于 7月 13, 2023
```
* Support AMP program for onnx QAT API

* Integrate QAT into distributed optimizer

* Reduce the size of test data and increase time limit

* Use logger and reduce time limit of unittests

* Rename and move unittest into fleet test

* Test qat_init API
```
  aaf021c9
- R
  fix protobuf problem (#55305) · 0cea7b7d
  由 risemeup1 提交于 7月 13, 2023
```
* fix protobuf problem

* fix protobuf problem
```
  0cea7b7d
- Y
  
  sharding vpp overlap bug fixer (#55365) · 1558ee02
  由 Yuang Liu 提交于 7月 13, 2023
  
  1558ee02
12 7月, 2023 1 次提交

[ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7

由 YangQun 提交于 7月 12, 2023

* squash pick the poc code
* fix build after rebase
* fix int8 conv and fc uts
* Fix and clean-up Get_SRC_Scale_Memory
* fix floating point fc uts
* fix test_analyzer_int8_googlenet
* test_analyzer_int8_mobilenetv1
* fix int8 mobilenet v2 and v3
* fix build error after rebase
* [oneDNN] rename library version
* fix conv bias datatype
* try to fix import error
* fix rebase error
* [oneDNN] pack library into python wheel
* add MKLDNN_SHARED_LIB_3 to env_dict
* fix test_analyzer_bert
* fix fill_constant op kernel
* fix ernie and matmul op ut
* fix softplus ut
* fix conv+relu6 fusion ut
* fix hardswish fusion
* fix quant+transpose fusion ut
* fixsgd ut
* fix int8 matmul with flatten
* fix fc+scale fusion
* fix conv/matmul+gelu fusion uts
* fix rebase error
* Revert "fix conv/matmul+gelu fusion uts"
This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
* upgrade to onednn v3.1
* remove older version onednn
* use densetensor::data() for achieving mean and var in layernorm impl
* comments for atol of integer tests
* fix clang-format
* Revert "remove older version onednn"
This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
* improve binary handle
* fix expand kernel
* Revert "use densetensor::data() for achieving mean and var in layernorm impl"
* always use forward_inference for conv
* remove activation scales
* rollback changes to mkldnn.cmake
* address comments
* port changes to dequantize kernel
* fix merge error
* fix fused_elementwise_kernel
* upgrade onednn version to v3.1.1
* fix some approval error
* fix error msg format
* remove old onednn libs
* try to fix symbolic link issue
* fix cinn test case segfault
* do not explicit link test with onednn
* remove unnecessary changes
* integrate CINN with onednn v3
* link with mkldnn project
* fix cinn build file

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
Co-authored-by: NChen, Xinyu1 <xinyu1.chen@intel.com>
Co-authored-by: Ntianshuo78520a <707759223@qq.com>

cfa513f7

11 7月, 2023 7 次提交

support sharding parallel (#54634) · b7a05057

由 pangengzheng 提交于 7月 11, 2023

* support sharding parallel

* fix name

* fix

* update

* test amp for sharding

---------

Co-authored-by: pangengzheng <pangengzheng.baidu.com>

b7a05057

M
DOCS: Adding imformation about datatype in math.py (#55297) · ab73b8c6
由 Muhammad Ishaque Nizamani 提交于 7月 11, 2023
```
* DOCS: Adding imformation about datatype in math.py

* replaced uint16 with bfloat16.
```
ab73b8c6

Pipeline pass base (#55174) · 5434560a

由 Wennie396 提交于 7月 11, 2023

* format correction

* variable names adjustment

* variable names adjustment, name-->type, value-->sub_program

5434560a

replace the AdagradOptimizer... · 94365855

由 LoneRanger 提交于 7月 11, 2023

replace the AdagradOptimizer 、adamaxOptimizer、AdadeltaOptimizer、RMSPropOptimizer、LambOptimizer and Momentum (#54152)

* replace the AdadeltaOptimizer with Adadelta

* replace the RMSPropOptimizer with RMSProp

* replace the LambOptimizer with lamb

* replace the momentum in contrib/optimizer.py with Momentum in python/paddle/optimizer/momentum.py

* fix bug

* fix bug

* fix bug

* fix bug of Lamp

* fix bug of Lamp

* fix bug of import

* replace the AdamaxOptimizer with Admax and change the optimizer base for AdagradOptimizer

* fix bug

* fix bug

* Update optimizer.py

* fix bug

* fix bug

94365855

Integrate rmsnorm kernel (#54998) · 97d3d6ee

由 MarDino 提交于 7月 11, 2023

* add rmsnorm kernel
* add static graph test
* fix round type
* use alignas to avoid msvc compile error
* remove redundant headerfile to avoid rocm compile error
* fix rocm compile not found cub
* Add document

97d3d6ee

Linear compress (#55128) · f4290a92
由 FormlessUnit 提交于 7月 11, 2023
```
* rename weight_only/llm.int8
```
f4290a92

赛题七-开发grad_fn、next_functions两个API 并暴露到python端-v1 (#54838) · ab46b14c

由 qiuwenbo 提交于 7月 11, 2023

* [尝试] 给tensor增加一个属性, 这个属性是一个定值 1

* 暴露gradnode 并构建gradnode新的方法(用来测试)进行暴露给python python端可以访问

* 开发grad_fn、next_functions两个API 并暴露到python端- 做一些规范化处理

* 增加一个单元测试

* 优化 code-style

ab46b14c

10 7月, 2023 3 次提交
- R
  
  [CustomDevice] add custom device support for Variable.set_value (#55272) · df311526
  由 ronnywang 提交于 7月 10, 2023
  
  df311526
- R
  
  [CustomDevice] fix get_paddle_place (#55225) · 33730ae7
  由 ronnywang 提交于 7月 10, 2023
  
  33730ae7
- W
  
  Fix load fine tune error (#55233) · 5f00305d
  由 WangZhen 提交于 7月 10, 2023
  
  5f00305d
08 7月, 2023 1 次提交
- 张
  [CodeStyle][CINN] ruff F401 and F403 in python/cinn (#55182) · 32bc8b88
  由张春乔提交于 7月 08, 2023
```
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
```
  32bc8b88
07 7月, 2023 3 次提交

修改COPY-FROM No. 2 jit (#54920) · a02e6dbd

由 gouzil 提交于 7月 07, 2023

* [jit] add copy-from; test=document_fix

* [jit] add copy-from; test=document_fix

* fix TracedLayer

a02e6dbd

L
remove the extend_optimizer_with_weight_decay function (#55007) · 3cca2a87
由 LoneRanger 提交于 7月 07, 2023
```
* remove the extend_optimizer_with_weight_decay function

* Update __init__.py

* fix bug

* fix bug
```
3cca2a87

[CINN] Remove some pybind interface in cinn to fix compile problem (#55043) · 2fc429f1

由 zyfncg 提交于 7月 07, 2023

* remove some pybind interface in cinn to fix compile problem

* modify cmake

* fix cmake

* add log for build cinn whl

* fix ninja for cinn

* fix conflict

2fc429f1

06 7月, 2023 6 次提交
- G
  修改COPY-FROM No. 3 autograd (#54921) · d6e259bb
  由 gouzil 提交于 7月 06, 2023
```
* [autograd] add copy-from; test=document_fix

* [autograd] add copy-from; test=document_fix

* fix
```
  d6e259bb
- X
  
  fix cinn version path (#55190) · 8b9b9400
  由 Xinyu Chen 提交于 7月 06, 2023
  
  8b9b9400
- W
  
  add new version print_table_stat (#55092) · d166c118
  由 wangxiaoning 提交于 7月 06, 2023
  
  d166c118
- Z
  add clip_grad_value_ api (#54603) · 88402cdb
  由 zqw_1997 提交于 7月 06, 2023
```
* add clip_grad_value_ api

* add test for ClipGradByValue

* typo fix

* refine and modify clip_grad_norm_

* no_grad

* clip_

* remove g=p.grad

* bug: AssertionError: When Variable is used as the condition of if/while , Variable can only contain one element.
```
  88402cdb
- C
  [Prim] Fix none var added with op error (#55133) · 7f0ba045
  由 cyber-pioneer 提交于 7月 06, 2023
```
* fix prim add fill_any_like bug

* polish code
```
  7f0ba045
- Z
  
  [AMP] modify default value for GradScaler (#54653) · 77e289ae
  由 Zhang Ting 提交于 7月 06, 2023
  
  77e289ae

PaddlePaddle / Paddle 接近 2 年 前同步成功

PaddlePaddle / Paddle
接近 2 年前同步成功