提交 · d1da885fafd55236fa934220289118da8e6053b0 · BaiXuePrincess / Paddle

03 2月, 2023 2 次提交
- X
  
  solve auto_aprallel pp2 with fp16 question (#49913) · d1da885f
  由 xu98bin 提交于 2月 03, 2023
  
  d1da885f
- W
  
  fix found_inf bug for custom optimizer (#50158) · 64573f9f
  由 wanghuancoder 提交于 2月 03, 2023
  
  64573f9f
02 2月, 2023 2 次提交
- J
  [Auto Parallel]fix bugs in cluster to device meshes (#49892) · 0a5b2e79
  由 Jianghai 提交于 2月 02, 2023
```
* fix bugs in cluster to device meshes

* add tests

* 1
```
  0a5b2e79
- W
  
  update recompute doc. (#50088) · 2b4dd5b9
  由 wuhuachaocoding 提交于 2月 02, 2023
  
  2b4dd5b9
01 2月, 2023 2 次提交

remove fluid.initializer.UniformInitializer, ConstantInitializer,... · 6edc7bba

由 zqw_1997 提交于 2月 01, 2023

remove fluid.initializer.UniformInitializer, ConstantInitializer, NormalInitializer, TruncatedNormalInitializer, XavierInitializer, BilinearInitializer, MSRAInitializer, NumpyArrayInitializer and calculate_gain.. (#49498)

* move UniformInitializer and ConstantInitializer

* more modify

* circular import resolved

* another circular import resolved?

* more circular import 2

* circular import 3

* change import paddle in metric.py

* BuildStrategy import from fluid

* modify the framework import path in common.py

* change rnn.py import, from static to original framework

* change import static in the nn folder

* default_main_program should import from common_ops_import

* add import paddle in param_attr.py

* use core not paddle module for using VarDesc

* another old uniform

* mistake that use Uniform instead of UniformInitializer

* modify UniformInitializer doc

* move fluid.NormalInitializer to nn.initializer.NormalInitializer

* remove import of Normal in fluid.layers.nn.py

* remove more import of old Normal

* remove more import of old Normal

* sample code modify and tests modify import

* is_listen_failed passing arg should be log file

* problem solved

* a mistake solved

* comments resoleved and remove paddle.fluid.initializer.TruncatedNormalInitializer

* remove paddle.fluid.initializer.XavierInitializer and paddle.fluid.initializer.MSRAInitializer

* remove paddle.fluid.initializer.BilinearInitializer NumpyArrayInitializer and set_global_initializer

* change fluid to static

* change static to fluid to avoid circular import in distributed_strategy.py

* fix example code and test_initializer

* ValueType

* sample code fix

* change set_global_initializer back to fluid

* put paddle.static.BuildStrategy.ReduceStrategy into the fuction to avoid circular import

* remove calculate_gain, delete BilinearInitializer and revert set_global_initializer

* change the time of using UniformInitializer, ConstantInitializer, NormalInitializer, TruncatedNormalInitializer, XavierInitializer, MSRAInitializer, NumpyArrayInitializer as few as possible

* fix argument incampatible

* fix more arg incompatible

* fix test_prelu_op_xpu.py Constant

* fix inaccurate doc

* more doc fix: default value

6edc7bba

W

clean ps_trainer_pass (#50117) · 73f3e676
由 wangxiaoning 提交于 2月 01, 2023

73f3e676

30 1月, 2023 2 次提交

[Pglbox2.0] merge gpugraph to develop (#49946) · cb525d4e

由 zmxdream 提交于 1月 30, 2023

* add set slot_num for psgpuwraper (#177)

* add set slot_num_for_pull_feature for psgpuwarper

* Add get_epoch_finish python interface (#182)

* add get_epoch_finish interface

* add return

* delete return

* add unzip op (#183)

* fix miss key for error dataset (#186)

* fix miss key for error dataset

* fix miss key for error dataset
Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>

* add excluded_train_pair and infer_node_type (#187)

* support return of degree (#188)

* fix task stuck in barrier (#189)
Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>

* check node/feature format when loading (#190)

* check node&feature format when loading

* check node&feature format when loading (2£ (2)

* degrade log (#191)

* [PGLBOX]fix conflict

* [PGLBOX]fix conflict

* [PGLBOX]replace LodTensor with phi::DenseTensor

* [PGLBOX]fix gpu_primitives.h include path

* [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip ut

* [PGLBOX]fix unzip ut

* [PGLBOX]fix code style

* [PGLBOX]fix code style

* [PGLBOX]fix code style

* fix code style

* fix code style

* fix unzip ut

* fix unzip ut

* fix unzip ut

* fix unzip

* fix code stype

* add ut

* add c++ ut & fix train_mode_ set

* fix load into memory

* fix c++ ut

* fix c++ ut

* fix c++ ut

* fix c++ ut

* fix code style

* fix collective

* fix unzip_op.cc

* fix barrier

* fix code style

* fix barrier

* fix barrier

* fix code styple

* fix unzip

* add unzip.py

* add unzip.py

* fix unzip.py

---------
Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com>
Co-authored-by: NSiming Dai <908660116@qq.com>
Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com>
Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>

cb525d4e

W
refine amp scaler found_inf (#49864) · 382e9a06
由 wanghuancoder 提交于 1月 30, 2023
```
* refine _found_inf
```
382e9a06

29 1月, 2023 1 次提交
- L
  [FleetExecutor] Remove max_slot_num and implement multi-scope fetch (#50041) · decbb588
  由 LiYuRio 提交于 1月 29, 2023
```
* remove max_slot_num

* fix test case
```
  decbb588
28 1月, 2023 1 次提交
- L
  
  add cond interceptor (#50019) · b2706b0c
  由 LiYuRio 提交于 1月 28, 2023
  
  b2706b0c
16 1月, 2023 2 次提交

W

[Fluid clean]clean distributed fluid API (#49795) · 7de9420a
由 wangxiaoning 提交于 1月 16, 2023

7de9420a

[Auto Parallel] Clear some fluid APIs (#49793) · e70af91d

由 Yulong Ao 提交于 1月 16, 2023

* [Auto Parallel] Rename methods of ProcessMesh

* [Auto Parallel] Impl the python process_mesh by the c++ one

* [Auto Parallel] Add some minor modifications

* [Auto Parallel] Rename some methods

* [Auto Parallel] Remove unnecessary codes

* [Auto Parallel] Add back some removed files

* [Auto Parallel] Fix bugs

* [Auto Parallel] Fix a bug

* Update process_mesh.cc

* [Auto Parallel] Merge dist attrs of Python into C++

* [Auto Parallel] Add back deleted importing

* [Auto Parallel] Add back removed unittest

* [Auto Parallel] Remove type qualifiers of return types

* [Auto Parallel] Fix some bugs

* [Auto Parallel] Fix a bug of the quant pass

* [Auto Parallel] Fix the code style

* [Auto Parallel] Clear some fluid APIs

e70af91d

13 1月, 2023 3 次提交
- D
  [Custom Device] Clear ProcessGroup Manually (#49182) · a923a757
  由 duanyanhui 提交于 1月 13, 2023
```
* clear ProcessGroupCustom manually

* fix bug

* fix bug

* move destroy ProcessGroup to ProcessGroupIdMap

* enable destroy to all device

* remove unused comments

* change to internal api

* Update process_group.cc

* Update process_group.cc
```
  a923a757
- W
  
  update fluid api. (#49731) · dd827bbe
  由 wuhuachaocoding 提交于 1月 13, 2023
  
  dd827bbe
- W
  
  fix a bug of stage2 offload. (#49767) · 1c8531ce
  由 wuhuachaocoding 提交于 1月 13, 2023
  
  1c8531ce
12 1月, 2023 3 次提交
- Z
  
  move fuild.contrib.mixed_precision to paddle.static.amp (#49412) · 69d01eb9
  由 zhangkaihuo 提交于 1月 12, 2023
  
  69d01eb9
- W
  
  [ci unnitest fix] dgc optimizer (#49741) · 81ec63a4
  由 wangzhen38 提交于 1月 12, 2023
  
  81ec63a4
- Z
  [AutoParallel] recovery annotation (#49665) · 5c9c1a39
  由 zhaoyingli 提交于 1月 12, 2023
```
* recovery annotation

* bugfix
```
  5c9c1a39
11 1月, 2023 4 次提交
- W
  
  refactor: rm fluid deps in fleet (#49724) · 7d46d9f9
  由 Wen Sun 提交于 1月 11, 2023
  
  7d46d9f9
- W
  
  refactor: rm fluid deps in distributed communication (#49722) · e0b50269
  由 Wen Sun 提交于 1月 11, 2023
  
  e0b50269
- Y
  add FusedLinear pass (#49606) · 0f08a432
  由 yuehuayingxueluo 提交于 1月 11, 2023
```
* add FusedLinear pass

* add fused_op_list and renname PASSES to OP_FUSION

* add fused_passes_list to constants.py

* add test_passes.py

* fix test_fused_passes.py

* fix add if float(paddle.version.cuda()) >= 11.6:

* renamed test_fused_passes.py

* fix CMakeList.txt
```
  0f08a432
- W
  
  [rm fluid] dgc_optimizer (#49714) · 1bdb7960
  由 wangzhen38 提交于 1月 11, 2023
  
  1bdb7960
10 1月, 2023 5 次提交
- W
  Use `CommContextManager` to init comm op using gloo backend (#49666) · 05df6973
  由 Wen Sun 提交于 1月 10, 2023
```
* refactor: gloo comm context migration

* fix: headers & avoid mutable_data usage

* fix: cmake gloo dep

* style: rename funcs

* refactor: move to new files

* fix: gloo deps

* refactor: simplify create device
```
  05df6973
- W
  
  solve share params bugs and add exclude_layer attr for stage3. (#48695) · 79b261ba
  由 wuhuachaocoding 提交于 1月 10, 2023
  
  79b261ba
- Y
  [Auto Parallel] Remove some deprecated fluid APIs (#49099) · c70fe47c
  由 Yulong Ao 提交于 1月 10, 2023
```
* [Auto Parallel] Remove some fluid APIs

* [Auto Parallel] Fix the wrong import

* [Auto Parallel] Remove unnecessary comments

* [Auto Parallel] Fix the importing bug
```
  c70fe47c
- W
  
  support cpu offload for stage3 (#49196) · 451756fb
  由 wuhuachaocoding 提交于 1月 10, 2023
  
  451756fb
- Y
  
  [Fuse attention pass] Forward pattern. (#49621) · b0ece266
  由 Yuang Liu 提交于 1月 10, 2023
  
  b0ece266
09 1月, 2023 2 次提交

Z
[AutoParalle] balancing the calculation of global_norm in data parallel (#49510) · 926c4bd2
由 zhaoyingli 提交于 1月 09, 2023
```
* [AutoParalle] balancing the calculation of global_norm in data parallel

* fix unittest

* update cond pure_data_parallel
```
926c4bd2

Create comm_context and modified static init (#49536) · 04e24e58

由 LiYuRio 提交于 1月 09, 2023

* comm_context and static init

* refactor: move to phi/core/distributed

* refactor: avoid mutable_data usage

* fix: windows sock

* fix: device without nccl
Co-authored-by: Wen Sun <syl1887415157@126.com>

04e24e58

07 1月, 2023 1 次提交

Enable standalone executor for fleet training (#49293) · 67fc8e93

由 Ruibiao Chen 提交于 1月 07, 2023

* Enable standalone executor for fleet training

* Update code

* Replace use_standalone_executor utils in auto parallel

* Update code

* Diable standalone executor for test_pass_sharding

* Update code

* Set sequential run for auto parallel

* Fix dist_attr bug

* Set sequential run for auto parallel

67fc8e93

06 1月, 2023 2 次提交

G

Add observer attribute in qdq node & Add quant config for different backends. (#46887) · 8bbae468
由 Guanghua Yu 提交于 1月 06, 2023

8bbae468

[Auto Parallel] Merge dist attrs from python into c++ (#49214) · c7899074

由 Yulong Ao 提交于 1月 06, 2023

* [Auto Parallel] Rename methods of ProcessMesh

* [Auto Parallel] Impl the python process_mesh by the c++ one

* [Auto Parallel] Add some minor modifications

* [Auto Parallel] Rename some methods

* [Auto Parallel] Remove unnecessary codes

* [Auto Parallel] Add back some removed files

* [Auto Parallel] Fix bugs

* [Auto Parallel] Fix a bug

* Update process_mesh.cc

* [Auto Parallel] Merge dist attrs of Python into C++

* [Auto Parallel] Add back deleted importing

* [Auto Parallel] Add back removed unittest

* [Auto Parallel] Remove type qualifiers of return types

* [Auto Parallel] Fix some bugs

* [Auto Parallel] Fix a bug of the quant pass

* [Auto Parallel] Fix the code style

c7899074

05 1月, 2023 2 次提交

U

Fix throw exception typo in paddle/nn/functional/loss.py (#39750) · 414ca6b9
由 ucsk 提交于 1月 05, 2023

414ca6b9

姜

Yj/rm core ops exp (#49490) · 70ea88bf

由姜永久提交于 1月 05, 2023

* rm op_function_generator

* rm op_func_generator.h

* rm op_function

* modify cmake

* rm op_function.h

* rm check for op_function_generator.cc

* reset imperative

* rm python part

* fix imperative

* lint

* lint

* modify legacy_c

* review

* modify

* modify legacy

* rm gen op_functions code

* reset framework

* rm core.ops for test

* core.ops->core.eager.ops.legacy

* not raiseError for xpu

70ea88bf

04 1月, 2023 2 次提交

R

support mp on xpu (#49531) · 7875accb
由 Roc 提交于 1月 04, 2023

7875accb

[Auto Parallel-Performance] Sharding Comm Optimization (#48604) · 5592f8ad

由 JZ-LIANG 提交于 1月 04, 2023

* remove deps and prior comm

* grad comm fuse

* add deps for amp&global norm

* stage2 broadcast prior deps

* stage2 grad overlap

* stream_analyzer bugfix

* overlap enable

* dep op namescope

* depend support multiple inputs

* check finite deps

* stage2 param comm overlap

* Set kD2HStream

* grad comm hierarchical

* grad comm hierarchical

* new unitest
Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>

5592f8ad

03 1月, 2023 1 次提交
- 骑
  
  [FluidAPI]remove clip api (#48946) · fe0dc40d
  由骑马小猫提交于 1月 03, 2023
  
  fe0dc40d
30 12月, 2022 2 次提交

Z

[clean fluid api] Move fluid/contrib/slim and remove fluid api. (#48717) · 72973d5a
由 zhouzj 提交于 12月 30, 2022

72973d5a

在文档中统一静态图模式与动态图模式的英文翻译 (#49170) · a186e60d

由 Sanbu 提交于 12月 30, 2022

* 1219

* temporarily change the num_diff_files limit, test=document_fix

* Revert "temporarily change the num_diff_files limit, test=document_fix"

This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20.

* for codestyle

* remove duplicate license

* `static mode` -> `static graph mode`

* Update hybrid_parallel_inference.py

* Update layer_function_generator.py

* Update manipulation.py

* reset
Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

a186e60d

29 12月, 2022 1 次提交
- X
  auto parallel bf16 (#49079) · 418edae5
  由 xu98bin 提交于 12月 29, 2022
```
* auto parallel bf16
```
  418edae5

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致