提交 · 04e24e58f569a1cd288a81330952d3d4c94e5cf6 · BaiXuePrincess / Paddle

09 1月, 2023 1 次提交

Create comm_context and modified static init (#49536) · 04e24e58

由 LiYuRio 提交于 1月 09, 2023

* comm_context and static init

* refactor: move to phi/core/distributed

* refactor: avoid mutable_data usage

* fix: windows sock

* fix: device without nccl
Co-authored-by: Wen Sun <syl1887415157@126.com>

04e24e58

07 1月, 2023 1 次提交

Enable standalone executor for fleet training (#49293) · 67fc8e93

由 Ruibiao Chen 提交于 1月 07, 2023

* Enable standalone executor for fleet training

* Update code

* Replace use_standalone_executor utils in auto parallel

* Update code

* Diable standalone executor for test_pass_sharding

* Update code

* Set sequential run for auto parallel

* Fix dist_attr bug

* Set sequential run for auto parallel

67fc8e93

06 1月, 2023 12 次提交

G

Add observer attribute in qdq node & Add quant config for different backends. (#46887) · 8bbae468
由 Guanghua Yu 提交于 1月 06, 2023

8bbae468

[zero-dim] Support 0-d for kthvalue and mode (#49340) · 292738f3

由 JYChen 提交于 1月 06, 2023

* add 0-d support for paddle.kthvalue

* add 0-d support for paddle.mode

* fix coverage test for device

* fix check-bug in windows

* change axis check from LT to LE

* add shape & value check for grad when input is 0d tensor

292738f3

S
【Zero-Dim】Support Zero dim for embedding and one-hot (#49562) · 370b50f6
由 seemingwang 提交于 1月 06, 2023
```
* zero-tensor

* remove unused

* zero_dim_xpu

* relocate

* add value test

* fix syntax
```
370b50f6
A
[D2SCinn]Add test_cinn unittest and param_grad into skip_gc_vars (#49575) · 0400eaed
由 Aurelius84 提交于 1月 06, 2023
```
* [D2SCinn]Add test_cinn unittest and param_grad into skip_gc_vars

* remove print
```
0400eaed
H
[Custom device stream] Acquire custom_deivce stream, add unit test (#49571) · e4c438f5
由 HongyuJia 提交于 1月 06, 2023
```
* acquire custom_deivce stream

* regulate file name and unittest
```
e4c438f5

【Zero-Dim】Flatten support 0d tensor (#49361) · 0093aaa6

由 jiangcheng 提交于 1月 06, 2023

* flatten op support 0D-tensor

* add test in zero dim py

* fix shape should be list

* short code for ci-coverage

* add backward test

* simple code for ci coverage

* add axis check

* add 0D-tensor test in test_flatten_contiguous_range_op.py

* add axis error test for Coverage CI

* add more test for CI-Coverage

* add more test for CI-Coverage

0093aaa6

W

[Eager] polish several api (#49589) · 6e80b84d
由 Weilong Wu 提交于 1月 06, 2023

6e80b84d
W

[Eager] polish adaptive series api (#49574) · cac5f5a7
由 Weilong Wu 提交于 1月 06, 2023

cac5f5a7
N

[CodeStyle][UP005] replace deprecated unittest aliases (#49522) · d00c2ca6
由 Nyakku Shigure 提交于 1月 06, 2023

d00c2ca6
张

Expansions of some unmaintained pr (#49551) · 419c2d14
由张春乔提交于 1月 06, 2023

419c2d14
N

Fix inaccurate return of low precision op list (#49391) · a214e5dc
由 niuliling123 提交于 1月 06, 2023

a214e5dc

[Auto Parallel] Merge dist attrs from python into c++ (#49214) · c7899074

由 Yulong Ao 提交于 1月 06, 2023

* [Auto Parallel] Rename methods of ProcessMesh

* [Auto Parallel] Impl the python process_mesh by the c++ one

* [Auto Parallel] Add some minor modifications

* [Auto Parallel] Rename some methods

* [Auto Parallel] Remove unnecessary codes

* [Auto Parallel] Add back some removed files

* [Auto Parallel] Fix bugs

* [Auto Parallel] Fix a bug

* Update process_mesh.cc

* [Auto Parallel] Merge dist attrs of Python into C++

* [Auto Parallel] Add back deleted importing

* [Auto Parallel] Add back removed unittest

* [Auto Parallel] Remove type qualifiers of return types

* [Auto Parallel] Fix some bugs

* [Auto Parallel] Fix a bug of the quant pass

* [Auto Parallel] Fix the code style

c7899074

05 1月, 2023 17 次提交
- F
  sequence_mask fix: when the input length is an empty tensor, the kernel tries... · 0f3ccd14
  由 Feiyu Chan 提交于 1月 05, 2023
```
sequence_mask fix: when the input length is an empty tensor, the kernel tries to dereference illegal sentinel iterator (#49525)
```
  0f3ccd14
- S
  Support 0D for paddle.sort/argsort (#49501) · 032da731
  由 Siming Dai 提交于 1月 05, 2023
```
* support 0D for paddle.sort/argsort

* support 0D tensor for paddle.sort/argsort in xpu

* fix bug

* fix grad and add value assertion
```
  032da731
- Generate the static graph code of ops (#49413) · 39f0eb2c
  由 HappyHeavyRain 提交于 1月 05, 2023
```
* generate the static graph code of ops

* modify the isclose comment

* modify the clip comment in nn.py

* reset nn.py
```
  39f0eb2c
- S
  
  remove paddle.fluid.distributed (#49517) · 89f2c652
  由 sneaxiy 提交于 1月 05, 2023
  
  89f2c652
- Z
  [inference][trt]Upgrade expand cast nearestinterp for sd (#48998) · 5defefd6
  由 Zhang Jun 提交于 1月 05, 2023
```
* update nearest_interp, expand_v2, cast for stable diffusion

* update nearest_interp, expand_v2, cast for stable diffusion

* correct shape rank

* Update expand_v2_op.cc
```
  5defefd6
- J
  [Auto Parallel] Add conv2d and pool flops (#48084) · 351d37d9
  由 Jianghai 提交于 1月 05, 2023
```
* add pool flops

* add annotations and tests
```
  351d37d9
- I
  
  refix 40644 in english docs (#49532) · 35f3c258
  由 Infinity_lee 提交于 1月 05, 2023
  
  35f3c258
- Z
  
  adjust timeout of quantiization unittest. (#49559) · 9446fabd
  由 zhouzj 提交于 1月 05, 2023
  
  9446fabd
- U
  
  Fix throw exception typo in paddle/nn/functional/loss.py (#39750) · 414ca6b9
  由 ucsk 提交于 1月 05, 2023
  
  414ca6b9
- 姜
  Yj/rm core ops exp (#49490) · 70ea88bf
  由姜永久提交于 1月 05, 2023
```
* rm op_function_generator

* rm op_func_generator.h

* rm op_function

* modify cmake

* rm op_function.h

* rm check for op_function_generator.cc

* reset imperative

* rm python part

* fix imperative

* lint

* lint

* modify legacy_c

* review

* modify

* modify legacy

* rm gen op_functions code

* reset framework

* rm core.ops for test

* core.ops->core.eager.ops.legacy

* not raiseError for xpu
```
  70ea88bf
- W
  
  [Inference] inplace all reshape op (#49146) · 017af746
  由 Wilber 提交于 1月 05, 2023
  
  017af746
- Y
  
  udpate_fused_attention_en_docs, test=document_fix (#49564) · 2811dcd0
  由 Yuang Liu 提交于 1月 05, 2023
  
  2811dcd0
- W
  [Eager] optimize same python api logic (#49473) · 5949f2d7
  由 Weilong Wu 提交于 1月 05, 2023
```
* [Eager] optimize same python api logic

* optimize full api

* optimize logic

* optimize logic
```
  5949f2d7
- H
  Add 0d Tensor Test Cases for cond, case, switch_case (#49544) · d5f1e300
  由 Huihuang Zheng 提交于 1月 05, 2023
```
Add 0d Tensor Test Cases for cond, case, switch_case. Since the 3 APIs are control flow APIs, their support for 0d tensor relies on the underneath APIs. This PR just added test cases to prove that the 3 APIs have already handled 0d tensor well.
```
  d5f1e300
- Y
  
  Add transpose_qkv_wb flags to the fused_attention_op. (#49494) · ec857b85
  由 Yuang Liu 提交于 1月 05, 2023
  
  ec857b85
- W
  
  polish batch_norm api (#49508) · 11f5848b
  由 Weilong Wu 提交于 1月 05, 2023
  
  11f5848b
- Z
  
  move fuild.dygraph.amp to paddle.amp (#49193) · da3e9d66
  由 zhangkaihuo 提交于 1月 05, 2023
  
  da3e9d66
04 1月, 2023 9 次提交
- H
  
  [XPU] fix clip op unit test. (#49535) · 2098c283
  由 houj04 提交于 1月 04, 2023
  
  2098c283
- G
  
  Add the input check for softmax_with_cross_entropy (#49333) · f17b2de8
  由 Guanghua Yu 提交于 1月 04, 2023
  
  f17b2de8
- W
  
  [Inference] Add conv_fusion nhwc impl. (#49047) · 4a8708bb
  由 Wilber 提交于 1月 04, 2023
  
  4a8708bb
- R
  
  support mp on xpu (#49531) · 7875accb
  由 Roc 提交于 1月 04, 2023
  
  7875accb
- J
  [Auto Parallel-Performance] Sharding Comm Optimization (#48604) · 5592f8ad
  由 JZ-LIANG 提交于 1月 04, 2023
```
* remove deps and prior comm

* grad comm fuse

* add deps for amp&global norm

* stage2 broadcast prior deps

* stage2 grad overlap

* stream_analyzer bugfix

* overlap enable

* dep op namescope

* depend support multiple inputs

* check finite deps

* stage2 param comm overlap

* Set kD2HStream

* grad comm hierarchical

* grad comm hierarchical

* new unitest
Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
```
  5592f8ad
- 张
  
  Fix some en docs (#49513) · 04aa80e6
  由张春乔提交于 1月 04, 2023
  
  04aa80e6
- M
  [CodeStyle][py36] update requirements.txt and setup.py.in (#49516) · 77e432ee
  由 Matsumoto Ruko 提交于 1月 04, 2023
```
* update requirements.txt and setup.py.in

* update requirements.txt setup.py.in setup.py
```
  77e432ee
- S
  Revert "Replace matmul with matmul_v2 during oneDNN fuse passes (#49108)" (#49524) · 338cbeaa
  由 Sławomir Siwek 提交于 1月 04, 2023
```
This reverts commit 2c444dfa.
```
  338cbeaa
- 张
  Add for-else (#49521) · 49f5a97b
  由张春乔提交于 1月 04, 2023
```
* add for-else

* add * for unpacking
```
  49f5a97b

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致