提交 · 69d01eb9c308f9e8250f98d1eef4cf156a0bd476 · PaddlePaddle / Paddle

12 1月, 2023 8 次提交
- Z
  
  move fuild.contrib.mixed_precision to paddle.static.amp (#49412) · 69d01eb9
  由 zhangkaihuo 提交于 1月 12, 2023
  
  69d01eb9
- J
  Fix reduce func bug in process_group_bkcl (#49749) · 8e291bf7
  由 jameszhang 提交于 1月 12, 2023
```
* Fix reduce func bug in process_group_bkcl

Also catch up with a recent process_group PR that failed to add XPU branch.
Note that reduce is still accomplished by allreduce for xpu. Fix this should
xccl lib be updated.

* fix compile issue for non-XPU
```
  8e291bf7
- W
  more preln_gn patterns (#49728) · adcb0039
  由 wenbin 提交于 1月 12, 2023
```
* compile fix

* fix compile

* compile fix

* add more preln
```
  adcb0039
- F
  [Zero-Dim] support input 0D Tensor for fmax/fmin/complex api (#49730) · a015f815
  由 FlyingQianMM 提交于 1月 12, 2023
```
* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss

* [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor

* [Zero-Dim] support input 0D Tensor for fmax,fmin,complex
```
  a015f815
- W
  
  [ci unnitest fix] dgc optimizer (#49741) · 81ec63a4
  由 wangzhen38 提交于 1月 12, 2023
  
  81ec63a4
- Y
  
  change test class's name (#49729) · 280677c5
  由 yuehuayingxueluo 提交于 1月 12, 2023
  
  280677c5
- Z
  [AutoParallel] recovery annotation (#49665) · 5c9c1a39
  由 zhaoyingli 提交于 1月 12, 2023
```
* recovery annotation

* bugfix
```
  5c9c1a39
- N
  remove Travis CI build status badge from README (#49747) · cc3b2009
  由 Nyakku Shigure 提交于 1月 12, 2023
```
* remove Travis CI build status badge from README.md

* empty commit, test=document_fix
```
  cc3b2009
11 1月, 2023 11 次提交

W

refactor: rm fluid deps in fleet (#49724) · 7d46d9f9
由 Wen Sun 提交于 1月 11, 2023

7d46d9f9

Add API for quantization-aware training in dygraph mode (#49398) · b53888e7

由 whs 提交于 1月 11, 2023

* Add tools for quantization-aware training
1. Expose an API named paddle.quantization.QAT
2. Define a wrapper class to insert quanters into model for QAT
3. Add some functions in QuantConfig for QAT
4. Add unittest for QAT

* Add QuantedConv2D and QuantedLinear for QAT

* Add paddle.nn.quant.qat to setup.py

b53888e7

W

refactor: rm fluid deps in distributed communication (#49722) · e0b50269
由 Wen Sun 提交于 1月 11, 2023

e0b50269
N

Update the style of print for low precision op list (#49648) · 395520f1
由 niuliling123 提交于 1月 11, 2023

395520f1
A
[D2SCinn]Fix self.infer_program always build cinn pass without cache (#49696) · 18a7e13f
由 Aurelius84 提交于 1月 11, 2023
```
* [D2SCinn]Fix self.infer_program always build cinn pass without cache

* fix infer op size
```
18a7e13f

add FusedLinear pass (#49606) · 0f08a432

由 yuehuayingxueluo 提交于 1月 11, 2023

* add FusedLinear pass

* add fused_op_list and renname PASSES to OP_FUSION

* add fused_passes_list to constants.py

* add test_passes.py

* fix test_fused_passes.py

* fix add if float(paddle.version.cuda()) >= 11.6:

* renamed test_fused_passes.py

* fix CMakeList.txt

0f08a432

W

[rm fluid] dgc_optimizer (#49714) · 1bdb7960
由 wangzhen38 提交于 1月 11, 2023

1bdb7960

[Dy2St] 移除 ProgramTranslator (#49628) · 2bb28f31

由 Ryan 提交于 1月 11, 2023

* add enable_to_static and drop some methods of ProgramTranslator

* fix code style

* fix cant import enable_to_static and update unitest

* change unitest and rollback code of PT

* fix can't import as of utils

* roll back PT

* fix roll back

* add some unitest

* add unitest and fix codestyle bug in api.py

* finish all unitest

* remove ProgramTranslator

* fix code style

* restore test_program_translator

* api.py remove get_func

* TestDygraphToStaticCode

* fix check_type and import err

* roll back PT without getcode

* roll back pt with get_code

* convert_to_static

* fix import __all__

2bb28f31

L

fix hsigmoid_loss (#49549) · 8f0adcb5
由 Linjie Chen 提交于 1月 11, 2023

8f0adcb5
L
Add input check for NLLLoss (#49547) · 08bf1b49
由 Linjie Chen 提交于 1月 11, 2023
```
* fix nll_loss

* fix nll_loss

* update

* update

* update

* fix
```
08bf1b49

姜

rm retain_grad_flag for tests part0 (#49655) · a504508c

由姜永久提交于 1月 11, 2023

* rm retain_grad_flag for tests

* modify transpose op

* retain grads for xpu tests

* lint

* modify xpu test

a504508c

10 1月, 2023 15 次提交
- W
  Use `CommContextManager` to init comm op using gloo backend (#49666) · 05df6973
  由 Wen Sun 提交于 1月 10, 2023
```
* refactor: gloo comm context migration

* fix: headers & avoid mutable_data usage

* fix: cmake gloo dep

* style: rename funcs

* refactor: move to new files

* fix: gloo deps

* refactor: simplify create device
```
  05df6973
- F
  [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss (#49616) · 98693428
  由 FlyingQianMM 提交于 1月 10, 2023
```
* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss

* [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor
```
  98693428
- X
  
  reorganize the prim unittests struct (#49679) · 4370a91a
  由 Xiaoxu Chen 提交于 1月 10, 2023
  
  4370a91a
- Z
  
  update reduce ut (#49689) · cfe55f31
  由 Zhang Jun 提交于 1月 10, 2023
  
  cfe55f31
- W
  
  solve share params bugs and add exclude_layer attr for stage3. (#48695) · 79b261ba
  由 wuhuachaocoding 提交于 1月 10, 2023
  
  79b261ba
- 姜
  rm retain grads flag for tests part1 (#49660) · 3f2f036c
  由姜永久提交于 1月 10, 2023
```
* rm retain grads flag for tests

* modify fill_diagonal

* retain grads for fill_diagonal tests

* reset sum & concat

* fix fill_diagonal
```
  3f2f036c
- W
  [Eager] polish several ops (#49612) · 1712e212
  由 Weilong Wu 提交于 1月 10, 2023
```
* [Eager] polish several ops

* rm useless code
```
  1712e212
- G
  
  Fix the problem that the quantization model cannot find the weight (#49664) · a6bd6957
  由 Guanghua Yu 提交于 1月 10, 2023
  
  a6bd6957
- Y
  [Auto Parallel] Remove some deprecated fluid APIs (#49099) · c70fe47c
  由 Yulong Ao 提交于 1月 10, 2023
```
* [Auto Parallel] Remove some fluid APIs

* [Auto Parallel] Fix the wrong import

* [Auto Parallel] Remove unnecessary comments

* [Auto Parallel] Fix the importing bug
```
  c70fe47c
- H
  [Dy2St] Add ignore_module API (#49485) · daea892c
  由 hjyp 提交于 1月 10, 2023
```
* Add ignore_module API

* fix type of parameter

* Add test case of ignore-module
```
  daea892c
- W
  
  support cpu offload for stage3 (#49196) · 451756fb
  由 wuhuachaocoding 提交于 1月 10, 2023
  
  451756fb
- X
  
  [Zero_Dim][unittest] add repeat_interleave unittest for zero_dim (#49596) · 923f2458
  由 xysheng-baidu 提交于 1月 10, 2023
  
  923f2458
- W
  [Bug fix] Fix kaiming initializer div zero (#49656) · 35fa30d0
  由 wanghuancoder 提交于 1月 10, 2023
```
* fix kaiming initializer div zero
```
  35fa30d0
- Y
  
  [Fuse attention pass] Forward pattern. (#49621) · b0ece266
  由 Yuang Liu 提交于 1月 10, 2023
  
  b0ece266
- S
  
  Add reduce_min prod trt converter (#49615) · 13992de7
  由 Sanbu 提交于 1月 10, 2023
  
  13992de7
09 1月, 2023 6 次提交
- Z
  Add build strategy for infer program of dy2st (#49641) · 9415a6af
  由 zhangbo9674 提交于 1月 09, 2023
```
* add build strategy for infer program of dy2st

* refine code

* fix bug
```
  9415a6af
- W
  Preln groupnorm (#49463) · 591be3bd
  由 wenbin 提交于 1月 09, 2023
```
* skip_groupnorm

* init

* preln

* add ut

* more assert

* set timeout

* fix windows ci issue
```
  591be3bd
- H
  Rewrite batch norm act fuse pass tester (#49277) · aaa25222
  由 Hulek 提交于 1月 09, 2023
```
* Rewritten

* change mkldnn to onednn

* fix cmake name
```
  aaa25222
- R
  [Dy2St] transforms.RandomErasing Support static mode (#49617) · e9df6fcd
  由 Ryan 提交于 1月 09, 2023
```
* static.nn.cond ten

* add unitest

* update code style
```
  e9df6fcd
- Q
  
  add fill/fill_any for kunlun (#49645) · 31ea3231
  由 QingshuChen 提交于 1月 09, 2023
  
  31ea3231
- Y
  [XPU] add einsum fill diagonal and diagonal kernels (#49465) · a5bf156b
  由 ykkk2333 提交于 1月 09, 2023
```
* migrate shaple sgd, split,sign xpu kernels to phi, test=kunlun

* fix dlrm throughput problem, test=kunlun

* add xpu einsum, fill_diagonal, and diagonal kernels, test=kunlun
```
  a5bf156b

PaddlePaddle / Paddle 接近 2 年 前同步成功

PaddlePaddle / Paddle
接近 2 年前同步成功