Commits · 8cd0d5b3005c5f98218f358d17f96c606dc4dac5 · PaddlePaddle / Paddle

Jan 12, 2023 · 18 commits
- lerp support 0 Tensor (#49667) · 8cd0d5b3
  Committed by sunli on Jan 12, 2023
```
* lerp support 0 Tensor

* fix lerp grad

* fix lerp zero test

* fix 0D + ND/ND + 0D

* fix check

* update code

* fix lerp infer shape

* static backward test

* update static graph test
```
- Migrate collective communication checks to PHI (#49754) · c24e7fe1
  Committed by Wen Sun on Jan 12, 2023
```
* refactor: migrate comm checks

* refactor: add check in comm context

* feat: add gloo static check

* refactor: add place param in static check
```
- move fluid.contrib.mixed_precision to paddle.static.amp (#49412) · 69d01eb9
  Committed by zhangkaihuo on Jan 12, 2023
- Fix reduce func bug in process_group_bkcl (#49749) · 8e291bf7
  Committed by jameszhang on Jan 12, 2023
```
* Fix reduce func bug in process_group_bkcl

Also catch up with a recent process_group PR that failed to add XPU branch.
Note that reduce is still accomplished by allreduce for xpu. Fix this should
xccl lib be updated.

* fix compile issue for non-XPU
```
- deal with conflict (#49766) · 27aec62b
  Committed by YuanRisheng on Jan 12, 2023
- add ninja-build in docker (#48490) · 1a9d6be9
  Committed by tianshuo78520a on Jan 12, 2023
```
* test=ninja;test=document_fix

* test=ninja;test=document_fix

* test=ninja;test=document_fix

* add ninja

* update dockerfile

* update dockerfile

* update dockerfile

* update dockerfile

* update dockerfile

* test=cuda117

* update ce dockerfile

* update ce dockerfile
```
- conv2d_fusion(cudnn or cutlass) (#49707) · c95c35a2
  Committed by gem5 on Jan 12, 2023
- fix_split (#49743) · 3fb4a08c
  Committed by xiaoxiaohehe001 on Jan 12, 2023
- more preln_gn patterns (#49728) · adcb0039
  Committed by wenbin on Jan 12, 2023
```
* compile fix

* fix compile

* compile fix

* add more preln
```
- [Zero-Dim] support input 0D Tensor for fmax/fmin/complex api (#49730) · a015f815
  Committed by FlyingQianMM on Jan 12, 2023
```
* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss

* [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor

* [Zero-Dim] support input 0D Tensor for fmax,fmin,complex
```
- [ci unittest fix] dgc optimizer (#49741) · 81ec63a4
  Committed by wangzhen38 on Jan 12, 2023
- Fix the bugs of set_value and set_value_grad ops and add register in (#49750) · 438975fd
  Committed by Leo Guo on Jan 12, 2023
```
xpu2_op_list.cc. test=kunlun
```
- add hostdevice.h to inference lib (#49746) · 8fabf417
  Committed by Yuanle Liu on Jan 12, 2023
- [CINN] temp fix batch_norm check as inplace op bug (#49738) · ce57365d
  Committed by jiangcheng on Jan 12, 2023
- [PHI]Rename some PHI Kernel (#49470) · 30f5e39b
  Committed by YuanRisheng on Jan 12, 2023
```
* rename kernel

* delete sig

* modify code according comment

* fix ci bugs
```
- change test class's name (#49729) · 280677c5
  Committed by yuehuayingxueluo on Jan 12, 2023
- [AutoParallel] recovery annotation (#49665) · 5c9c1a39
  Committed by zhaoyingli on Jan 12, 2023
```
* recovery annotation

* bugfix
```
- remove Travis CI build status badge from README (#49747) · cc3b2009
  Committed by Nyakku Shigure on Jan 12, 2023
```
* remove Travis CI build status badge from README.md

* empty commit, test=document_fix
```
Jan 11, 2023 · 18 commits

- refactor: rm fluid deps in fleet (#49724) · 7d46d9f9
  Committed by Wen Sun on Jan 11, 2023
- Add API for quantization-aware training in dygraph mode (#49398) · b53888e7
  Committed by whs on Jan 11, 2023

* Add tools for quantization-aware training
1. Expose an API named paddle.quantization.QAT
2. Define a wrapper class to insert quanters into model for QAT
3. Add some functions in QuantConfig for QAT
4. Add unittest for QAT

* Add QuantedConv2D and QuantedLinear for QAT

* Add paddle.nn.quant.qat to setup.py

- refactor: rm fluid deps in distributed communication (#49722) · e0b50269
  Committed by Wen Sun on Jan 11, 2023
- Implement a common segmented array. (#49450) · b1faa562
  Committed by Yiqun Liu on Jan 11, 2023

* Implement a common PointerArray.

* Polish codes.

* Add including of header file.

* Add the branch of kFix8.

* Fix compiling error.

* Add alignas hint to fix the performance drop.

* Optimize the H2D copy in stack_grad.

* Rename the macro.

* Fix align hint for different compilers.

* Polish the define of PADDLE_ALIGN.

* Fix compiling error.

* Remove the align hint on windows.

- fix paddle_infer_contrib include (#49720) · 24f5c46e
  Committed by zhangxin81 on Jan 11, 2023
```
* fix paddle_infer_contrib include
```
- Update the style of print for low precision op list (#49648) · 395520f1
  Committed by niuliling123 on Jan 11, 2023
- [D2SCinn]Fix self.infer_program always build cinn pass without cache (#49696) · 18a7e13f
  Committed by Aurelius84 on Jan 11, 2023
```
* [D2SCinn]Fix self.infer_program always build cinn pass without cache

* fix infer op size
```
- fix: standaloneExecutor return empty array while setting FLAGS_use_cinn=true (#49698) · c79befa0
  Committed by kangguangli on Jan 11, 2023
- add FusedLinear pass (#49606) · 0f08a432
  Committed by yuehuayingxueluo on Jan 11, 2023

* add FusedLinear pass

* add fused_op_list and rename PASSES to OP_FUSION

* add fused_passes_list to constants.py

* add test_passes.py

* fix test_fused_passes.py

* fix add if float(paddle.version.cuda()) >= 11.6:

* renamed test_fused_passes.py

* fix CMakeList.txt

- fix save_combine_op (#49695) · 7a4f09f1
  Committed by duanyanhui on Jan 11, 2023
- [rm fluid] dgc_optimizer (#49714) · 1bdb7960
  Committed by wangzhen38 on Jan 11, 2023
- Compile fix (#49690) · 2fe896df
  Committed by wenbin on Jan 11, 2023
```
* compile fix

* fix compile

* compile fix
```
- fix qk bias for multihead (#49702) · 6578da51
  Committed by Wangzheee on Jan 11, 2023
- [Dy2St] Remove ProgramTranslator (#49628) · 2bb28f31
  Committed by Ryan on Jan 11, 2023

* add enable_to_static and drop some methods of ProgramTranslator

* fix code style

* fix can't import enable_to_static and update unittest

* change unittest and rollback code of PT

* fix can't import as of utils

* roll back PT

* fix roll back

* add some unittest

* add unittest and fix codestyle bug in api.py

* finish all unittest

* remove ProgramTranslator

* fix code style

* restore test_program_translator

* api.py remove get_func

* TestDygraphToStaticCode

* fix check_type and import err

* roll back PT without getcode

* roll back pt with get_code

* convert_to_static

* fix import __all__

- fix hsigmoid_loss (#49549) · 8f0adcb5
  Committed by Linjie Chen on Jan 11, 2023
- Add input check for NLLLoss (#49547) · 08bf1b49
  Committed by Linjie Chen on Jan 11, 2023
```
* fix nll_loss

* fix nll_loss

* update

* update

* update

* fix
```
- [XPU] update xpu.cmake to 20230110 (#49681) · 203f9594
  Committed by houj04 on Jan 11, 2023
- rm retain_grad_flag for tests part0 (#49655) · a504508c
  Committed by 姜永久 on Jan 11, 2023

* rm retain_grad_flag for tests

* modify transpose op

* retain grads for xpu tests

* lint

* modify xpu test

Jan 10, 2023 · 4 commits

- Optimization for StackGradCUDAKernel for last dimension stack case. (#48992) · 0cae5c7f
  Committed by limingshu on Jan 10, 2023

* add stack grad kernel optimization

* add basic optimization kernel for stack_grad_kernel

* optimization of stack_grad_kernel for last dim stack and change code format with pre-commit

- Use `CommContextManager` to init comm op using gloo backend (#49666) · 05df6973
  Committed by Wen Sun on Jan 10, 2023

* refactor: gloo comm context migration

* fix: headers & avoid mutable_data usage

* fix: cmake gloo dep

* style: rename funcs

* refactor: move to new files

* fix: gloo deps

* refactor: simplify create device

- [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss (#49616) · 98693428
  Committed by FlyingQianMM on Jan 10, 2023

* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss

* [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor

- fix coverage_ci_bug (#49234) · 72b2e486
  Committed by risemeup1 on Jan 10, 2023

PaddlePaddle / Paddle · synced successfully about 1 year ago
