提交 · 690d7a6922d9c6246bfb57ccc37eb647ab6ef3c9 · PaddlePaddle / Paddle

13 1月, 2023 16 次提交
- Z
  [inference][trt]set output data type of trt network (#49712) · 690d7a69
  由 Zhang Jun 提交于 1月 13, 2023
```
* update trt engine to set in/out data type

* update

* Update engine.cc

* Update engine.cc

* update

* set engine output type before freeze the network

* update

* update trt autoscan ut

* update

* update ut

* fix equal bug, update ut

* fix cast and equal ut

* update cast ut using TRT < 8.4

* set datatype from scope

* check output var is nullptr

* Update op_converter.h

* update tensorrt_engine_op_test ut

* update
```
  690d7a69
- D
  [Custom Device] Clear ProcessGroup Manually (#49182) · a923a757
  由 duanyanhui 提交于 1月 13, 2023
```
* clear ProcessGroupCustom manually

* fix bug

* fix bug

* move destroy ProcessGroup to ProcessGroupIdMap

* enable destroy to all device

* remove unused comments

* change to internal api

* Update process_group.cc

* Update process_group.cc
```
  a923a757
- D
  [Custom Device] update get_device to custom and add custom_device api (#49721) · bd165b94
  由 duanyanhui 提交于 1月 13, 2023
```
* update get_device to custom

* add custom_device api

* rm is_compiled_with_custom_device from framework

* add todo comments
```
  bd165b94
- J
  【Prim】Support elementwise related VJP with primitives (#49784) · 561f9013
  由 Jiabin Yang 提交于 1月 13, 2023
```
* support elementwise base func

* fix compiling error and add test

* remove additional param

* support vjp for div using comp

* remove additional change

* fix dy2st error with magic num

* fix dy magic num

* another magic

* another magic

* add more test

* fix windows problem

* another magic

* fix windows compile

* invoke ci

* add skip rename strategy

* support add vjp

* fix test_tanh

* support add with new axis cal

* fix resnet and some test

* add composite log

* support sub vjp
```
  561f9013
- W
  
  update fluid api. (#49731) · dd827bbe
  由 wuhuachaocoding 提交于 1月 13, 2023
  
  dd827bbe
- W
  [Eager] polish some apis logic (#49733) · a1772bb8
  由 Weilong Wu 提交于 1月 13, 2023
```
* [Eager] polish some apis logic

* polish api logic
```
  a1772bb8
- W
  
  fix a bug of stage2 offload. (#49767) · 1c8531ce
  由 wuhuachaocoding 提交于 1月 13, 2023
  
  1c8531ce
- J
  kunlun add support for c_concat and c_split (#49757) · a09b9a3f
  由 jameszhang 提交于 1月 13, 2023
```
* kunlun add support for c_concat and c_split

* replace mutable_data() and ShareDataWith()
```
  a09b9a3f
- Y
  
  add xpu adagrad and where_grad kernels (#49701) · a99c3cd4
  由 ykkk2333 提交于 1月 13, 2023
  
  a99c3cd4
- J
  fix xpu unittest issue (#49760) · ddc8a726
  由 jameszhang 提交于 1月 13, 2023
```
* fix xpu unittest issue: zero_dim_tensor

* deal with leftout issue introduced by #49470
```
  ddc8a726
- L
  
  Add unitest for set_value, set_value_grad. test=kunlun (#49773) · 5e722245
  由 Leo Guo 提交于 1月 13, 2023
  
  5e722245
- [Zero-Dim] add static graph gradient test method for 0D Tensor input (#49755) · 5fd115f3
  由 zhouweiwei2014 提交于 1月 13, 2023
  
  5fd115f3
- W
  
  add prelu & prelu_grad op for xpu (#49672) · 8d512b8f
  由 wangshengxiang 提交于 1月 13, 2023
  
  8d512b8f
- W
  [PHI] rrelu add yaml (#49779) · 8447f876
  由 Weilong Wu 提交于 1月 13, 2023
```
* [PHI] rrelu add yaml

* polish

* polish
```
  8447f876
- W
  
  update reader in sharding unit test. (#49652) · 163c6a9e
  由 wuhuachaocoding 提交于 1月 13, 2023
  
  163c6a9e
- W
  [Eager] polish some api logic (#49717) · 609b50a8
  由 Weilong Wu 提交于 1月 13, 2023
```
* [Eager] polish some api logic

* fix split

* revover
```
  609b50a8
12 1月, 2023 9 次提交
- S
  lerp support 0 Tensor (#49667) · 8cd0d5b3
  由 sunli 提交于 1月 12, 2023
```
* lerp support 0 Tensor

* fix lerp grad

* fix lerp zero test

* fix 0D + ND/ND + 0D

* fix check

* update code

* fix lerp infer shape

* static backward test

* updata static graph test
```
  8cd0d5b3
- Z
  
  move fuild.contrib.mixed_precision to paddle.static.amp (#49412) · 69d01eb9
  由 zhangkaihuo 提交于 1月 12, 2023
  
  69d01eb9
- J
  Fix reduce func bug in process_group_bkcl (#49749) · 8e291bf7
  由 jameszhang 提交于 1月 12, 2023
```
* Fix reduce func bug in process_group_bkcl

Also catch up with a recent process_group PR that failed to add XPU branch.
Note that reduce is still accomplished by allreduce for xpu. Fix this should
xccl lib be updated.

* fix compile issue for non-XPU
```
  8e291bf7
- W
  more preln_gn patterns (#49728) · adcb0039
  由 wenbin 提交于 1月 12, 2023
```
* compile fix

* fix compile

* compile fix

* add more preln
```
  adcb0039
- F
  [Zero-Dim] support input 0D Tensor for fmax/fmin/complex api (#49730) · a015f815
  由 FlyingQianMM 提交于 1月 12, 2023
```
* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss

* [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor

* [Zero-Dim] support input 0D Tensor for fmax,fmin,complex
```
  a015f815
- W
  
  [ci unnitest fix] dgc optimizer (#49741) · 81ec63a4
  由 wangzhen38 提交于 1月 12, 2023
  
  81ec63a4
- Y
  
  change test class's name (#49729) · 280677c5
  由 yuehuayingxueluo 提交于 1月 12, 2023
  
  280677c5
- Z
  [AutoParallel] recovery annotation (#49665) · 5c9c1a39
  由 zhaoyingli 提交于 1月 12, 2023
```
* recovery annotation

* bugfix
```
  5c9c1a39
- N
  remove Travis CI build status badge from README (#49747) · cc3b2009
  由 Nyakku Shigure 提交于 1月 12, 2023
```
* remove Travis CI build status badge from README.md

* empty commit, test=document_fix
```
  cc3b2009
11 1月, 2023 11 次提交

W

refactor: rm fluid deps in fleet (#49724) · 7d46d9f9
由 Wen Sun 提交于 1月 11, 2023

7d46d9f9

Add API for quantization-aware training in dygraph mode (#49398) · b53888e7

由 whs 提交于 1月 11, 2023

* Add tools for quantization-aware training
1. Expose an API named paddle.quantization.QAT
2. Define a wrapper class to insert quanters into model for QAT
3. Add some functions in QuantConfig for QAT
4. Add unittest for QAT

* Add QuantedConv2D and QuantedLinear for QAT

* Add paddle.nn.quant.qat to setup.py

b53888e7

W

refactor: rm fluid deps in distributed communication (#49722) · e0b50269
由 Wen Sun 提交于 1月 11, 2023

e0b50269
N

Update the style of print for low precision op list (#49648) · 395520f1
由 niuliling123 提交于 1月 11, 2023

395520f1
A
[D2SCinn]Fix self.infer_program always build cinn pass without cache (#49696) · 18a7e13f
由 Aurelius84 提交于 1月 11, 2023
```
* [D2SCinn]Fix self.infer_program always build cinn pass without cache

* fix infer op size
```
18a7e13f

add FusedLinear pass (#49606) · 0f08a432

由 yuehuayingxueluo 提交于 1月 11, 2023

* add FusedLinear pass

* add fused_op_list and renname PASSES to OP_FUSION

* add fused_passes_list to constants.py

* add test_passes.py

* fix test_fused_passes.py

* fix add if float(paddle.version.cuda()) >= 11.6:

* renamed test_fused_passes.py

* fix CMakeList.txt

0f08a432

W

[rm fluid] dgc_optimizer (#49714) · 1bdb7960
由 wangzhen38 提交于 1月 11, 2023

1bdb7960

[Dy2St] 移除 ProgramTranslator (#49628) · 2bb28f31

由 Ryan 提交于 1月 11, 2023

* add enable_to_static and drop some methods of ProgramTranslator

* fix code style

* fix cant import enable_to_static and update unitest

* change unitest and rollback code of PT

* fix can't import as of utils

* roll back PT

* fix roll back

* add some unitest

* add unitest and fix codestyle bug in api.py

* finish all unitest

* remove ProgramTranslator

* fix code style

* restore test_program_translator

* api.py remove get_func

* TestDygraphToStaticCode

* fix check_type and import err

* roll back PT without getcode

* roll back pt with get_code

* convert_to_static

* fix import __all__

2bb28f31

L

fix hsigmoid_loss (#49549) · 8f0adcb5
由 Linjie Chen 提交于 1月 11, 2023

8f0adcb5
L
Add input check for NLLLoss (#49547) · 08bf1b49
由 Linjie Chen 提交于 1月 11, 2023
```
* fix nll_loss

* fix nll_loss

* update

* update

* update

* fix
```
08bf1b49

姜

rm retain_grad_flag for tests part0 (#49655) · a504508c

由姜永久提交于 1月 11, 2023

* rm retain_grad_flag for tests

* modify transpose op

* retain grads for xpu tests

* lint

* modify xpu test

a504508c

10 1月, 2023 4 次提交
- W
  Use `CommContextManager` to init comm op using gloo backend (#49666) · 05df6973
  由 Wen Sun 提交于 1月 10, 2023
```
* refactor: gloo comm context migration

* fix: headers & avoid mutable_data usage

* fix: cmake gloo dep

* style: rename funcs

* refactor: move to new files

* fix: gloo deps

* refactor: simplify create device
```
  05df6973
- F
  [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss (#49616) · 98693428
  由 FlyingQianMM 提交于 1月 10, 2023
```
* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss

* [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor
```
  98693428
- X
  
  reorganize the prim unittests struct (#49679) · 4370a91a
  由 Xiaoxu Chen 提交于 1月 10, 2023
  
  4370a91a
- Z
  
  update reduce ut (#49689) · cfe55f31
  由 Zhang Jun 提交于 1月 10, 2023
  
  cfe55f31

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功