提交 · f4c42389f4a12b7c37ab7ed86d3907d81f2be156 · 机器未来 / Paddle

22 6月, 2022 5 次提交

Z
fix the bug that _DataLoaderIterMultiProcess use time to generate the seed (#43318) (#43702) · f4c42389
由 Zhang Ting 提交于 6月 22, 2022
```
 fix the bug that _DataLoaderIterMultiProcess use time to generate the seed

cherry-pick #43318
```
f4c42389

[cherry pick] Support optional residual add in fused ops and slice large... · 0660d5f2

由 Zhang Ting 提交于 6月 22, 2022

[cherry pick] Support optional residual add in fused ops and slice large tensor for cudnn_softmax (#43719)

 [cherry pick] Support optional residual add in fused ops and slice large tensor for cudnn_softmax

cherry-pick #43635 #43681 #43474

0660d5f2

L
[Cherrypick 2.3] fix decode jpeg example code (#42752) · a4c898cf
由 LielinJiang 提交于 6月 22, 2022
```
* fix decode_jpeg example code

* fix decode_jpeg example code
```
a4c898cf

set_state_dict not use state_dict hook (#43407) (#43711) · 0fb66355

由 zhangbo9674 提交于 6月 22, 2022

在 amp-o2功能开发过程中，为了支持指定网络存储数据类型的功能，添加state_dict hook功能，但是在Layer的set_state_dict是通过state_dict获取网络参数并加载的，hook接口的存在导致 set_state_dict无法加载到原本网络参数。
本pr通过增加hook控制开关，在set_state_dict中禁用hook解决该问题。

详见pr43407

0fb66355

[FIx bug]layer to 'NoneType' object has no attribute 'place' (#43597) (#43717) · 0b879318

由 zhangbo9674 提交于 6月 22, 2022

bug：
当class Layer的_buffers中有参数为None的时候，调用to()方法将会报layer to 'NoneType' object has no attribute 'place'的错误。
修复方法：
to()方法增加对_buffers中None类型参数的判断，如果为None，跳过该参数的处理。

0b879318

21 6月, 2022 2 次提交
- J
  [Cherry-pick ] to Release/2.3, Add prefetch_factor in dataloader (#43674) · af415bc2
  由 Jackwaterveg 提交于 6月 21, 2022
```
* fix usage of prefetch_factor

* add assert

* add docstring and change prefetch_factor when num_workers=0

* fix doc
```
  af415bc2
- G
  [cherry pick #43088 #40664] Add float16 to fake quantize/dequantize OP (#43689) · 9783e887
  由 Guanghua Yu 提交于 6月 21, 2022
```
* cherry pick #43088 #40664

* fix clang format
```
  9783e887
20 6月, 2022 5 次提交
- [cherry-pick]to Release/2.3,modify scale op xpu unittest (#43657) · 6262efb5
  由 z8hanghuan 提交于 6月 20, 2022
```
* modify xpu.cmake,*test=kunlun (#41832)

* modify xpu.cmake,*test=kunlun

* modify xpu.cmake,*test=kunlun

* modify xpu.cmake,*test=kunlun

* modify xpu.cmake,*test=kunlun

* support bilstm,*test=kunlun

* [cherry-pick]support multi_layer of bilstm,*test=kunlun

* [cherry-pick]refactor sum unit test,*test=kunlun (#43561)
```
  6262efb5
- X
  [Cherry pick] Einsum memory optimization PR #43397 (#43554) · 638b69dc
  由 xiongkun 提交于 6月 20, 2022
```
* cherry pick from #43397

* fix code
```
  638b69dc
- S
  
  fix unittest (#43609) (#43617) · 68d5c12b
  由 Shang Zhizhou 提交于 6月 20, 2022
  
  68d5c12b
- Z
  
  place all save/load path into temporary directory (#43652) · a5ccc713
  由 zhaoyingli 提交于 6月 20, 2022
  
  a5ccc713
- Z
  [Cherry-Pick] place all save/load path into temporary directory (#43316) (#43651) · 0f16ccf5
  由 zhaoyingli 提交于 6月 20, 2022
```
* place all save/load path into temporary directory

* rm no need unittest
```
  0f16ccf5
18 6月, 2022 1 次提交
- G
  Cherry pick 42508 (#43601) · bfe21ff3
  由 gongweibao 提交于 6月 18, 2022
```
* fix test

* fix test.
```
  bfe21ff3
17 6月, 2022 3 次提交

Y

cherry pick 43581 (#43596) · 2eb60ddb
由 YuanRisheng 提交于 6月 17, 2022

2eb60ddb
H
[Dygraph] Fix barrier bugs of ProcessGroup in Eager Mode (#43589) · 3689a126
由 Haohongxiang 提交于 6月 17, 2022
```
* fix pg bugs

* update
```
3689a126

[cherry-pick 2.3] Cherry parallel fused transformer api (#43505) · 19b87aec

由 WangXi 提交于 6月 17, 2022

* Rename dropout is test (#43098)

* replace dropout_is_test with is_test.
* improve atol on a100.

* fused_attention fused_feedforward api support Model Tensor Parallel (#42985)

* fix is_test bug in fused_feedforward. (#43508)
Co-authored-by: NLi Min <11663212+limin2021@users.noreply.github.com>

19b87aec

16 6月, 2022 5 次提交

[cherry pick] Unit test with tempfile to place the temporary files (#43522) · 1a660c8a

由 zhangbopd 提交于 6月 16, 2022

Use tempfile for unit test & custom op test to replace temporary files to ensure that all temporary files will be deleted normally after a single measurement, avoiding the usage of disk files.
The PR only involves single-test and op test modifications and does not affect existing functionality.
Release/2.3 branch modified in PR43521;

1a660c8a

Q
[Cherry-pick] Fix ut tempfile v23 (#43387) · 24843fcb
由 Qi Li 提交于 6月 16, 2022
```
* fix unit test temp file, test=develop (#43155)

* add cleanup code, test=develop (#43305)
```
24843fcb

[Cherry-pick] Fix numpy 1.20+ deprecation warnings (#43513) · 689e0999

由 Qi Li 提交于 6月 16, 2022

* Fix numpy 1.20+ deprecation warnings (#42929)

* Replace np.bool/np.bool8 with np.bool_

* Replace np.object with np.object_

* Replace np.complex with np.complex128

* Replace np.float with np.float64

* Replace np.int with np.int_

* Rerun pre-commit for newer pre-commit configuration

* Use builtin bool instead of np.bool_ based on the context

* fix mode dtype
Co-authored-by: Nzlsh80826 <rewang@nvidia.com>

689e0999

Z

cherry-pick adamw unittest (#43498) · 0cdde0b4
由 zhaoyingli 提交于 6月 16, 2022

0cdde0b4
G
[cherry-pick]Add progress bar and speed up Quantization Pass (#43454) · abb0b2d6
由 Guanghua Yu 提交于 6月 16, 2022
```
* Add progress bar and speed up Quantization Pass

* fix typo
```
abb0b2d6

14 6月, 2022 2 次提交

[ CherryPick ] Cherry pick for einsum optimization. (#43468) · 22e75d92

由 xiongkun 提交于 6月 14, 2022

* [EinsumOp] Polish forward logic and backward logic for optimize (#42603)

* change logic for optimize

* modifty

* merge

* change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0 (#43010)

* [EinsumOp] Make EinsumOp support bfloat16. (#43085)

* change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0

* make EInsumOP support bf16

* add unittest for BF16

* add condition for test_BF16

* fix bugs

* fix

* change the backward api to fit einsum op

22e75d92

Use tempfile to place all the temporary files. (#43392) · afd0c1db

由 freeliuzc 提交于 6月 14, 2022

使用 tempfile 替换临时文件，保证在单测结束后，所有临时文件都会被正常的删除，避免占用磁盘文件。
此 PR 仅涉及单测修改，不影响现有功能。
develop 分支修改在 PR 43376

afd0c1db

09 6月, 2022 2 次提交
- G
  cherry pick #42255 (fuse conv + bn in QAT) and #42378 (support skip_op_list in PTQ) (#43301) · 0a00fc4e
  由 Guanghua Yu 提交于 6月 09, 2022
```
* support fuse conv and bn in QAT (#42255)

* support skip_op_list in PostTrainingQuantization (#42378)

* fix unittest
```
  0a00fc4e
- G
  
  Modify quantization use tempfile to place the temporary files (#43281) · f4e09397
  由 Guanghua Yu 提交于 6月 09, 2022
  
  f4e09397
07 6月, 2022 2 次提交
- Z
  
  fix the problem of slice infer shape (#42568) (#43246) · f1b4e4d5
  由 zyfncg 提交于 6月 07, 2022
  
  f1b4e4d5
- X
  
  fix memory leakage (#43141) (#43220) · e09803c5
  由 xiongkun 提交于 6月 07, 2022
  
  e09803c5
30 5月, 2022 1 次提交
- W
  [Dy2St]Fix cond_block_grad error when handle no need grad vras (#43034) (#43084) · e6e85b35
  由 WangZhen 提交于 5月 30, 2022
```
* Fix cond_block_grad error when handle no need grad vras

* Add comment and UT
```
  e6e85b35
26 5月, 2022 1 次提交
- S
  make some test run with old executor in specified windows server (#42777) (#42981) · 7a223585
  由 Sing_chan 提交于 5月 26, 2022
```
cherry-pick PR #42777
```
  7a223585
23 5月, 2022 1 次提交

Update metrics.py · d5b6eec2

由 onecatcn 提交于 5月 19, 2022

the doc was editted based on the discussion in the issue:
INT32 Failed on paddle.metric.accuracy: https://github.com/PaddlePaddle/Paddle/issues/42845

d5b6eec2

19 5月, 2022 1 次提交
- A
  [Dy2Stat]Modify all jit.save path into tempfile under dygraph_to_static directory (#42842) (#42860) · 84840481
  由 Aurelius84 提交于 5月 19, 2022
```
* [Dy2Stat]Modify all jit.save path into tempfile

* [Dy2Stat]Modify all jit.save path into tempfile
```
  84840481
17 5月, 2022 1 次提交
- C
  put_record_event_in_python_on_timeline_python (#42555) (#42790) · a40e60f7
  由 chenjian 提交于 5月 17, 2022
```
* put_record_event_in_python_on_timeline_python

* fix
```
  a40e60f7
16 5月, 2022 1 次提交
- W
  fix sample code error of paddle.lerp, test=document_fix (#42753) · 07029e0c
  由 wuhuanzhou 提交于 5月 16, 2022
```
修复paddle.lerp中示例代码错误。
```
  07029e0c
10 5月, 2022 1 次提交

[cherry-pick][MLU] support add callback to stream and profiler (#42115) · 25124d7f

由 fwenguang 提交于 5月 10, 2022

* [MLU] add mlu new profiler (#41138)

* [MLU] add mlu new profiler

* fix format

* [MLU] support add callback to stream (#41831)

* [MLU] add gather mlu kernel (#41969)

* [MLU] add mlu activation kernels (#41751)

25124d7f

09 5月, 2022 1 次提交

[Cherry-pick][IPU] merge recent changes (#42078) (#42582) · 1f9b60df

由 Allen Guo 提交于 5月 09, 2022

    add class NameScopeHelper for adding namescope info
    添加更多 种类优化器状态的映射
    为 IpuStrategy 添加 compilation_progress_logger option 用于输出 编译进度
    部分代码清理和杂项优化

1f9b60df

07 5月, 2022 2 次提交
- W
  
  remove the test case for the matmul_v2_mkldnn (#42530) · 54ef3d56
  由 wawltor 提交于 5月 07, 2022
  
  54ef3d56
- R
  [cherry-pick] Fix UT timeout problem for cuda_managed_memory_test and test_tensordot (#42492) · c9d156b1
  由 Ruibiao Chen 提交于 5月 07, 2022
```
* Reduce time variation for cuda_managed_memory_test (#42458)

* Disable standalone executor for test_tensordot (#42476)
```
  c9d156b1
06 5月, 2022 1 次提交
- L
  [cherry-pick] fix wrong place in ut (#42488) · 35ed11f3
  由 Leo Chen 提交于 5月 06, 2022
```
* fix wrong place

* skip bf16 test if not supported (#42503)
```
  35ed11f3
05 5月, 2022 2 次提交
- W
  
  fix unittest of conv2d due to V100 do not support bfloat16 (#42496) · 71d3b06c
  由 wangxinxin08 提交于 5月 05, 2022
  
  71d3b06c
- W
  
  fix the v100 cuda11.2 matmul_v2 and elementwise_div bug (#42479) · e052fde7
  由 wawltor 提交于 5月 05, 2022
  
  e052fde7

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致