提交 · afd0c1db8a7bf0d173a5d4d7d4bacfd7b85afe9a · BaiXuePrincess / Paddle

14 6月, 2022 1 次提交

Use tempfile to place all the temporary files. (#43392) · afd0c1db

由 freeliuzc 提交于 6月 14, 2022

使用 tempfile 替换临时文件，保证在单测结束后，所有临时文件都会被正常的删除，避免占用磁盘文件。
此 PR 仅涉及单测修改，不影响现有功能。
develop 分支修改在 PR 43376

afd0c1db

09 6月, 2022 1 次提交
- G
  
  Modify quantization use tempfile to place the temporary files (#43281) · f4e09397
  由 Guanghua Yu 提交于 6月 09, 2022
  
  f4e09397
30 5月, 2022 1 次提交
- W
  [Dy2St]Fix cond_block_grad error when handle no need grad vras (#43034) (#43084) · e6e85b35
  由 WangZhen 提交于 5月 30, 2022
```
* Fix cond_block_grad error when handle no need grad vras

* Add comment and UT
```
  e6e85b35
26 5月, 2022 1 次提交
- S
  make some test run with old executor in specified windows server (#42777) (#42981) · 7a223585
  由 Sing_chan 提交于 5月 26, 2022
```
cherry-pick PR #42777
```
  7a223585
19 5月, 2022 1 次提交
- A
  [Dy2Stat]Modify all jit.save path into tempfile under dygraph_to_static directory (#42842) (#42860) · 84840481
  由 Aurelius84 提交于 5月 19, 2022
```
* [Dy2Stat]Modify all jit.save path into tempfile

* [Dy2Stat]Modify all jit.save path into tempfile
```
  84840481
10 5月, 2022 1 次提交

[cherry-pick][MLU] support add callback to stream and profiler (#42115) · 25124d7f

由 fwenguang 提交于 5月 10, 2022

* [MLU] add mlu new profiler (#41138)

* [MLU] add mlu new profiler

* fix format

* [MLU] support add callback to stream (#41831)

* [MLU] add gather mlu kernel (#41969)

* [MLU] add mlu activation kernels (#41751)

25124d7f

09 5月, 2022 1 次提交

[Cherry-pick][IPU] merge recent changes (#42078) (#42582) · 1f9b60df

由 Allen Guo 提交于 5月 09, 2022

    add class NameScopeHelper for adding namescope info
    添加更多 种类优化器状态的映射
    为 IpuStrategy 添加 compilation_progress_logger option 用于输出 编译进度
    部分代码清理和杂项优化

1f9b60df

07 5月, 2022 2 次提交
- W
  
  remove the test case for the matmul_v2_mkldnn (#42530) · 54ef3d56
  由 wawltor 提交于 5月 07, 2022
  
  54ef3d56
- R
  [cherry-pick] Fix UT timeout problem for cuda_managed_memory_test and test_tensordot (#42492) · c9d156b1
  由 Ruibiao Chen 提交于 5月 07, 2022
```
* Reduce time variation for cuda_managed_memory_test (#42458)

* Disable standalone executor for test_tensordot (#42476)
```
  c9d156b1
06 5月, 2022 1 次提交
- L
  [cherry-pick] fix wrong place in ut (#42488) · 35ed11f3
  由 Leo Chen 提交于 5月 06, 2022
```
* fix wrong place

* skip bf16 test if not supported (#42503)
```
  35ed11f3
05 5月, 2022 2 次提交
- W
  
  fix unittest of conv2d due to V100 do not support bfloat16 (#42496) · 71d3b06c
  由 wangxinxin08 提交于 5月 05, 2022
  
  71d3b06c
- W
  
  fix the v100 cuda11.2 matmul_v2 and elementwise_div bug (#42479) · e052fde7
  由 wawltor 提交于 5月 05, 2022
  
  e052fde7
03 5月, 2022 1 次提交
- H
  Hotfix Release 2.3 Bug for CUDA 11.2 (#42438) · 713d5a4b
  由 Huihuang Zheng 提交于 5月 03, 2022
```
* Fix Release 2.3 Bug

* Fix format
```
  713d5a4b
30 4月, 2022 4 次提交
- A
  [Dy2Stat]Fix losting pre/post hook from outermost layer while jit.save (#42273) (#42388) · 16ef2b2e
  由 Aurelius84 提交于 4月 30, 2022
```
* [Dy2Stat]Fix losting pre/post hook from outermost layer while jit.save

* fix kwargs

* fix unittest
```
  16ef2b2e
- W
  
  [Eager] Support test_diff_op switch to eager mode (#42360) (#42392) · 1e3d2e4a
  由 Weilong Wu 提交于 4月 30, 2022
  
  1e3d2e4a
- X
  Make einsum_v2 support multi-operands (#42327) (#42397) · 34352fcd
  由 xiongkun 提交于 4月 30, 2022
```
* Extend python einsum interface to make einsum_v2 support multi-operands and switch it to default.

* add opt_einsum dependence

* add yaml and support eager model

* fix by code review
```
  34352fcd
- R2.3/fix pad3d infer shape (#42414) · 2dce1e88
  由 littletomatodonkey 提交于 4月 30, 2022
```
* fix pad3d infer shape

* fix pad3d

* fix pad default value

* fix order

* add unit test

* fix unittest for ci coverage

* add ndhwc check
```
  2dce1e88
29 4月, 2022 2 次提交

[cherry-pick 2.3] Add fused_multi_transformer op to optimize transformer... · 50bfe420

由 WangXi 提交于 4月 29, 2022

[cherry-pick 2.3] Add fused_multi_transformer op to optimize transformer generation performance (#42311)

* Add fused_multi_transformer op to optimize transformer generation performance (#41814)

* fix fused_multi_transformer compile failed in cuda arch < sm53 (#42315)

* fix ci timeout

50bfe420

Z
[cherry-pick] Fix bug of building InferMetaContext (#42211) (#42399) · 765fbb59
由 zyfncg 提交于 4月 29, 2022
```
* fix bug of building InferMetaContext (#42211)

* add unitest
```
765fbb59

28 4月, 2022 2 次提交

Add C++ EinsumOp which support 2 operands einsum. (#42105) (#42357) · d04a68d3

由 xiongkun 提交于 4月 28, 2022

* full api fix

* when out is None, go old dygraph mode

* by static check

* first version: support 2-inputs forwards. TODO: 1. backward  2. BroadCast  3. MultiVariable

* time out -> 120

d04a68d3

[cherry-pick] implement autotune python API(42299) (#42301) · b37e626e

由 Zhang Ting 提交于 4月 28, 2022

* implement autotune python API

* fix doc

* fix windows error

* fix doc and enable auto-tuning when config is None

* fix windows error

* fix doc

b37e626e

27 4月, 2022 3 次提交
- P
  
  fix format · 880c2a94
  由 pangyoki 提交于 4月 26, 2022
  
  880c2a94
- P
  
  solve conflict · eade1fd9
  由 pangyoki 提交于 4月 25, 2022
  
  eade1fd9
- W
  [Eager] Remove retain_grad_flag in accumulation_nade, add is_new_grad args in... · 56b93800
  由 Weilong Wu 提交于 4月 27, 2022
```
[Eager] Remove retain_grad_flag in accumulation_nade, add is_new_grad args in operator (#42240) (#42290)
```
  56b93800
26 4月, 2022 3 次提交
- W
  
  [Eager] Support numpy.ndarry in CastNumpy2Scalar (#42136) (#42213) · 983fcb56
  由 Weilong Wu 提交于 4月 26, 2022
  
  983fcb56
- fix python3.10 compile bug on windows (#42140) (#42180) · 42297995
  由 zhouweiwei2014 提交于 4月 26, 2022
```
cherry-pick #42140
```
  42297995
- W
  [Eager] Support div(scalar) in eager mode (#42148) (#42214) · a887ffd0
  由 Weilong Wu 提交于 4月 26, 2022
```
* [Eager] Support div scalar in eager mode

* Updated and remove debug logs

* Remove list, use 'or' directly

* Remove useless statement
```
  a887ffd0
25 4月, 2022 1 次提交
- W
  
  [Eager] Remove redundancy code, fix fp16 case (#42169) (#42215) · e4da34fd
  由 Weilong Wu 提交于 4月 25, 2022
  
  e4da34fd
24 4月, 2022 3 次提交
- Z
  
  refine optest logic for bfloat16 (#42151) (#42165) · 5211282d
  由 zhangbo9674 提交于 4月 24, 2022
  
  5211282d
- C
  [cherry-pick]Reduce performance influence by record event in python (#42142) · 338fcc10
  由 chenjian 提交于 4月 24, 2022
```
* fix kenrel name apperance (#42071)

* Reduce performance influence by record event in python (#42040)

* optimize performance

* fix

* improve coverage

* fix

* fix
```
  338fcc10
- W
  [Cherry-pick, Eager] Fix CastPyArg2scalar for max value of int64 (#42098) (#42129) · b543998f
  由 Weilong Wu 提交于 4月 24, 2022
```
* [Eager] Fix CastPyArg2scalar for max value of int64 (#42098)

* [Eager] Fix CastPyArg2Scalar in Long case

* Add more test cases for paddle.clip

* Use PyLong_AsLongLong

* Fix merge conflicts
```
  b543998f
22 4月, 2022 4 次提交
- P
  Cherry pick PR41990, add _grad_name and _grad_value for eager tensor (#41990) (#42079) · 3475c2bf
  由 pangyoki 提交于 4月 22, 2022
```
* add _grad_name and _grad_value for eager tensor

* fix paddle_enforce

* fix paddle_enforce 2

* fix grad_name

* _grad_value return lodtensor rather than tensor

* fix
```
  3475c2bf
- J
  
  Add UT (#42055) · 4f6aba87
  由 Jacek Czaja 提交于 4月 22, 2022
  
  4f6aba87
- B
  [Cherry-pick] sharding for eager tensor (#42054) · 6ad0f061
  由 Baibaifan 提交于 4月 22, 2022
```
* sharding_for_eager_tensor (#41415)

* fix_sharding_copy_right (#41849)
```
  6ad0f061
- A
  [IPU] add mixed-precission support for ipu (#41733) (#41906) · c09b1d68
  由 Allen Guo 提交于 4月 22, 2022
```
add mixed-precission support for ipu

cherry-pick from #41733
```
  c09b1d68
21 4月, 2022 5 次提交
- W
  
  [Eager] Support numpy.narray as input for eager expand (#42043) (#42064) · ef0b5fdc
  由 Weilong Wu 提交于 4月 21, 2022
  
  ef0b5fdc
- W
  
  [Eager] remove useless logic (#42020) (#42061) · 218e759b
  由 Weilong Wu 提交于 4月 21, 2022
  
  218e759b
- Z
  
  support bce_loss and bce_loss_grad in XPU, test=kunlun (#41610) · b1ba98ca
  由 zhangyikun02 提交于 4月 13, 2022
  
  b1ba98ca
- [cherry-pick]support multi_layer of bilstm,*test=kunlun (#42076) · 58f6d459
  由 z8hanghuan 提交于 4月 21, 2022
```
* modify xpu.cmake,*test=kunlun (#41832)

* modify xpu.cmake,*test=kunlun

* modify xpu.cmake,*test=kunlun

* modify xpu.cmake,*test=kunlun

* modify xpu.cmake,*test=kunlun

* support bilstm,*test=kunlun

* [cherry-pick]support multi_layer of bilstm,*test=kunlun
```
  58f6d459
- L
  [Cherry-pick] fix the bug for nccl barrier and alltoall (#42042) · 8a12f459
  由 lilong12 提交于 4月 21, 2022
```
* fix_nccl_barrier (#41970)

* be compatible with the old version of alltoall (#42007)
Co-authored-by: NBaibaifan <39549453+Baibaifan@users.noreply.github.com>
```
  8a12f459

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致