提交 · d5b6eec273a9f6fd1ddc1bbf1e676d058d09fab8 · PaddlePaddle / Paddle

23 5月, 2022 2 次提交
- O
  Update metrics.py · d5b6eec2
  由 onecatcn 提交于 5月 19, 2022
```
the doc was editted based on the discussion in the issue:
INT32 Failed on paddle.metric.accuracy: https://github.com/PaddlePaddle/Paddle/issues/42845
```
  d5b6eec2
- S
  【CI】run all demo ci before exit in windows (#42700) (#42897) · 2300d45f
  由 Sing_chan 提交于 5月 23, 2022
```
cherry-pick PR #42700
```
  2300d45f
19 5月, 2022 1 次提交
- A
  [Dy2Stat]Modify all jit.save path into tempfile under dygraph_to_static directory (#42842) (#42860) · 84840481
  由 Aurelius84 提交于 5月 19, 2022
```
* [Dy2Stat]Modify all jit.save path into tempfile

* [Dy2Stat]Modify all jit.save path into tempfile
```
  84840481
17 5月, 2022 2 次提交
- C
  
  fix trace op record event error (#42775) (#42789) · af79273d
  由 Chen Weihang 提交于 5月 17, 2022
  
  af79273d
- C
  put_record_event_in_python_on_timeline_python (#42555) (#42790) · a40e60f7
  由 chenjian 提交于 5月 17, 2022
```
* put_record_event_in_python_on_timeline_python

* fix
```
  a40e60f7
16 5月, 2022 1 次提交
- W
  fix sample code error of paddle.lerp, test=document_fix (#42753) · 07029e0c
  由 wuhuanzhou 提交于 5月 16, 2022
```
修复paddle.lerp中示例代码错误。
```
  07029e0c
11 5月, 2022 1 次提交
- A
  
  [Eager]Fix EagerTensor _copy_to memory overlap problem (#42668) (#42686) · d0e733dd
  由 Aurelius84 提交于 5月 11, 2022
  
  d0e733dd
10 5月, 2022 4 次提交
- J
  pdnode_compare (#42597) (#42633) · 403b503f
  由 JingZhuangzhuang 提交于 5月 10, 2022
```
* pdnode_compare

* panode compare

* pdnode_compare
```
  403b503f
- F
  [cherry-pick][MLU] support add callback to stream and profiler (#42115) · 25124d7f
  由 fwenguang 提交于 5月 10, 2022
```
* [MLU] add mlu new profiler (#41138)

* [MLU] add mlu new profiler

* fix format

* [MLU] support add callback to stream (#41831)

* [MLU] add gather mlu kernel (#41969)

* [MLU] add mlu activation kernels (#41751)
```
  25124d7f
- A
  set custom_nll_loss_op attr ignoreIndex to str (#42596) · 6c935e1d
  由 Allen Guo 提交于 5月 10, 2022
```
set attr ignoreIndex type to string for custom_nllloss_op

部分 cheery-pick of #42534
```
  6c935e1d
- Z
  
  fix bug of optional_tensor in amp logic (#42561) (#42577) · 37715dab
  由 zhangbo9674 提交于 5月 10, 2022
  
  37715dab
09 5月, 2022 1 次提交

[Cherry-pick][IPU] merge recent changes (#42078) (#42582) · 1f9b60df

由 Allen Guo 提交于 5月 09, 2022

    add class NameScopeHelper for adding namescope info
    添加更多 种类优化器状态的映射
    为 IpuStrategy 添加 compilation_progress_logger option 用于输出 编译进度
    部分代码清理和杂项优化

1f9b60df

07 5月, 2022 3 次提交
- W
  
  remove the test case for the matmul_v2_mkldnn (#42530) · 54ef3d56
  由 wawltor 提交于 5月 07, 2022
  
  54ef3d56
- F
  Reduce the number of threads per block of deformable_psroi_pooling to solve... · 44271ece
  由 FlyingQianMM 提交于 5月 07, 2022
```
Reduce the number of threads per block of deformable_psroi_pooling to solve the bug where too many resources requested for launch (PaddlePaddle#42531) (#42533)
```
  44271ece
- R
  [cherry-pick] Fix UT timeout problem for cuda_managed_memory_test and test_tensordot (#42492) · c9d156b1
  由 Ruibiao Chen 提交于 5月 07, 2022
```
* Reduce time variation for cuda_managed_memory_test (#42458)

* Disable standalone executor for test_tensordot (#42476)
```
  c9d156b1
06 5月, 2022 2 次提交
- L
  [cherry-pick] fix wrong place in ut (#42488) · 35ed11f3
  由 Leo Chen 提交于 5月 06, 2022
```
* fix wrong place

* skip bf16 test if not supported (#42503)
```
  35ed11f3
- W
  Fix the race condition in cumsum operator (#42205) (#42500) · 58f40144
  由 wawltor 提交于 5月 06, 2022
```
* Fix the race condition in cumsum operator

* Optimize cumsum operator
Co-authored-by: NLeo Chen <39020268+leo0519@users.noreply.github.com>
```
  58f40144
05 5月, 2022 3 次提交
- X
  
  fix bugs (#42495) · 590b4dbc
  由 xiongkun 提交于 5月 05, 2022
  
  590b4dbc
- W
  
  fix unittest of conv2d due to V100 do not support bfloat16 (#42496) · 71d3b06c
  由 wangxinxin08 提交于 5月 05, 2022
  
  71d3b06c
- W
  
  fix the v100 cuda11.2 matmul_v2 and elementwise_div bug (#42479) · e052fde7
  由 wawltor 提交于 5月 05, 2022
  
  e052fde7
04 5月, 2022 8 次提交

graph partition (#42472) · a3917625

由 seemingwang 提交于 5月 04, 2022

* enable graph-engine to return all id (#42319)

* enable graph-engine to return all id

* change vector's dimension

* change vector's dimension

* enlarge returned ids dimensions

* change sample result's structure to fit training (#42426)

* enable graph-engine to return all id

* change vector's dimension

* change vector's dimension

* enlarge returned ids dimensions

* add actual_val

* change vlog

* fix bug

* bug fix

* bug fix

* fix display test

* singleton of gpu_graph_wrapper

* change sample result's structure to fit training

* recover sample code

* fix

* secondary sample

* add graph partition

* fix pybind
Co-authored-by: NDesmonDay <908660116@qq.com>
Co-authored-by: NDesmonDay <908660116@qq.com>

a3917625

X
[cherry-pick 2.3] fix bug of batch_norm_grad kernel with fp16 (#42461) · a5745864
由 XiaoguangHu 提交于 5月 04, 2022
```
* fix bug of batch_norm_grad kernel with fp16

* format code
```
a5745864
H
fix paddle-ort python bug (#42464) (#42470) · 87e6149c
由 heliqi 提交于 5月 04, 2022
```
* fix paddle-ort python bug

* fix paddle-ort python bug
```
87e6149c
K

fix Tensor share memory in eager mode. test=develop (#42446) · 544352de
由 Kaipeng Deng 提交于 5月 04, 2022

544352de
G
[cherry-pick] fix PTQ unittest timeout (#42452) · 25318f6f
由 Guanghua Yu 提交于 5月 04, 2022
```
* fix PTQ unittest timeout

* fix ut
```
25318f6f
C
Fix problem with py3.6 and test for quant2_int8_lstm (#41420) (#42447) · 706b7b7f
由 cc 提交于 5月 04, 2022
```
Co-authored-by: Njoanna.wozna.intel <joanna.wozna@intel.com>
```
706b7b7f
X

fix bug when compiling with cusparse in CUDA version >=11.4 (#42456) · b57c132a
由 XiaoguangHu 提交于 5月 04, 2022

b57c132a
L
fix PIL sample mode deprecated warning (#42307) (#42451) · eae41b7d
由 LielinJiang 提交于 5月 04, 2022
```
* fix PIL sample mode deprecated warning

* compatible with old pil version
```
eae41b7d

03 5月, 2022 1 次提交
- H
  Hotfix Release 2.3 Bug for CUDA 11.2 (#42438) · 713d5a4b
  由 Huihuang Zheng 提交于 5月 03, 2022
```
* Fix Release 2.3 Bug

* Fix format
```
  713d5a4b
02 5月, 2022 1 次提交
- Z
  [Cherry-Pick]Fix test_cudnn_norm_conv and test_cudnn_bn_add_relu in CUDA11.2 (#42406) · 655c4981
  由 Zhang Zheng 提交于 5月 02, 2022
```
* Fix test_cudnn_norm_conv and test_cudnn_bn_add_relu in CUDA11.2

* no throw in V100 for some cases
```
  655c4981
01 5月, 2022 1 次提交
- C
  
  remove useless lod copy (#42425) · 778ec77b
  由 Chen Weihang 提交于 5月 01, 2022
  
  778ec77b
30 4月, 2022 6 次提交
- A
  [Dy2Stat]Fix losting pre/post hook from outermost layer while jit.save (#42273) (#42388) · 16ef2b2e
  由 Aurelius84 提交于 4月 30, 2022
```
* [Dy2Stat]Fix losting pre/post hook from outermost layer while jit.save

* fix kwargs

* fix unittest
```
  16ef2b2e
- W
  
  [Eager] Support test_diff_op switch to eager mode (#42360) (#42392) · 1e3d2e4a
  由 Weilong Wu 提交于 4月 30, 2022
  
  1e3d2e4a
- W
  
  [Eager] Support test_label_smooth_functional switch to eager mode (#42366) (#42393) · e2bc846f
  由 Weilong Wu 提交于 4月 30, 2022
  
  e2bc846f
- W
  
  [Eager] Support test_eigh_op switch to eager mode (#42379) (#42394) · ebb94504
  由 Weilong Wu 提交于 4月 30, 2022
  
  ebb94504
- X
  Make einsum_v2 support multi-operands (#42327) (#42397) · 34352fcd
  由 xiongkun 提交于 4月 30, 2022
```
* Extend python einsum interface to make einsum_v2 support multi-operands and switch it to default.

* add opt_einsum dependence

* add yaml and support eager model

* fix by code review
```
  34352fcd
- R2.3/fix pad3d infer shape (#42414) · 2dce1e88
  由 littletomatodonkey 提交于 4月 30, 2022
```
* fix pad3d infer shape

* fix pad3d

* fix pad default value

* fix order

* add unit test

* fix unittest for ci coverage

* add ndhwc check
```
  2dce1e88
29 4月, 2022 3 次提交

W
[cherry-pick 2.3] fix FusedResidualDropoutBias nan & fix lod_tensor_array gc (#42398) · 3b2bc0a0
由 WangXi 提交于 4月 29, 2022
```
* fix FusedResidualDropoutBias nan in v100 (#42344)

* fix lod_tensor_array gc (#42377)
```
3b2bc0a0

[cherry-pick 2.3] Add fused_multi_transformer op to optimize transformer... · 50bfe420

由 WangXi 提交于 4月 29, 2022

[cherry-pick 2.3] Add fused_multi_transformer op to optimize transformer generation performance (#42311)

* Add fused_multi_transformer op to optimize transformer generation performance (#41814)

* fix fused_multi_transformer compile failed in cuda arch < sm53 (#42315)

* fix ci timeout

50bfe420

Z
[cherry-pick] Fix bug of building InferMetaContext (#42211) (#42399) · 765fbb59
由 zyfncg 提交于 4月 29, 2022
```
* fix bug of building InferMetaContext (#42211)

* add unitest
```
765fbb59

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功