提交 · 983fcb56211197491750e000154bea7904dbd0cb · Crayon鑫 / Paddle

26 4月, 2022 4 次提交
- W
  
  [Eager] Support numpy.ndarry in CastNumpy2Scalar (#42136) (#42213) · 983fcb56
  由 Weilong Wu 提交于 4月 26, 2022
  
  983fcb56
- fix python3.10 compile bug on windows (#42140) (#42180) · 42297995
  由 zhouweiwei2014 提交于 4月 26, 2022
```
cherry-pick #42140
```
  42297995
- W
  [Eager] Support div(scalar) in eager mode (#42148) (#42214) · a887ffd0
  由 Weilong Wu 提交于 4月 26, 2022
```
* [Eager] Support div scalar in eager mode

* Updated and remove debug logs

* Remove list, use 'or' directly

* Remove useless statement
```
  a887ffd0
- C
  [Cherry-pick] Optimize dygraph performance part2 (#42224) · ab24b9c0
  由 Chen Weihang 提交于 4月 26, 2022
```
* Add paddle::variant and replace paddle::any (#42139)

* add variant and replace any

* split attribute

* Optimize dygraph GetExpectedKernelType perf (#42154)

* opt dygraph scheduling

* revert part impl

* fix variant compile error (#42203)

* replace any by variant in infermeta (#42181)
```
  ab24b9c0
25 4月, 2022 7 次提交
- W
  
  [Eager] Remove redundancy code, fix fp16 case (#42169) (#42215) · e4da34fd
  由 Weilong Wu 提交于 4月 25, 2022
  
  e4da34fd
- J
  update ampere sm (#42023) (#42074) · d5f05bd1
  由 JingZhuangzhuang 提交于 4月 25, 2022
```
* update ampere sm

* update ampere sm

* update ampere sm
```
  d5f05bd1
- B
  fix FlattenContiguousRangeOpConverter out dim error (#42087) (#42184) · 20e8bf1f
  由 baoachun 提交于 4月 25, 2022
```
* fix FlattenContiguousRangeOpConverter out dim error

* update code
```
  20e8bf1f
- K
  
  rm distri env (#41961) (#42167) · 8c3c6dae
  由 kuizhiqing 提交于 4月 25, 2022
  
  8c3c6dae
- Z
  [cherry-pick] Optimize performance of dygraph (#42093, #42103, #42137) (#42171) · 0d537003
  由 zyfncg 提交于 4月 25, 2022
```
* optimiaze performance of PreparePhiData (#42093)

* Dygraph performance optimization (v2) (#42103)

* optimiaze performance of PreparePhiData

* dygraph performance optimization

* optimize performance of dygraph (#42137)
```
  0d537003
- T
  Cherry-pick[41456] Update Mac cmake version >=3.15 (#42193) · 26167969
  由 tianshuo78520a 提交于 4月 25, 2022
```
官网中写到cmake版本最低3.15，更新cmake设置
```
  26167969
- A
  [Cherry-Pick][Performance]Remove CudaStreamSychornize in ClipGradByGlobalNorm... · 58d0d15e
  由 Aurelius84 提交于 4月 25, 2022
```
[Cherry-Pick][Performance]Remove CudaStreamSychornize in ClipGradByGlobalNorm and fix shape op (#42170)

* [Performance]Set ShapeKernel with ALL_BACKEND and ALL_LAYOUT (#42138)

* [Performance]Set ShapeKernel with ALL_BACKEND and ALL_LAYOUT

* [Performance]Set ShapeKernel with ALL_BACKEND and ALL_LAYOUT

* [Performance]Remove CudaStreamSychornize in ClipGradByGlobalNorm (#42132)
```
  58d0d15e
24 4月, 2022 5 次提交
- F
  
  remove redundant computation in Categorical.probs (#42178) · 4feca753
  由 Feiyu Chan 提交于 4月 24, 2022
  
  4feca753
- Z
  
  refine optest logic for bfloat16 (#42151) (#42165) · 5211282d
  由 zhangbo9674 提交于 4月 24, 2022
  
  5211282d
- T
  add build pylayer depend pybind (#42135) · dd4ef244
  由 tianshuo78520a 提交于 4月 24, 2022
```
解决编译依赖失败问题
```
  dd4ef244
- C
  [cherry-pick]Reduce performance influence by record event in python (#42142) · 338fcc10
  由 chenjian 提交于 4月 24, 2022
```
* fix kenrel name apperance (#42071)

* Reduce performance influence by record event in python (#42040)

* optimize performance

* fix

* improve coverage

* fix

* fix
```
  338fcc10
- W
  [Cherry-pick, Eager] Fix CastPyArg2scalar for max value of int64 (#42098) (#42129) · b543998f
  由 Weilong Wu 提交于 4月 24, 2022
```
* [Eager] Fix CastPyArg2scalar for max value of int64 (#42098)

* [Eager] Fix CastPyArg2Scalar in Long case

* Add more test cases for paddle.clip

* Use PyLong_AsLongLong

* Fix merge conflicts
```
  b543998f
23 4月, 2022 1 次提交

[XPUPS]add hashtable interface (#41987) (#42110) · 6ab441bb

由 zmxdream 提交于 4月 23, 2022

* add hashtable interface. test=develop

* update. test=develop

* update. test=develop

* fix. test=develop

* fix optimizer config for xpups. test=develop

* fix. test=develop

* fix. test=develop

6ab441bb

22 4月, 2022 11 次提交
- A
  
  [Eager]Fix SetDeviceId in eager_final_state_api from python_c_gen.py (#42025) (#42067) · 9f3d9381
  由 Aurelius84 提交于 4月 22, 2022
  
  9f3d9381
- 0
  
  Remove wrong check_variable_and_dtype in matrix_rank (#42062) (#42085) · b3d608e2
  由 0x45f 提交于 4月 22, 2022
  
  b3d608e2
- P
  Cherry pick PR41990, add _grad_name and _grad_value for eager tensor (#41990) (#42079) · 3475c2bf
  由 pangyoki 提交于 4月 22, 2022
```
* add _grad_name and _grad_value for eager tensor

* fix paddle_enforce

* fix paddle_enforce 2

* fix grad_name

* _grad_value return lodtensor rather than tensor

* fix
```
  3475c2bf
- Y
  Fix paddle.t doc en and the annotation display on 4 en docs (#41699) · 81468682
  由 Yilingyelu 提交于 4月 20, 2022
```
* gradients; test=document_fix

* fix VarType; test=document_fix

* fix vartype; test=document_fix

* cumsum; test=document_fix

* t; test=document_fix
```
  81468682
- G
  fix bug for MultiplicativeDecay (#41850) · 78d997a8
  由 guguguzi 提交于 4月 19, 2022
```
* fix bug for MultiplicativeDecay

* remove changes to test_lr_scheduler.py
```
  78d997a8
- H
  fix onnxruntime bug (#42095) (#42104) · 26cc5c54
  由 heliqi 提交于 4月 22, 2022
```
修复ORT在batch变动时，输出shape不对问题
```
  26cc5c54
- B
  
  add mkldnn compute_propagate_scales int8 pass (#41592) (#42080) · 41003161
  由 baoachun 提交于 4月 22, 2022
  
  41003161
- J
  
  Add UT (#42055) · 4f6aba87
  由 Jacek Czaja 提交于 4月 22, 2022
  
  4f6aba87
- B
  [Cherry-pick] sharding for eager tensor (#42054) · 6ad0f061
  由 Baibaifan 提交于 4月 22, 2022
```
* sharding_for_eager_tensor (#41415)

* fix_sharding_copy_right (#41849)
```
  6ad0f061
- A
  [IPU] add mixed-precission support for ipu (#41733) (#41906) · c09b1d68
  由 Allen Guo 提交于 4月 22, 2022
```
add mixed-precission support for ipu

cherry-pick from #41733
```
  c09b1d68
- H
  Change CINN tag, prepare for CINN release/v0.2 (#42065) · fd9c7818
  由 Huihuang Zheng 提交于 4月 22, 2022
```
Change CINN Tag to Prepare for CINN release/v0.2. This PR is the cherrypick of #42063
```
  fd9c7818
21 4月, 2022 12 次提交
- W
  
  [Eager] Support numpy.narray as input for eager expand (#42043) (#42064) · ef0b5fdc
  由 Weilong Wu 提交于 4月 21, 2022
  
  ef0b5fdc
- W
  
  [Eager] remove useless logic (#42020) (#42061) · 218e759b
  由 Weilong Wu 提交于 4月 21, 2022
  
  218e759b
- Z
  [Cherry-Pick]Move pass optimizations into CINN. (#42047) (#42070) · 2f2f987c
  由 Zhen Wang 提交于 4月 21, 2022
```
* Move pass optimizations into CINN.
```
  2f2f987c
- R
  [Cherry-pick]Release2.3/fix doc of nms op (#42024) · dbdb56d1
  由 RichardWooSJTU 提交于 4月 21, 2022
```
* fix nms op doc missing default value

* fix nms op doc add blank line
```
  dbdb56d1
- Z
  
  support bce_loss and bce_loss_grad in XPU, test=kunlun (#41610) · b1ba98ca
  由 zhangyikun02 提交于 4月 13, 2022
  
  b1ba98ca
- [cherry-pick]support multi_layer of bilstm,*test=kunlun (#42076) · 58f6d459
  由 z8hanghuan 提交于 4月 21, 2022
```
* modify xpu.cmake,*test=kunlun (#41832)

* modify xpu.cmake,*test=kunlun

* modify xpu.cmake,*test=kunlun

* modify xpu.cmake,*test=kunlun

* modify xpu.cmake,*test=kunlun

* support bilstm,*test=kunlun

* [cherry-pick]support multi_layer of bilstm,*test=kunlun
```
  58f6d459
- L
  [Cherry-pick] fix the bug for nccl barrier and alltoall (#42042) · 8a12f459
  由 lilong12 提交于 4月 21, 2022
```
* fix_nccl_barrier (#41970)

* be compatible with the old version of alltoall (#42007)
Co-authored-by: NBaibaifan <39549453+Baibaifan@users.noreply.github.com>
```
  8a12f459
- L
  
  fix bug for eager mode distributed training (#41841) (#41953) · f5a937eb
  由 lilong12 提交于 4月 21, 2022
  
  f5a937eb
- L
  
  update (#41636) (#41757) · 0ef694ac
  由 lilong12 提交于 4月 21, 2022
  
  0ef694ac
- S
  Fix pipeline in new dygraph (#41937) (#42053) · 7eae6570
  由 ShenLiang 提交于 4月 21, 2022
```
* fix utest

* fix time
```
  7eae6570
- W
  
  fix inf in fused_attention (#41933) (#42032) · 50fd2450
  由 WangXi 提交于 4月 21, 2022
  
  50fd2450
- W
  double accessor and show_scale (#41943) (#42014) · efaef31a
  由 wangguanqun 提交于 4月 21, 2022
```
* double accessor and show_scale

* double accessor and show_scale

* rename

* fix bug in pslib config

* add unittest
```
  efaef31a

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致