提交 · 1bd468e24dae9ae1b2793253232bc7a78ec840ef · PaddlePaddle / Paddle

27 4月, 2023 6 次提交
- J
  
  Hack__getitem__ from 0-d to 1-d with FLAGS_set_to_1d (#53358) · 1bd468e2
  由 JYChen 提交于 4月 27, 2023
  
  1bd468e2
- E
  
  fix softmax assert error (#53360) · c50f5fa4
  由 engineer1109 提交于 4月 27, 2023
  
  c50f5fa4
- G
  remove some [-Wunused-parameter] warning (#53365) · 0fac3281
  由 Galaxy1458 提交于 4月 27, 2023
```
* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop
```
  0fac3281
- H
  [XPU] remove scale_loss in parallel.py (#53337) · 2e1ac529
  由 houj04 提交于 4月 27, 2023
```
* [XPU] remove scale_loss in parallel.py

* [XPU] throw Unimplemented when using Reducer
```
  2e1ac529
- S
  
  【Hackathon No.55】add fmax BF16 test (#51925) · 8a6ad6e5
  由 superwinner1 提交于 4月 27, 2023
  
  8a6ad6e5
- C
  
  【Hackathon4】No5 nextafter (#52544) · 82ac3913
  由 cyberslack_lee 提交于 4月 27, 2023
  
  82ac3913
26 4月, 2023 6 次提交
- R
  Fix fused_attention_op and fused_feedforward_op bugs in xpu (#53318) · 1164626c
  由 Ruibiao Chen 提交于 4月 26, 2023
```
* Fix fused_attention_op and fused_feedforward_op bugs in xpu

* Fix d_x alloc errors for fused_feedforward_grad_kernel
```
  1164626c
- G
  remove some [-Wunused-parameter] waring (#53319) · f9e5072b
  由 Galaxy1458 提交于 4月 26, 2023
```
* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop
```
  f9e5072b
- S
  Optimize c_embedding op in deterministic mode (#53197) · 35f5c245
  由 sneaxiy 提交于 4月 26, 2023
```
* optimize embedding deterministic mode

* fix compile error

* change FLAGS_cudnn_deterministic to int64

* fix 700 error

* add ut

* fix ut

* fix ut

* fix win32 ci

* fix flags with PHI_DEFINE_EXPORTED_int64
```
  35f5c245
- D
  
  【Hackathon No.48】为 Paddle determinant 算子实现 float16 数据类型支持 (#53286) · 2a705b74
  由 denglianbin 提交于 4月 26, 2023
  
  2a705b74
- D
  
  【Hackathon No.48】为 Paddle meshgrid 算子实现 float16 数据类型支持 (#53284) · 9127cc3c
  由 denglianbin 提交于 4月 26, 2023
  
  9127cc3c
- L
  [Bug Fixs] fix bugs when using cast<int64_t, int32_t> in xpu/cross_entropy... · 1d549400
  由 Lucas 提交于 4月 26, 2023
```
[Bug Fixs] fix bugs when using cast<int64_t, int32_t> in xpu/cross_entropy kernels, *test=kunlun (#53325)
```
  1d549400
25 4月, 2023 7 次提交
- Z
  【PaddlePaddle Hackathon 4 No.33】为 Paddle 优化 Histogram op 在 GPU 上的计算性能 (#53112) · c1a61fc0
  由 Zero Rains 提交于 4月 25, 2023
```
* create KernelMinMax to optimize the performance of histogram op in GPU

* change to block and warp wise operation

* remove the time in DtoH

* fix a bug
```
  c1a61fc0
- Y
  [PHI]Add flags macro for PHI (#52991) · 22e96bde
  由 YuanRisheng 提交于 4月 25, 2023
```
* add flags for phi

* fix compile bugs

* fix ci bugs

* fix inference bugs

* fix cinn' bugs

* fix cinn bugs

* perfect code according comment

* fix ci bugs

* fix ci bugs
```
  22e96bde
- C
  
  【Hackathon No.61】min 算子FP16/BF16单测完善 (#52887) · d7a5e900
  由 cyberslack_lee 提交于 4月 25, 2023
  
  d7a5e900
- fix shared memory over usage in embedding grad kernel on deterministic mode (#53247) · 6f684bd2
  由 shaojie_wang 提交于 4月 25, 2023
```
* fix shared memory over usage in embedding grad kernel on determistic mode

* use IdT as interger dtype
```
  6f684bd2
- Z
  
  tile op support 0D input for xpu (#53237) · 336bc20b
  由 zhangyikun02 提交于 4月 25, 2023
  
  336bc20b
- D
  【Hackathon No57】add fp16 & bf16 for max_pool2d_with_index, max_pool3d_with_index (#52314) · 46951224
  由 Difer 提交于 4月 25, 2023
```
* add fp_bf for pool_max_withidx

* fix some error

* fix error

* codestyle error

* fix masktype

* fix input bf type

* input bf dtype convert error

* back to convert input to bf16 first

* fix convert error

* fix bf16 grad check
```
  46951224
- B
  
  add syncthreads (#53149) · b7565222
  由 Bo Zhang 提交于 4月 25, 2023
  
  b7565222
24 4月, 2023 13 次提交
- [Zero-Dim] Support paddle.max output 0D, test=allcase (#53242) · 9f9cd919
  由 zhouweiwei2014 提交于 4月 24, 2023
  
  9f9cd919
- L
  
  fix dist_grad kernel (#53239) · ddd72039
  由 Leo Chen 提交于 4月 24, 2023
  
  ddd72039
- W
  
  fix 'Werror-maybe-uninitialized' compiler error in GCC 11.3 (#53246) · 21508090
  由 Wang Xin 提交于 4月 24, 2023
  
  21508090
- Y
  [Zero-Dim] support 0d tensor for shape and squeeze onednn kernel (#52832) · c0a604e7
  由 YangQun 提交于 4月 24, 2023
```
* support 0d tensor for shape and squeeze onednn kernel

* set python api for shape op ut
```
  c0a604e7
- Z
  Fix the calculation of layer_norm_bwd (#53224) · a0aff194
  由 Zhang Zheng 提交于 4月 24, 2023
```
* Fix the calculation of layer_norm_bwd

* fix
```
  a0aff194
- Z
  
  fix compile bug of kps (#53251) · ae426b78
  由 zyfncg 提交于 4月 24, 2023
  
  ae426b78
- Y
  
  fix static_assert with no message (#53222) · 71474b10
  由 Yuanle Liu 提交于 4月 24, 2023
  
  71474b10
- G
  add 0D support for trace (#53208) · 9d90738c
  由 GGBond8488 提交于 4月 24, 2023
```
* add 0D support for trace, test=allcase

* fix trace gpu kernel 0d error, test=allcase

* fix windows error, test=allcase
```
  9d90738c
- S
  Add weighted sample (#52013) · 6a8d98e0
  由 Siming Dai 提交于 4月 24, 2023
```
Add paddle.geometric.weighted_sample_neighbors API
```
  6a8d98e0
- S
  Move fused feedforward xpu (#53196) · 83c2e682
  由 Sonder 提交于 4月 24, 2023
```
* add sig file

* trans fused feedforward compute function to phi

* remove fluid include

* delete old register info

* fix build error

* trans fused feedforward grad xpu to phi
```
  83c2e682
- C
  
  shared_external mermory add xpu (#53240) · d71615dc
  由 csy0225 提交于 4月 24, 2023
  
  d71615dc
- Z
  
  [Sparse]fix bug in paddle.sparse.transpose and paddle.sparse.reshape (#53038) · 15251291
  由 Zhan Rongrui 提交于 4月 24, 2023
  
  15251291
- G
  remove some [-Wunused-parameter] (#53185) · 834eb2ba
  由 Galaxy1458 提交于 4月 24, 2023
```
* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test ,test=develop
```
  834eb2ba
23 4月, 2023 4 次提交
- Z
  delete overwrite from gather_grad (#52707) · a32c1391
  由 zhangyuqin1998 提交于 4月 23, 2023
```
* delete overwrite from gather_grad

* fix

* Update gather_grad_kernel.cc
```
  a32c1391
- H
  [XPU] fc use int_with_ll_t (#53183) · 7634a18a
  由 houj04 提交于 4月 23, 2023
```
* [XPU] fc use int_with_ll_t

* fix test_unbind_op_xpu
```
  7634a18a
- Z
  delete axis from elementwise_grad (#53202) · a3cd9cb9
  由 zhangyuqin1998 提交于 4月 23, 2023
```
* remove axis from elementwise_grad

* Update elementwise_sig.cc
```
  a3cd9cb9
- G
  remove some [-Wunused-parameter] (#53162) · b02687cc
  由 Galaxy1458 提交于 4月 23, 2023
```
* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop
```
  b02687cc
22 4月, 2023 1 次提交

[Zero-Dim] support output 0D for... · b406a7db

由 wangfengsheng1999 提交于 4月 22, 2023

[Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase (#52850)

* [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase

* [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase

* add test case

* modify dot/metric.accuracy/static.accuracy/static.auc

* modfiy inner/tensordot bug

* test 9 api

* [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase

* fix bug

* support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy

* code style

* fix bug

* fix test_dot_op bug

* fix accuracy bug

* fix bug

* fix bug

* fix bug

* fix bug

* codestyle

* fix dot bug

* fix dot bug

* fix dot bug

* code style

* fix dot bug

* fix dot bug

* fix dot bug

* fix dot bug

* fix dot bug

* fix dot bug

* modify code

b406a7db

21 4月, 2023 3 次提交

support 0-D output and 0-D as indice in __getitem__/__setitem__ (#52814) · 4e939c89

由 JYChen 提交于 4月 21, 2023

* support 0-D output and 0-D as indice in __getitem__

* fix tests

* fix inference and UT

* add unittest for setitem

* fix xpu test

* fix xpu 0-d

4e939c89

add deterministic embedding grad kernel (#50494) · 017254d6

由 Shijie 提交于 4月 21, 2023

* add deterministic embedding grad kernel

* minor change

* minor change

* Add new FLAG to enable deterministic embedding

* Update embedding deterministic kernel

017254d6

C

Add trace tests (#52954) · 3371747d
由 co63oc 提交于 4月 21, 2023

3371747d

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功