- 25 4月, 2023 5 次提交
-
-
由 cyberslack_lee 提交于
-
由 shaojie_wang 提交于
* fix shared memory over usage in embedding grad kernel on determistic mode * use IdT as interger dtype
-
由 zhangyikun02 提交于
-
由 Difer 提交于
* add fp_bf for pool_max_withidx * fix some error * fix error * codestyle error * fix masktype * fix input bf type * input bf dtype convert error * back to convert input to bf16 first * fix convert error * fix bf16 grad check
-
由 Bo Zhang 提交于
-
- 24 4月, 2023 13 次提交
-
-
由 zhouweiwei2014 提交于
-
由 Leo Chen 提交于
-
由 Wang Xin 提交于
-
由 YangQun 提交于
* support 0d tensor for shape and squeeze onednn kernel * set python api for shape op ut
-
由 Zhang Zheng 提交于
* Fix the calculation of layer_norm_bwd * fix
-
由 zyfncg 提交于
-
由 Yuanle Liu 提交于
-
由 GGBond8488 提交于
* add 0D support for trace, test=allcase * fix trace gpu kernel 0d error, test=allcase * fix windows error, test=allcase
-
由 Siming Dai 提交于
Add paddle.geometric.weighted_sample_neighbors API
-
由 Sonder 提交于
* add sig file * trans fused feedforward compute function to phi * remove fluid include * delete old register info * fix build error * trans fused feedforward grad xpu to phi
-
由 csy0225 提交于
-
由 Zhan Rongrui 提交于
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test ,test=develop
-
- 23 4月, 2023 4 次提交
-
-
由 zhangyuqin1998 提交于
* delete overwrite from gather_grad * fix * Update gather_grad_kernel.cc
-
由 houj04 提交于
* [XPU] fc use int_with_ll_t * fix test_unbind_op_xpu
-
由 zhangyuqin1998 提交于
* remove axis from elementwise_grad * Update elementwise_sig.cc
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
- 22 4月, 2023 1 次提交
-
-
由 wangfengsheng1999 提交于
[Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase (#52850) * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase * add test case * modify dot/metric.accuracy/static.accuracy/static.auc * modfiy inner/tensordot bug * test 9 api * [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase * fix bug * support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy * code style * fix bug * fix test_dot_op bug * fix accuracy bug * fix bug * fix bug * fix bug * fix bug * codestyle * fix dot bug * fix dot bug * fix dot bug * code style * fix dot bug * fix dot bug * fix dot bug * fix dot bug * fix dot bug * fix dot bug * modify code
-
- 21 4月, 2023 7 次提交
-
-
由 JYChen 提交于
* support 0-D output and 0-D as indice in __getitem__ * fix tests * fix inference and UT * add unittest for setitem * fix xpu test * fix xpu 0-d
-
由 Shijie 提交于
* add deterministic embedding grad kernel * minor change * minor change * Add new FLAG to enable deterministic embedding * Update embedding deterministic kernel
-
由 co63oc 提交于
-
由 co63oc 提交于
-
由 umiswing 提交于
-
由 Yuang Liu 提交于
-
- 20 4月, 2023 5 次提交
-
-
由 zhangyuqin1998 提交于
* setup * Update elementwise_kernel.cc * Update elementwise_kernel.cc * fix * fix * Update elementwise_kernel.cu * fix * Update elementwise_kernel.cc * Update elementwise_kernel.cc * Update elementwise_kernel.cc * Update elementwise_kernel.cc * Update elementwise_kernel.cc * Update elementwise_kernel.cc
-
由 Chitsing KUI 提交于
* add flash randomness control * fix VLOG undefied
-
由 zhouweiwei2014 提交于
-
由 Wang Xin 提交于
* remove ASCEND* keyword * update docstring * bug fixed * bug fixed
-
由 co63oc 提交于
-
- 19 4月, 2023 5 次提交
-
-
由 Sonder 提交于
* trans fused attention to phi * add optional parm * trans fused_attention_grad to phi * add fused attention grad register info * fix include * test=kunlun * add fused attention to static build list * add remove * update remove
-
由 zhangyikun02 提交于
-
由 houj04 提交于
-
由 limingshu 提交于
* first commit * restruct c++ interface to divide linear from matmulwithcublaslt * finish building in cublaslt impl * fix code bugs * fix host cost * add some changes
-
由 zhangyuqin1998 提交于
* fix graph_reindex * fix * Update op_compat.yaml
-