- 27 4月, 2023 1 次提交
-
-
由 mengziheng 提交于
* add pad op * add_some_code * modify some code * add some code * add some code * modify some code * add some code * modify some code * Update composite_backward_api.h * modify some code * add some code * add some code * add some code
-
- 26 4月, 2023 13 次提交
-
-
由 zhouweiwei2014 提交于
-
由 mhy-666 提交于
* add scatter_nd_add comp * add scatter_nd_add prim * fix * fix * add public_python_api in TestScatterNdAddSimpleOp setup function * fix composite_backward_api.h * fix composite_backward * add test cases * fix composite_backward_api.h, unittest
-
由 Ruibiao Chen 提交于
* Fix fused_attention_op and fused_feedforward_op bugs in xpu * Fix d_x alloc errors for fused_feedforward_grad_kernel
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
由 sneaxiy 提交于
* optimize embedding deterministic mode * fix compile error * change FLAGS_cudnn_deterministic to int64 * fix 700 error * add ut * fix ut * fix ut * fix win32 ci * fix flags with PHI_DEFINE_EXPORTED_int64
-
由 engineer1109 提交于
-
由 陈沧夜 提交于
-
由 denglianbin 提交于
-
由 denglianbin 提交于
-
由 Lucas 提交于
[Bug Fixs] fix bugs when using cast<int64_t, int32_t> in xpu/cross_entropy kernels, *test=kunlun (#53325)
-
由 risemeup1 提交于
* Optimize prompt information * add_information * add_information
-
由 Wang Xin 提交于
-
由 huangjiyi 提交于
* update * update
-
- 25 4月, 2023 17 次提交
-
-
由 lzydev 提交于
* support register single .cu file * add register GPU kernel function
-
由 ccrrong 提交于
-
由 sprouteer 提交于
-
由 wuhuachaocoding 提交于
-
由 Yuanle Liu 提交于
-
由 huangjiyi 提交于
* update * fix bug * Revert "affine_channel_op"
-
由 Zero Rains 提交于
* create KernelMinMax to optimize the performance of histogram op in GPU * change to block and warp wise operation * remove the time in DtoH * fix a bug
-
由 YuanRisheng 提交于
* add flags for phi * fix compile bugs * fix ci bugs * fix inference bugs * fix cinn' bugs * fix cinn bugs * perfect code according comment * fix ci bugs * fix ci bugs
-
由 Chitsing KUI 提交于
* print modifed flags * fix ref, opt print * fix default getter * fix ut
-
由 cyberslack_lee 提交于
-
由 shaojie_wang 提交于
* fix shared memory over usage in embedding grad kernel on determistic mode * use IdT as interger dtype
-
由 Galaxy1458 提交于
-
由 zhangyikun02 提交于
-
由 zhoutianzi666 提交于
* add ```converter_type``` for op converter
-
由 Difer 提交于
* add fp_bf for pool_max_withidx * fix some error * fix error * codestyle error * fix masktype * fix input bf type * input bf dtype convert error * back to convert input to bf16 first * fix convert error * fix bf16 grad check
-
由 Bo Zhang 提交于
-
由 Ruibiao Chen 提交于
-
- 24 4月, 2023 9 次提交
-
-
由 zhouweiwei2014 提交于
-
由 Leo Chen 提交于
-
由 Wang Xin 提交于
-
由 niuliling123 提交于
-
由 YangQun 提交于
* support 0d tensor for shape and squeeze onednn kernel * set python api for shape op ut
-
由 Zhang Zheng 提交于
* Fix the calculation of layer_norm_bwd * fix
-
由 zhupengyang 提交于
-
由 zyfncg 提交于
-
由 Yuanle Liu 提交于
-