- 13 1月, 2023 11 次提交
-
-
由 ykkk2333 提交于
-
由 jameszhang 提交于
* fix xpu unittest issue: zero_dim_tensor * deal with leftout issue introduced by #49470
-
由 Leo Guo 提交于
-
由 wangshengxiang 提交于
-
由 zyfncg 提交于
* generate static graph code of stack, unbind, unique_consecutive op * fix bug
-
由 Wilber 提交于
-
由 wangzhen38 提交于
* [cpplint fix] under ps
-
由 Weilong Wu 提交于
* [PHI] rrelu add yaml * polish * polish
-
由 HongyuJia 提交于
-
由 zhangkaihuo 提交于
-
由 Yuanle Liu 提交于
-
- 12 1月, 2023 11 次提交
-
-
由 xiaoxiaohehe001 提交于
-
由 sunli 提交于
* lerp support 0 Tensor * fix lerp grad * fix lerp zero test * fix 0D + ND/ND + 0D * fix check * update code * fix lerp infer shape * static backward test * updata static graph test
-
由 Wen Sun 提交于
* refactor: migrate comm checks * refactor: add check in comm context * feat: add gloo static check * refactor: add place param in static check
-
由 jameszhang 提交于
* Fix reduce func bug in process_group_bkcl Also catch up with a recent process_group PR that failed to add XPU branch. Note that reduce is still accomplished by allreduce for xpu. Fix this should xccl lib be updated. * fix compile issue for non-XPU
-
由 YuanRisheng 提交于
-
由 gem5 提交于
-
由 xiaoxiaohehe001 提交于
-
由 wenbin 提交于
* compile fix * fix compile * compile fix * add more preln
-
由 Leo Guo 提交于
xpu2_op_list.cc. test=kunlun
-
由 jiangcheng 提交于
-
由 YuanRisheng 提交于
* rename kernel * delete sig * modify code according comment * fix ci bugs
-
- 11 1月, 2023 7 次提交
-
-
由 Yiqun Liu 提交于
* Implement a common PointerArray. * Polish codes. * Add including of header file. * Add the branch of kFix8. * Fix compiling error. * Add alignas hint to fix the performance drop. * Optimize the H2D copy in stack_grad. * Rename the macro. * Fix align hint for different compilers. * Polish the define of PADDLE_ALIGN. * Fix compiling error. * Remove the align hint on windows.
-
由 zhangxin81 提交于
* fix paddle_infer_contrib include
-
由 niuliling123 提交于
-
由 kangguangli 提交于
-
由 duanyanhui 提交于
-
由 wenbin 提交于
* compile fix * fix compile * compile fix
-
由 Wangzheee 提交于
-
- 10 1月, 2023 11 次提交
-
-
由 limingshu 提交于
* add stack grad kernel optimization * add basic optimization kernel for stack_grad_kernel * optimization of stack_grad_kernel for last dim stack and change code format with pre-commit
-
由 Wen Sun 提交于
* refactor: gloo comm context migration * fix: headers & avoid mutable_data usage * fix: cmake gloo dep * style: rename funcs * refactor: move to new files * fix: gloo deps * refactor: simplify create device
-
由 FlyingQianMM 提交于
* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss * [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor
-
由 xiaoxiaohehe001 提交于
-
由 xiaoxiaohehe001 提交于
-
由 xiaoxiaohehe001 提交于
-
由 Ryan 提交于
* try sequence_padding * fix cant use mutable_data * fix mistake fluid_sequence_scale.hh/CMakeLists.t include * fix namespace bug * fix framework::ToAbsOffset not found * fix codestyle
-
由 wangxiaoning 提交于
* fix graph_brpc_server.h code style * fix ps_local_client.h * fix ps_local_client.cc
-
由 张春乔 提交于
-
由 Yuang Liu 提交于
-
由 Sanbu 提交于
-