- 15 Aug, 2023 12 commits
-
-
Committed by cyberslack_lee
-
Committed by 傅剑寒
-
Committed by iSerendipity
* [dtype] add fp16 support for dist_kernel * fix typo * fix CE * fix CE * fix CE * fix CE * fix CE * refactor * fix CE * fix CE * fix varname * add bf16 * add ut for bf16 * fix CE
-
Committed by lzydev
* Improve GC for pipeline parallel * Delete print * fix bug of nop_op and sharding --------- Co-authored-by: chenruibiao <chenruibiao@baidu.com>
-
Committed by hong
* support new ir load combine * update * polish code * remove print * polish code * fix bug * polish code * fix compile bug
-
Committed by gouzil
[clang-tidy] Open clang-analyzer-unix.Vfork, clang-analyzer-security.insecureAPI.vfork Check (#56253)
-
Committed by zhaoyingli
-
Committed by Ruibin Cheung
* [Fluid] Migrate number_count to PHI * fix out alloc * fix ut (add python_api)
-
Committed by LiYuRio
-
Committed by 傅剑寒
-
Committed by WangZhen
-
Committed by zhangbo9674
* refine code * refine code * refine code
-
- 14 Aug, 2023 23 commits
-
-
Committed by niuliling123
-
Committed by HongyuJia
* [CustomOP] InferDtypeFn supports attrs * Update paddle/fluid/framework/custom_operator.cc * update cmake list * fix cpu device * change unittest time
-
Committed by LielinJiang
-
Committed by gouzil
-
Committed by cyberslack_lee
-
Committed by cyberslack_lee
-
Committed by cyberslack_lee
-
Committed by gouzil
-
Committed by LiYuRio
-
Committed by 周波涛
-
Committed by xiaoguoguo626807
* op_vjp generate template * op_vjp_gen templete * delete print Co-authored-by: Aurelius84 <zhangliujie@baidu.com> --------- Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
-
Committed by iSerendipity
* [AMP] add fp16 op detection for custom device * resolve conflicts
-
Committed by ronnywang
-
Committed by zhangbo9674
* add code * fix bug * fix bug * delete unused code * refine code * fix bug * fix bug * fix bug * fix bug * fix bug
-
Committed by kangguangli
* add ir output check in OpTest * add ir grad check in op test * fix bug in output check * trigger CI * test gpu ci * trigger CI * trigger CI * add white list to relax precision check for some tests * relax timeout of test_concat_op * relax timeout of test_concat_op
-
Committed by tianshuo78520a
-
Committed by jiangfan06
-
Committed by ronnywang
-
Committed by Sonder
-
Committed by kangguangli
-
Committed by MarDino
* add rmsnorm residual bias add and quant * refine python interface * add rmsnorm unittest * Add layernorm * fix layernorm unittest * refine unittest * fix example code * fix review comment
-
Committed by liuruyan
* Add ShapeDialect dict & SymbolicDimOp without UT. * add unittest and fix Update_xxx_Func. * change std::string to const std::string & and remove phi dependency.
-
Committed by Yichen Zhang
* add reshape spmd rule * add unit test for reshape spmd rule * bug fix * replace the print_info function with to_string * fix typo * bug fix * add handling for "0" in target shape * remove the part of computing size in dim_trans.cc
-
- 11 Aug, 2023 5 commits
-
-
Committed by umiswing
* Add enforce for sparse_bn. * Add enforce for sp conv.
-
Committed by ronnywang
-
Committed by Meteor Liu
* [dy2static]implement tensor.cuda() in static graph * [dy2static]implement tensor.cuda() in static graph - change the patch place * [dy2static]implement tensor.cuda() in static graph - fix code-block in comment * [dy2static]implement tensor.cuda() in static graph - add ut for warning branch
-
Committed by Yiqun Liu
* Fix the shape of input sin and cos for fused_rope. * Update shape in unittest.
-
Committed by Bo Zhang
Add IndexType to the IR builtin dialect
-