- 10 8月, 2023 5 次提交
-
-
由 lzy 提交于
* add variable_length_memory_efficient_attention * update variable_length_memory_efficient_attention unittest * update variable_length_mem_eff_attn's docs and unittest * update variable_length_mem_eff_attn's docs * Update test_variable_length_memory_efficient_attention.py * Update variable_length_memory_efficient_attention.cu * fix codestyle * fix variable_length_fmha's docs and unittest * fix variable_length_fmha's docs
-
由 lxd-cumt 提交于
* decompose tanh_triple_grad and add it into prim_white_list test=develop * fix TanhTripleGradKernel bugs test=develop * decompose tanh_triple_grad test=develop
-
由 csy0225 提交于
-
由 xiaoye 提交于
-
由 RuohengMa 提交于
* [XPU kernel] fix warpctc issue * fix issue * temporal hack to circumvent depthwise_conv2d precision issue * reset test case
-
- 09 8月, 2023 17 次提交
-
-
由 xiaoguoguo626807 提交于
* [prim][newir] add basic framework for primitive * support desctensor in new ir * add vjp interface * support vjp in new ir * support vjp in new ir * polish vjp interface * fix stop_gradients set * fix vjp dispatch * add comment * add vjp test for new ir * add test for tanh vjp * [prim][newir] add basic framework for primitive * support desctensor in new ir * support vjp in new ir * support vjp in new ir * polish vjp interface * fix stop_gradients set * fix vjp dispatch * add comment * add vjp test for new ir * add test for tanh vjp * add eager and static backend for warp lower level api * support call_vjp pybind * polish code and add test for vjp * remove useless code * polish code * remove useless code * support mean vjp * backward origin code * add test for mean vjp and support has_vjp function * fix call_vjp * polish code * add attrs and dtype interface * add primitive ops set for backend * fix compile bugs * fix some bugs * fix windows bugs * add vjp test for tanh_ * fix inference CI * fix inference ci * modify fluid cmake * origin test of tanh and mean passed * fix conflict * modify stop_gradient * modify block.ops * modify test * fix conflict * reply review comments * reply review comments * pulish code --------- Co-authored-by: Ncxxly <chenxx_id@163.com> Co-authored-by: NCharles-hit <wanghao107@baidu.com> Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com> Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
-
由 xiaoguoguo626807 提交于
* refine program translator * fix warning: not override * fix bug * merge new modifications * modify by reviews * resolve conflicts * resolve conflicts * fix * fix * fix conflicts * pseudocode of backward * modify test * modify register op * clear other code * modify ci build bug * reply review comments * reply review comments --------- Co-authored-by: Nkangguangli <kangguangli@hotmail.com>
-
由 zyfncg 提交于
-
由 xiaoxiaohehe001 提交于
* fix_softmax_eigen * fix_ctest_seresnet * fix_ci_error
-
由 xiongkun 提交于
* [New Feature] add _inplace_assign interface for sot * add unittest for inplace_assign
-
由 Xinyu Chen 提交于
* onednn: rename macro to PADDLE_WITH_DNNL * onednn: rename macro to CINN_WITH_DNNL
-
由 risemeup1 提交于
* fix ccache print information * fix ccache print information * fix ccache print information * fix ccache print information * fix ccache print information
-
由 Ruibin Cheung 提交于
-
由 cyberslack_lee 提交于
-
由 niuliling123 提交于
-
由 houj04 提交于
-
由 houj04 提交于
* [XPU] add pos_weight for sigmoid_cross_entropy_with_logits. * update xdnn version.
-
由 Ruibin Cheung 提交于
-
由 kangguangli 提交于
* fix bugs about new ir test * enable dy2st newir test in all cases * fix
-
由 ronnywang 提交于
-
由 zhangbo9674 提交于
* fix bug * fix bug
-
由 Aurelius84 提交于
* [NewIR]Add cinn RuntimeDialect and JitKernelOp * remove PointerAttribute register * fix comment
-
- 08 8月, 2023 16 次提交
-
-
由 Wang Xin 提交于
* move decayed_adagrad_op to phi * fix bug
-
由 huangjiyi 提交于
-
由 zyfncg 提交于
-
由 leolishaohao 提交于
-
由 Ruibiao Chen 提交于
* Improve GC for pipeline parallel * Delete print
-
由 hong 提交于
* update * update cache * fix compile error
-
由 hong 提交于
* new ir remove fetch list * fix pattern rewrite bug * try to remove constant fold * revert code * add pattern rewrite test flag * fix multi fetch * remove usless code * new ir support legacy kernel instraction * new ir support legacy kernel instruction * add scope prefix * update * update * update * update * fix * revert channel shuffl test * polish code * try to fix windows compile error * polish code * update * update * revert op test
-
由 ronnywang 提交于
-
由 freeliuzc 提交于
-
由 jjyaoao 提交于
* Provide opoperands(), opresults() methods for the Operation module Signed-off-by: Njjyaoao <jjyaoao@126.com> * Update test_ir_pybind.py --------- Signed-off-by: Njjyaoao <jjyaoao@126.com>
-
由 hong 提交于
* fix new ir sgd op bug * fix bug * fix bug * update * revert code
-
由 Charles-hit 提交于
* [prim][newir] add basic framework for primitive * support desctensor in new ir * add vjp interface * support vjp in new ir * support vjp in new ir * polish vjp interface * fix stop_gradients set * fix vjp dispatch * add comment * add vjp test for new ir * add test for tanh vjp * [prim][newir] add basic framework for primitive * support desctensor in new ir * support vjp in new ir * support vjp in new ir * polish vjp interface * fix stop_gradients set * fix vjp dispatch * add comment * add vjp test for new ir * add test for tanh vjp * add eager and static backend for warp lower level api * support call_vjp pybind * polish code and add test for vjp * remove useless code * polish code * remove useless code * support mean vjp * add test for mean vjp and support has_vjp function * fix call_vjp * polish code * add primitive ops set for backend * add vjp test for tanh_ * fix inference CI * fix inference ci * modify fluid cmake * remove useless deps * add cmake --------- Co-authored-by: Ncxxly <chenxx_id@163.com> Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>
-
由 easonchan1874 提交于
-
由 niuliling123 提交于
-
由 hong 提交于
-
由 zhangbo9674 提交于
-
- 07 8月, 2023 2 次提交
-
-
由 yin wei 提交于
* add mask * add backword * add enforce info * update scale * integrate code * update enforce * add enforce eq * add error type * update enforce * add test_flash_attention * Polish codes and fix compiling errors. * Set num_splits to 0 for flash-attn with tensor mask. * Fix the compiling error for non flash-attn case. --------- Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
-
由 LiYuRio 提交于
* make tcp store a global instance * fix windows compile error
-