- 31 3月, 2023 14 次提交
-
-
由 zhenhailiu 提交于
* gather with doc * resolve comment * polish * polish * code style * polish doc * add_test * polish * polish * add test check * add test check * polish * polish * polish * polish * fix_time_out * polish * fix timeout * fix_timeout * polish * polish * polish * polish * polish
-
由 RedContritio 提交于
-
由 Galaxy1458 提交于
* this is a test pr, test=develop * solve the four [-Wterminate] warning, test=develop * solve the four [-Wterminate] warning, test=develop * new fix [-Wterminate], test=delelop * new fix [-Wterminate], test=delelop * new fix [-Wterminate], test=delelop * new , test = develop * new , test = develop * new , test = develop * new , test = develop * new , test = develop * new , test = develop
-
由 zhangyuqin1998 提交于
-
由 chenjian 提交于
* first commit * add registry * add unit test * fix format * add unit test * fix bug * replace unsuqeeze to reshape * fix * fix unit test * update test * update test * fix unit test * fix * fix
-
由 zyfncg 提交于
* add yaml for some op * fix inplace_abn * fix test_leaky_relu_grad_grad_functor * fix yaml * fix typo
-
由 csy0225 提交于
-
由 HongyuJia 提交于
* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output * delete custom_inplace_setup.py * [CustomOP Optional Inplace] Custom operator supports inplace optional Tensor input * fix bug for vector<Tensor> inplace test
-
由 YuanRisheng 提交于
* remove distribute * fix py3 bugs * fix gpu-ps bugs * fix compile bugs * fix unittest bugs
-
由 ronnywang 提交于
-
由 houj04 提交于
-
由 张春乔 提交于
* autofix Co-authored-by: NLiyulingyue <83450930+Liyulingyue@users.noreply.github.com> * revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py * empty commit, trigger ci * fix test_slice --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 zhupengyang 提交于
-
由 sneaxiy 提交于
-
- 30 3月, 2023 17 次提交
-
-
由 zhangyuqin1998 提交于
* move elementwise raw * fix * fix
-
由 zhouweiwei2014 提交于
-
由 zhangkaihuo 提交于
-
由 Roc 提交于
-
由 Yiqun Liu 提交于
* [AMP] Add python API for collecting operator stats. * Fix import and polish codes. * Add more unittest. * Add doc for the new APIs.
-
由 Wang Xin 提交于
* add autogen code support for spectral_norm * bug fixed * fix PR-CI-Static-Check fail
-
由 pangengzheng 提交于
* support run haokanctr model in heterps-models * polish setup.py * polish JVM_LIB in evn_dict * align infer auc with DistPsArch pre-stable * async and multi thread data feed * rewrite dense tensor intialization * async infer shape and reuse memory
-
由 zxcd 提交于
* add scatter composite rule. * add public_python_api * add python unit16 support. * fix code style. * add cinn to makelist * cinn unsupport uint16, forbidden cinn when dtype==uint16.
-
由 ykkk2333 提交于
-
由 yunyaoXYY 提交于
* add FP16 for multinomial * fix input data * update code * fix FP16 * fix code
-
由 Ainavo 提交于
* support auto generate for prelu * op_compat 中增加输入参数 * del attrs ; add kernel data_type * add PreluGradInferMeta
-
由 Wang Xinyu 提交于
* stride slice fp16 and bf16 unitest * fix code style * add self.dtype
-
由 gouzil 提交于
* add autogen code support for sigmoid_cross_entropy_with_logits * add inplace
-
由 Wang Xin 提交于
* add autogen code support for merge_selected_rows * bug fixed
-
由 Danyang Zhang 提交于
-
由 lzydev 提交于
* change op with xpu * change range yaml * fix bug in generate_op.py
-
由 cyberslack_lee 提交于
[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call and use generator instead of map (#52140) * codestyle c416 c417 * fix error * fix inc * unify all C4 rules into one * fix inc --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 29 3月, 2023 9 次提交
-
-
由 zengshao0622 提交于
* pad3d add unittests of fp16 and bf16 * pad3d add unittests of fp16 and bf16 * fix cuda place * fix random to uniform * fix class name * fix fp16 max relative error to 1.5e-3 * add dytpe register for onednn * add pad uint16 check of common.py * remove check_eager * test_check_grad --> test_check_grad_normal
-
由 Yichen Zhang 提交于
* add group_norm composite rule * add test for scale_grad and bias_grad * resolve conflicts * remove amp in composite_rule.py * add float16 test * deal with NHWC format * keep the composite rule in float16 identical as original kernel * resolve conflicts
-
由 hjyp 提交于
* regist output type for GraphSampleNeighbors and GroupNorm * Update return type * fix return type * update * fix detail
-
由 chenxujun 提交于
-
由 ShenLiang 提交于
* fix bg * add utest * add utest
-
由 xiaoguoguo626807 提交于
* tanh_double_grad_rules * delete log got api_base * modify composite yaml * optimize rules
-
由 zhupengyang 提交于
-
由 yuehuayingxueluo 提交于
* add fuse adamw pass * fix some bugs * fix CIbug * change chunk_size * fix CI bug * rm test_fused_adam_op.py * fix CI bugs * fix fuse_adamw_op_pass.cc * change code style * fix CI bug * fix ut bug and use_adamw_op_pass.cc * fix test_fuse_adamw_pass.py * fix CI bug * remove fluid * fix ci bug * fix CI bug
-
由 zhangyikun02 提交于
-