- 31 Mar, 2023 18 commits
-
-
Committed by csy0225
-
Committed by HongyuJia
-
Committed by HongyuJia
* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output * delete custom_inplace_setup.py * [CustomOP Optional Inplace] Custom operator supports inplace optional Tensor input * fix bug for vector<Tensor> inplace test
-
Committed by Ainavo
-
Committed by YuanRisheng
-
Committed by YuanRisheng
* remove distribute * fix py3 bugs * fix gpu-ps bugs * fix compile bugs * fix unittest bugs
-
Committed by ronnywang
-
Committed by Wangzheee
* fix skiplayernorm, add trt_version check
-
Committed by engineer1109
fix with auto&
-
Committed by HongyuJia
-
Committed by wuhuachaocoding
* update bfp16 & fp16 test. * update. --------- Co-authored-by: Roc <lrslj@live.com>
-
Committed by kangguangli
* update run checks of standalone executor * remove mlu related code
-
Committed by xiaoguoguo626807
* add to sub & delete full scale * decrease 1_div_shape_2 compute * x_sub_mean_mul_sqrt_var_1 * delete log * add mean var test * nothing
-
Committed by houj04
-
Committed by 张春乔
* autofix Co-authored-by: Liyulingyue <83450930+Liyulingyue@users.noreply.github.com> * revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py * empty commit, trigger ci * fix test_slice --------- Co-authored-by: SigureMo <sigure.qaq@gmail.com>
-
Committed by zhupengyang
-
Committed by sneaxiy
-
Committed by Yuang Liu
-
- 30 Mar, 2023 22 commits
-
-
Committed by Weilong Wu
* support layer_norm prim and cinn test * enable cinn test * fix merge conflict * polish input for check_output_with_place * fix merge conflict * add more test case * fix merge conflict * polish test case * polish op_test * change ln_g rules * modify scale is none case * modify scale is none case * add public_python_api for check prim * modify setoutputgrad and fp64bug * add todo & delete log * recover * fix some errors * recover * recover * recover * recover * fix merge conflicts --------- Co-authored-by: wangruting <wangruting@baidu.com>
-
Committed by cyber-pioneer
* fix_prim * fix bug * add note * fix logic * fix * add note * fix check * fix bug * fix bug * fix bug * add debug * fix check * fix bug * sync print log * fix test case * change default * change test case time
-
Committed by zhangyuqin1998
* move elementwise raw * fix * fix
-
Committed by zhouweiwei2014
-
Committed by zhouweiwei2014
-
Committed by Wang Xinyu
* transpose fp16 test * transpose auto tune fp16 test
-
Committed by zhangkaihuo
-
Committed by zhupengyang
-
Committed by Zheng-Bicheng
* update * update
-
Committed by Kim
* mv paddle/fluid/platform/device/xpu/tests to test/xpu/cpp * add missing cmake
-
Committed by Zheng-Bicheng
* update * update * update
-
Committed by Feiyu Chan
1. add a type caster for paddle's complex type, so that pybind automatically casts it to and from python's complex type; 2. add complex64 and complex128 data types for `libpaddle.Tensor`'s element get and set (required to perturb an element to get the numerical derivative); 3. add support for cuda pinned place in `libpaddle.Tensor` element get and set; 4. fix a bug in op code generation (the output folder was created concurrently with parsing op yamls).
-
Committed by Roc
-
Committed by Yiqun Liu
* [AMP] Add python API for collecting operator stats. * Fix import and polish codes. * Add more unittest. * Add doc for the new APIs.
-
Committed by Wang Xin
* add autogen code support for spectral_norm * bug fixed * fix PR-CI-Static-Check fail
-
Committed by pangengzheng
* support run haokanctr model in heterps-models * polish setup.py * polish JVM_LIB in evn_dict * align infer auc with DistPsArch pre-stable * async and multi thread data feed * rewrite dense tensor initialization * async infer shape and reuse memory
-
Committed by Yuanle Liu
-
Committed by Vvsmile
* adjust default tolerance of output and grad * fix a bug in the grad of OpTest * fix the type of setting default value in optest, both forward and backward * add default * fix test_sum_op * fix test_sum_op test for testing add_n * modify the add_n op_test
-
Committed by zxcd
* add scatter composite rule. * add public_python_api * add python uint16 support. * fix code style. * add cinn to makelist * cinn does not support uint16; disable cinn when dtype==uint16.
-
Committed by ykkk2333
-
Committed by huangjiyi
* update assign_pos * update attention_lstm * update barrier * update batch_fc * update beam_search * update beam_search_decode * update bilateral_slice * fix bug * Handle Structure kernel for InterpreterCore::RunOperator * fix bug * fix rocm compile * fix rocm compile * Revert "fix rocm compile" * test * revert test and update cmake --------- Co-authored-by: chenruibiao <chenruibiao@baidu.com>
-
Committed by zhupengyang
-