- 30 Mar, 2023 32 commits
-
-
Committed by Weilong Wu
* support layer_norm prim and cinn test
* enable cinn test
* fix merge conflict
* polish input for check_output_with_place
* fix merge conflict
* add more test case
* fix merge conflict
* polish test case
* polish op_test
* change ln_g rules
* modify scale is none case
* modify scale is none case
* add public_python_api for check prim
* modify setoutputgrad and fp64bug
* add todo & delete log
* recover
* fix some errors
* recover
* recover
* recover
* recover
* fix merge conflicts

---------

Co-authored-by: wangruting <wangruting@baidu.com>
-
Committed by cyber-pioneer
* fix_prim
* fix bug
* add note
* fix logic
* fix
* add note
* fix check
* fix bug
* fix bug
* fix bug
* add debug
* fix check
* fix bug
* sync print log
* fix test case
* change default
* change test case time
-
Committed by zhangyuqin1998
* move elementwise raw
* fix
* fix
-
Committed by zhouweiwei2014
-
Committed by zhouweiwei2014
-
Committed by Wang Xinyu
* transpose fp16 test
* transpose auto tune fp16 test
-
Committed by zhangkaihuo
-
Committed by Zheng-Bicheng
* update
* update
-
Committed by Feiyu Chan
1. add type caster for paddle's complex type, to allow pybind to automatically cast it with python's complex type;
2. add complex64 and complex128 data types for `libpaddle.Tensor`'s element get and set (which is required to perturb an element to get the numerical derivative);
3. add support for cuda pinned place in `libpaddle.Tensor` element get and set;

---

4. fix a bug in op code generation (creation of the output folder concurrent with parsing op yamls).
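The "perturb an element to get the numerical derivative" step mentioned in this commit can be illustrated with a minimal pure-Python sketch. It does not use `libpaddle.Tensor`; the helper names `numeric_grad` and `squared_norm` are hypothetical, and packing the two partial derivatives into one complex number is an assumed convention for illustration, not necessarily the one Paddle's tests use.

```python
def numeric_grad(f, values, delta=1e-6):
    """Central-difference derivative of a real-valued f at each complex element.

    Each element is perturbed along the real axis and then along the
    imaginary axis; the two partial derivatives are returned packed as
    complex(d_real, d_imag). This packing is an illustrative assumption.
    """
    xs = [complex(v) for v in values]
    grad = []
    for i, orig in enumerate(xs):
        parts = []
        for step in (delta, 1j * delta):
            xs[i] = orig + step   # set element: perturb forward
            f_pos = f(xs)
            xs[i] = orig - step   # set element: perturb backward
            f_neg = f(xs)
            parts.append((f_pos - f_neg) / (2 * delta))
        xs[i] = orig              # restore the original element
        grad.append(complex(parts[0], parts[1]))
    return grad


def squared_norm(zs):
    # Real-valued test function: sum of |z|^2 over all elements.
    return sum(abs(z) ** 2 for z in zs)


inputs = [1 + 2j, 3 - 1j]
grad = numeric_grad(squared_norm, inputs)
```

For `squared_norm`, the partial derivatives with respect to the real and imaginary parts of `z` are `2 * z.real` and `2 * z.imag`, so each entry of `grad` should be close to `2 * z`. The sketch shows why per-element get and set on complex data is needed: every element has to be read, overwritten twice per axis, and restored.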
-
Committed by Roc
-
Committed by Yiqun Liu
* [AMP] Add python API for collecting operator stats.
* Fix import and polish codes.
* Add more unittest.
* Add doc for the new APIs.
-
Committed by Wang Xin
* add autogen code support for spectral_norm
* bug fixed
* fix PR-CI-Static-Check fail
-
Committed by Vvsmile
* adjust default tolerance of output and grad
* fix a bug in the grad of OpTest
* fix the type of setting default value in optest, both forward and backward
* add default
* fix test_sum_op
* fix test_sum_op test for testing add_n
* modify the add_n op_test
-
Committed by zxcd
* add scatter composite rule.
* add public_python_api
* add python uint16 support.
* fix code style.
* add cinn to makelist
* cinn does not support uint16; forbid cinn when dtype==uint16.
-
Committed by ykkk2333
-
Committed by zhupengyang
-
Committed by Ghost Screaming
* Support ignore_index for c_softmax_with_cross_entropy_op.
* Polish code. Remove useless comments and add Testcase.
* Polish code for TestCase.
* Polish code.
* Polish code style.
* Polish code.
* Change loss calculation formula and ignore_index dtype.
* Polish TestCase.
* Fix bug of c_softmax_with_cross_entropy_op_xpu_op. Attribute 'ignore_index' dtype is int64_t.
-
Committed by 傅剑寒
* delete check_dygraph and use default atol, max_relative_error
* add test case for bicubic_interp
-
Committed by yunyaoXYY
* add FP16 for multinomial
* fix input data
* update code
* fix FP16
* fix code
-
Committed by kangguangli
* remove sync_calc_stream and sync_comm_stream
* fix ci bug
* fix
* fix
* fix
-
Committed by Zhang Ting
-
Committed by Wang Xinyu
* stride slice fp16 and bf16 unittest
* fix code style
* add self.dtype
-
Committed by gouzil
* [Test Mv] ipu_test
* [Test Mv] cmake add py_test_modules
* [Move Test] rm py_test_modules
* rm asp
-
Committed by zhenhailiu
* add fp16 bfp16 test
* polish
* polish
* polish
-
Committed by zhenhailiu
* polish
* add type check
-
Committed by zhenhailiu
-
Committed by ShenLiang
* fix bug in proto
* add utest
-
Committed by Guoxia Wang
* support python object input data broadcast for model parallel
* add unittest
* fix
* fix concat 0D tensor
* fix codestyle
-
Committed by cyberslack_lee
[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call and use generator instead of map (#52140)
* codestyle c416 c417
* fix error
* fix inc
* unify all C4 rules into one
* fix inc

---------

Co-authored-by: SigureMo <sigure.qaq@gmail.com>
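The two lint rules named in this commit title come from flake8-comprehensions. A small sketch of the kind of rewrite each rule asks for is shown here; the variable names and sample data are made up for illustration and are not taken from the Paddle code base.

```python
numbers = [1, 2, 3]
pairs = [("a", 1), ("b", 2)]

# C416: a comprehension that only copies its input can be replaced
# with a direct call to list(), set(), or dict().
copied_before = [x for x in numbers]
copied_after = list(numbers)

mapping_before = {k: v for k, v in pairs}
mapping_after = dict(pairs)

# C417: map() with a lambda can be replaced with a comprehension
# or a generator expression.
doubled_before = list(map(lambda x: x * 2, numbers))
doubled_after = [x * 2 for x in numbers]

total_before = sum(map(lambda x: x * x, numbers))
total_after = sum(x * x for x in numbers)
```

In each pair the "before" and "after" forms produce the same value; the rewritten form is shorter and avoids the extra lambda call per element.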
-
Committed by jjyaoao
-
Committed by wanghuancoder
* delete old dygraph op test
-
Committed by yuehuayingxueluo
* add gloo send_recv
* fix code_style
* fix CI bug
* fix send_recv.cc
* add send_recv without sync_op
* fix send_recv test
* fix gather.cc
-
- 29 Mar, 2023 8 commits
-
-
Committed by Guanghua Yu
-
Committed by zengshao0622
* pad3d add unittests of fp16 and bf16
* pad3d add unittests of fp16 and bf16
* fix cuda place
* fix random to uniform
* fix class name
* fix fp16 max relative error to 1.5e-3
* add dtype register for onednn
* add pad uint16 check of common.py
* remove check_eager
* test_check_grad --> test_check_grad_normal
-
Committed by Zheng-Bicheng
-
Committed by Yichen Zhang
* add group_norm composite rule
* add test for scale_grad and bias_grad
* resolve conflicts
* remove amp in composite_rule.py
* add float16 test
* deal with NHWC format
* keep the composite rule in float16 identical as original kernel
* resolve conflicts
-
Committed by Roc
-
Committed by wanghuancoder
* delete old dygraph op test
-
Committed by houj04
* fix test_kldiv_loss_op_xpu
* fix test_temporal_shift_op_xpu
-
Committed by wuyefeilin
* add fp16/bf16 to clip op
* fix as reviewed
* update test_clip_op.py
* update test_clip_op.py
-