- 23 May, 2023 17 commits
-
-
Committed by Yuanle Liu
* fix trt inference fp16 io * fix inference fp16 io
-
Committed by co63oc
-
Committed by weishengying
-
Committed by co63oc
* Fix typos * Fix
-
Committed by cyberslack_lee
-
Committed by huangjiyi
* update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update HostAlloc * update param name * update cpu kernel * remove kernel header * update * update
-
Committed by Tian Zheng
-
Committed by Leo Chen
* add host memory stats * add ut
-
Committed by huangjiyi
* update * update * update * set out dtype
-
Committed by Wang Xin
* static graph autogen code support for pad3d op * bug fixed * add ut for pad3d mkldnn op * fix coverage * fix bug * fix bug * Delete test_pad3d_mkldnn_op.py
-
Committed by zhangyikun02
-
Committed by ronnywang
* [CustomDevice] fix auto_paralell * update * update * update
-
Committed by LoneRanger
* fix the static op generation for group_norm * fix bug of mismatch * fix bug of AssertionError * fix setting of composite
-
Committed by Yuanle Liu
-
Committed by kangguangli
* Use copy_if_different to avoid recompilation of generated cutlass kernels. * add program parameter dialect_interface * fix op create bug * add conv2d * draft of paddle converter * fix CI * fix windows CI * fix program destructor * printer draft * fix bug * printer draft finish * fix windows CI * reserve inplace semantics * revert program::destroy since no need to do topology sort * revert * modify by reviews * polish * fix op definition * fix CI * refresh file changes --------- Co-authored-by: umiswing <umiswing@foxmail.com> Co-authored-by: zhangbo9674 <zhangbo54@baidu.com>
-
Committed by zhupengyang
-
Committed by HongyuJia
* [0D-Tensor] Support elementwise_add * support elementwise_add ZeroDim2&3
-
- 22 May, 2023 18 commits
-
-
Committed by risemeup1
* update_c++14_to_c++17_on_windows * disable test_audio_logmel_feature and test_audio_mel_feature
-
Committed by risemeup1
-
Committed by xiongkun
* [Dy2static-Fallback] add set_eval_frame function in pybind. 1. add set_eval_frame function in pybind. * add unittest for eval frame hooker. * [support py38] * fix-GeneratorExit error in eval frame hooker * support python == 3.9 * support 3.10 * fix some comments * speed up eval frame for cache hitted code. * code format * fix unittest --------- Co-authored-by: SigureMo <sigure.qaq@gmail.com>
-
Committed by zhangbo9674
* refine op dyn_cast * fix bug * refine code * refine code * refine code * refine code
-
Committed by kangguangli
* add conv2d * printer draft * fix bug * printer draft finish * fix windows CI * commit printer and resnet50 related ops * fix * fix * fix op definition --------- Co-authored-by: umiswing <umiswing@foxmail.com> Co-authored-by: zhangbo9674 <zhangbo54@baidu.com>
-
Committed by shentanyue
-
Committed by cyber-pioneer
* recompute bn grad * fix test case --------- Co-authored-by: sunli <466530738@qq.com>
-
Committed by lijin23
* fix empty bugs for xpu * fix empty bugs for xpu
-
Committed by zhupengyang
-
Committed by zhupengyang
-
Committed by Yuanle Liu
-
Committed by JYChen
-
Committed by Yuanle Liu
[Inference] add config.enable_low_precision_io api and remove rely on AnalysisConfig::Precison in trt (#52485)
-
Committed by zhoutianzi666
* fix transfer_layout when input size if too big * do not add TransferLayoutKernelGPU * add int64 and add check
-
Committed by zhangyikun02
-
Committed by Tian Zheng
* Add GPU kernel for multiclass_nms3 op * Make multiclass_nms3 gpu kernel output consistent with cpu kernel * Fix API incompatibility * Fix unittests on builds without CUDA * Fix ROCM build * Remove fluid headers; Use default atol for unittest * Change function and variable naming * Add comments; Reduce redundant code * Use paddle test framework
-
Committed by niuliling123
Print python trace back when debugmode = CHECK_NAN_INF_AND_ABORT and backward has nan/inf (#52808)
-
Committed by wangshengxiang
* bind xpu op: 3D grid sample * fix edge cases in xpu op: reshape & slice
-
- 20 May, 2023 2 commits
-
-
Committed by zhangbo9674
-
Committed by zhangbo9674
* add types and attributes * remove some const_cast * refine code
-
- 19 May, 2023 3 commits
-
-
Committed by shentanyue
-
Committed by wz1qqx
-
Committed by warrentdrew
* add minimum grad composite rules * add public python api * fix format * fix format * update testcase * fix testcase * fix format * fix cmakelist.txt * fix format * fix param problem * fix op and composite rule * fix bf16 cpu support problem * fix bf16 cpu issue * fix axis error log * add axis for maximum * revert commit * remove .orig * fix generic problem * revert max op * fix axis error * fix maximum axis * fix test_check_output * fix cinn * fix minimum maximum axis check
-