- 27 3月, 2023 4 次提交
-
-
由 risemeup1 提交于
* fix_gcc12_error * patch on eigen3 for fixing gcc12 error * Update multiary.cc
-
由 Sławomir Siwek 提交于
* extract Op and OPMaker to .h * extend pattern for fused_op * set "with_residual" default to false * adjust fuse passes * remove fc+eltwise flag * fused_output_scale * activation attrs * remove extra attrs * fix int8/bf16 unit tests * simplify RecomputeOutputDims * remove unused method * Add description for attributes * add extra check * adjust op compats * update quantize test * fix protobuf parsing error * fix int8 performance * fused elementwises * merge develop * remove activation * restore activation for existing add/sub ops
-
由 houj04 提交于
-
由 sneaxiy 提交于
* fix mea compile error * support 2-D bias * add inline to avoid compile error * polish codes
-
- 25 3月, 2023 3 次提交
-
-
由 张春乔 提交于
-
由 Ruibin Cheung 提交于
[Fix Bug] fix get_new_shape and get_new_data_from_tensor not support fallback to CPU on custom device (#52002)
-
由 Infinity_lee 提交于
* codestyle up027 * add to pyproject.toml
-
- 24 3月, 2023 9 次提交
-
-
由 TaoTao Li 提交于
* add all_reduce, reduce kernel and api * fix all_reduce reduce ut fix reduce op maker conflict fix merge conflicts * fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops rename allreduce op, to remove * fix code format fix comments * modify test_collective_reduce_api ut timeout * fix PR-CI-Build fix comments: format phi operator
-
由 YuanRisheng 提交于
* decouple memory copy * fix ci bugs * fix ci compile bugs * fix rocm compile * fix ci bugs * decouple memory * deal with conflict * fix xpu compile bugs * fix xpu bugs * deal with xpu bugs * fix cmake bugs * fix windows bugs * fix ci bugs * fix ci bugs * delete redundance code * add code for pybind * fix py3 bugs * fix ci bugs
-
由 YuhangLi 提交于
* [CUSTOM DEVICE]analysis predictor custom device support * del debug log
-
由 Yuanle Liu 提交于
-
由 PuQing 提交于
* fix momentum dtype infer * fix momentum datatype * fix on cpu * add momentum
-
由 thunder95 提交于
* untracked files * kthvalue perf * remove unused files * fix isnan * fix isnan2 * fix bug * try to fix rocm error
-
由 ZhangDY-6483 提交于
* first version, notest * return final rst, notest * use infinity() instead of max * ut structure * start up of ut * generate lse * update * add depense * reconstruct cmake * move file * add memory efficient attention and fix blasimpl * update * update cmake * add namespace * update cmake * use .cu * update for pad3d * bug fix * bug fix * update * bug fix * update enforce * add test case * merge the lse pad * fix kernel_fn of backward * fix PADDLE_ENFORCE_EQ and phi_api * fix PADDLE_ENFORCE * fix PADDLE_ENFORCE * rerun coverage * fix memory efficient attention test * rerun ci * add cuda version condition * add cuda version condition * delete WIP test * replace PADDLE_ENFORCE * edit the namespace of datatype in multiple.cc * rerun * rerun --------- Co-authored-by: Nliuyuang <liuyuang@baidu.com>
-
由 zhangyikun02 提交于
-
由 Yuang Liu 提交于
-
- 23 3月, 2023 24 次提交
-
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
[Polish Log] Polish Tensor operants' log: 'OperantsManager reusing XXX mode API {func_name}' (#51991) * [Polish Log] Polish Tensor operants' log: 'OperantsManager reusing XXX mode API {func_name}' * Make API name more precise
-
由 zhangyikun02 提交于
-
由 Wangzheee 提交于
-
由 xiaoguoguo626807 提交于
* delete prim flag for matmul_2_grad * delete prim flag for matmul_2_grad * add new setgradoutmeta for matmul_double_grad_node * modify test and delete log * deal with review
-
由 chenjian 提交于
* add meshgrid composite rule * add meshgrid composite rule * update * add into CMakeLists * fix * update * update * optimize code * fix meshgrid op * update test
-
由 iSerendipity 提交于
-
由 ZhouMengLei1999 提交于
-
由 sneaxiy 提交于
* remove fluid deps in fused_linear_param_grad_add_kernel * fix compile error * fix ut error * follow comments
-
由 limingshu 提交于
* first commit * fix bugs * remove_useless sync
-
由 Huang Jiyi 提交于
* unify add_position_encoding * unify affine_channel * unify alloc_float_status * unify allreduce * unify alltoall * unify anchor_generator * unify ascend_trigger * fix bug * fix test
-
由 Huang Jiyi 提交于
* update * update * update * update * update * fix test
-
由 cxxly 提交于
-
由 zqw_1997 提交于
* to support cuda12, pybind need to upgrade to v2.10.0 * add DEPS of pybind in test_custom_plugin_creater.cc * only change the tag * please let CI pass * try pybind v2.10/3 * modify the include header in test * code check
-
由 Lin Manhui 提交于
* Add bf16 support for elementwise_pow * Update ut
-
由 Infinity_lee 提交于
-
由 Yuang Liu 提交于
-
由 yeliang2258 提交于
* add bf16 and fp16 tests * fix dtype check
-
由 Huang Zhengjie 提交于
* fix distributed graph engine
-
由 LoneRanger 提交于
* add fp16 and bfp16 for temporalshift * add fp16 and bfp16 for complex * fix bug * fix bug * add fp16 and bf16 for conj * fix bug * fix bug * Update complex_kernel.h fix bug * Update temporal_shift_grad_kernel.h fix bug * Update temporal_shift_kernel.h fix bug
-
由 PuQing 提交于
* add nanmedian output defs * remove the multiclass_nms3 momentum
-
由 PuQing 提交于
[CodeStyle][C408][C409][C410] Fix unnecessary <dict/list/tuple> call and unnecessary <list/tuple> passed to <list/tupule>() (#51928) * autofix * add select config * autofix C410 * add C410 select
-
由 denglianbin 提交于
* finish pr * skip cpu test for logical * change test style * fix error.
-