- 31 3月, 2023 8 次提交
-
-
由 huangjiyi 提交于
* update bipartite_match * update * fix bug * fix test * fix bug * fix Kunlun-KP-Build * Revert "fix Kunlun-KP-Build" This reverts commit ceab63cc23079fd6839c826bb52db893fb056355. * update
-
由 jameszhang 提交于
* [kunlun] prevent numerical overflow in collective softmax_with_ce * add fix in another branch
-
由 chenjian 提交于
* first commit * add registry * add unit test * fix format * add unit test * fix bug * replace unsuqeeze to reshape * fix * fix unit test * update test * update test * fix unit test * fix * fix
-
由 zyfncg 提交于
* add yaml for some op * fix inplace_abn * fix test_leaky_relu_grad_grad_functor * fix yaml * fix typo
-
由 YuanRisheng 提交于
* remove distribute * fix py3 bugs * fix gpu-ps bugs * fix compile bugs * fix unittest bugs
-
由 engineer1109 提交于
fix with auto&
-
由 张春乔 提交于
* autofix Co-authored-by: NLiyulingyue <83450930+Liyulingyue@users.noreply.github.com> * revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py * empty commit, trigger ci * fix test_slice --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 Yuang Liu 提交于
-
- 30 3月, 2023 11 次提交
-
-
由 zhupengyang 提交于
-
由 Feiyu Chan 提交于
1. add type caster for paddle's complex type, to allow pybind to automatically cast it with python's complex type; 2. add complex64 and complex128 data type for `libpaddle.Tensor`'s element get and set(which is required to perturb an element to get the numerical derivative) 3. add support for cuda pinned place in `libpaddle.Tensor` element get and set --- 4. fix a bug in op code generation.(Creation of output folder in concurrent with parsing op yamls.)
-
由 Wang Xin 提交于
* add autogen code support for spectral_norm * bug fixed * fix PR-CI-Static-Check fail
-
由 pangengzheng 提交于
* support run haokanctr model in heterps-models * polish setup.py * polish JVM_LIB in evn_dict * align infer auc with DistPsArch pre-stable * async and multi thread data feed * rewrite dense tensor intialization * async infer shape and reuse memory
-
由 huangjiyi 提交于
* update assign_pos * update attention_lstm * update barrier * update batch_fc * update beam_search * update beam_search_decode * update bilateral_slice * fix bug * Handle Structure kernel for InterpreterCore::RunOperator * fix bug * fix rocm compile * fix rocm compile * Revert "fix rocm compile" * test * revert test and update cmake --------- Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
由 Ghost Screaming 提交于
* Support ignore_index for c_softmax_with_cross_entropy_op. * Polish code. Remove useless comments and add Testcase. * Polish code for TestCase. * Polish code. * Polish code style. * Polish code. * Change loss calculation formula and ignore_index dtype. * Polish TestCase. * Fix bug of c_softmax_with_cross_entropy_op_xpu_op. Attribute 'ignore_index' dtype is int64_t.
-
由 Ainavo 提交于
* support auto generate for prelu * op_compat 中增加输入参数 * del attrs ; add kernel data_type * add PreluGradInferMeta
-
由 gouzil 提交于
* add autogen code support for sigmoid_cross_entropy_with_logits * add inplace
-
由 Wang Xin 提交于
* add autogen code support for merge_selected_rows * bug fixed
-
由 lzydev 提交于
* change op with xpu * change range yaml * fix bug in generate_op.py
-
由 jiangcheng 提交于
* [CINN] pass global seed to CINN * fix cu not include cinn/runtime/flags.h bug * fix DefaultCUDAGenerator should has device id bug
-
- 29 3月, 2023 5 次提交
-
-
由 张春乔 提交于
* add up34 * modify var name in loop * revert changes in test_slice * Revert "modify var name in loop" This reverts commit 6d748e371afb417054ed0c6b36fd11e87959a90d. * temporarily ignore test_slice.py * add comment * empty commit, re-trigger all ci * fix inc --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 QingshuChen 提交于
-
由 Ghost Screaming 提交于
* Support ignore_index for c_softmax_with_cross_entropy_op. * Polish code. Remove useless comments and add Testcase. * Polish code for TestCase. * Polish code. * Polish code style. * Polish code. * Change loss calculation formula and ignore_index dtype. * Polish TestCase.
-
由 zhangyikun02 提交于
-
由 huangjiyi 提交于
* fix kp compile * test * Revert "test" This reverts commit 3a1cbfaa0f23e6e06d3dcd8d0b0c28aa63a98e70. * update copyright * update cmake * update cmake * update cmake * update cmake
-
- 28 3月, 2023 6 次提交
-
-
由 cyberslack_lee 提交于
-
由 张春乔 提交于
* mv cumprod * add attrs * Update backward.yaml * Update backward.yaml
-
由 cyberslack_lee 提交于
* fix huber_loss * fix * fix ops.yaml add intermediate * fix * fix test
-
由 RedContritio 提交于
* support auto generate static for one_hot_v2 * format
-
由 Wang Xin 提交于
-
由 RedContritio 提交于
* support auto generate for log_softmax * add data_type
-
- 27 3月, 2023 7 次提交
-
-
由 YuanRisheng 提交于
* perfect structure kernel registry * fix ci bugs
-
由 HappyHeavyRain 提交于
* add fused_yaml fused_backward * fix eager_funciton bug * add some comment of fused yaml file * add 'support_dygraph_mode' configuration in fused yaml * delete some 'fused_api.h' in include file * add fused flag in api_gen
-
由 Xinyu Chen 提交于
-
由 HappyHeavyRain 提交于
* support assign op * support assign infer_var_type * change code according to review * change code according to review * only save 'get_infer_var_type_func' * rest file mode
-
由 wanghuancoder 提交于
This reverts commit 9c238d2b.
-
由 risemeup1 提交于
* fix_gcc12_error * fix gcc12 error * fix gcc12 error
-
由 Sławomir Siwek 提交于
* extract Op and OPMaker to .h * extend pattern for fused_op * set "with_residual" default to false * adjust fuse passes * remove fc+eltwise flag * fused_output_scale * activation attrs * remove extra attrs * fix int8/bf16 unit tests * simplify RecomputeOutputDims * remove unused method * Add description for attributes * add extra check * adjust op compats * update quantize test * fix protobuf parsing error * fix int8 performance * fused elementwises * merge develop * remove activation * restore activation for existing add/sub ops
-
- 25 3月, 2023 1 次提交
-
-
由 Infinity_lee 提交于
* codestyle up027 * add to pyproject.toml
-
- 24 3月, 2023 2 次提交
-
-
由 TaoTao Li 提交于
* add all_reduce, reduce kernel and api * fix all_reduce reduce ut fix reduce op maker conflict fix merge conflicts * fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops rename allreduce op, to remove * fix code format fix comments * modify test_collective_reduce_api ut timeout * fix PR-CI-Build fix comments: format phi operator
-
由 YuanRisheng 提交于
* decouple memory copy * fix ci bugs * fix ci compile bugs * fix rocm compile * fix ci bugs * decouple memory * deal with conflict * fix xpu compile bugs * fix xpu bugs * deal with xpu bugs * fix cmake bugs * fix windows bugs * fix ci bugs * fix ci bugs * delete redundance code * add code for pybind * fix py3 bugs * fix ci bugs
-