- 10 4月, 2023 2 次提交
- 08 4月, 2023 1 次提交
-
-
由 张春乔 提交于
* mv WITH_ASCEND_CL * mv WITH_ASCEND * rollback
-
- 07 4月, 2023 3 次提交
-
-
由 shaojie_wang 提交于
* register bf16 for communication ops * fix bfloat16 type finding compile error in c_allreduce_max_op
-
由 Wang Xin 提交于
-
由 jameszhang 提交于
-
- 06 4月, 2023 2 次提交
- 04 4月, 2023 1 次提交
-
-
由 Ruibiao Chen 提交于
* Improve new executor static build * Skip GC for static build * Skip infershape for static build * Handle read_op * Add fused_attention to OpsWithFluidKernelNeedMoveToPhi * Fix argsort typos * Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi * Fix skip share lod errors * Fix errors for adam * Fix errors for eigvals, memcpy and fake_quantize * Add static_build.cc * Add black list * Fix CI errors * Fix CI errors * Fix CI errors * Fix TensorArray * Fix TensorArray * Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel * Fix copy * Fix errors * Fix momentum * Skip mkldnn * Fix CI errors * Fix c_sync_calc_stream_op * Fix CINN * Fix while op * All CI pass, disable FLAGS to merge code, enable it after more tests in future * Add UTs * Fix typos * Fix typos * Add mkldnn UT * Remove mkldnn test * Fix typos * Fix dist test * Fix typos * Fix CI errors * Fix CI errors * Add UTs * Fix typos * Fix typos * Add sparse tests * ToComplexType -> ToComplex * Add test_matmul_op_static_build to disable_win_inference_test
-
- 01 4月, 2023 1 次提交
-
-
由 jjyaoao 提交于
* Delete the /paddle/fluid/platform/device/npu directory * clear Cmakelists * Try removing npu in the header file
-
- 31 3月, 2023 3 次提交
-
-
由 huangjiyi 提交于
* update bipartite_match * update * fix bug * fix test * fix bug * fix Kunlun-KP-Build * Revert "fix Kunlun-KP-Build" This reverts commit ceab63cc23079fd6839c826bb52db893fb056355. * update
-
由 jameszhang 提交于
* [kunlun] prevent numerical overflow in collective softmax_with_ce * add fix in another branch
-
由 Yuang Liu 提交于
-
- 30 3月, 2023 2 次提交
-
-
由 huangjiyi 提交于
* update assign_pos * update attention_lstm * update barrier * update batch_fc * update beam_search * update beam_search_decode * update bilateral_slice * fix bug * Handle Structure kernel for InterpreterCore::RunOperator * fix bug * fix rocm compile * fix rocm compile * Revert "fix rocm compile" * test * revert test and update cmake --------- Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
由 Ghost Screaming 提交于
* Support ignore_index for c_softmax_with_cross_entropy_op. * Polish code. Remove useless comments and add Testcase. * Polish code for TestCase. * Polish code. * Polish code style. * Polish code. * Change loss calculation formula and ignore_index dtype. * Polish TestCase. * Fix bug of c_softmax_with_cross_entropy_op_xpu_op. Attribute 'ignore_index' dtype is int64_t.
-
- 29 3月, 2023 2 次提交
-
-
由 QingshuChen 提交于
-
由 Ghost Screaming 提交于
* Support ignore_index for c_softmax_with_cross_entropy_op. * Polish code. Remove useless comments and add Testcase. * Polish code for TestCase. * Polish code. * Polish code style. * Polish code. * Change loss calculation formula and ignore_index dtype. * Polish TestCase.
-
- 24 3月, 2023 1 次提交
-
-
由 TaoTao Li 提交于
* add all_reduce, reduce kernel and api * fix all_reduce reduce ut fix reduce op maker conflict fix merge conflicts * fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops rename allreduce op, to remove * fix code format fix comments * modify test_collective_reduce_api ut timeout * fix PR-CI-Build fix comments: format phi operator
-
- 23 3月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* unify add_position_encoding * unify affine_channel * unify alloc_float_status * unify allreduce * unify alltoall * unify anchor_generator * unify ascend_trigger * fix bug * fix test
-
- 21 3月, 2023 1 次提交
-
-
由 iSerendipity 提交于
* move DataType from paddle::experimental to phi * convert namespace * convert namespace * convert namespace * clarify namespace * convert more datatype * Revert "convert more datatype" This reverts commit 083b462959e6a22d4d8767707b628b95b396642e. * convert more in auto_code_generator * fix conflicts for XPU * fix namespace conflicts * fix errors * Revert "fix errors" This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876. * fix errors * fix formatting
-
- 16 3月, 2023 1 次提交
-
-
由 JZ-LIANG 提交于
* update env setting * update pass logic * dist op support bf16 * backward cast update * update setting * update backward * revert amp pass * update fp16 backward logic * register c_embedding bf16 * revert engine * add unitest * add unitest * update unitest * update cmake * update math * update math.py * update unitest * update unitest * revise unitest * revise unitest * update unitest * update unitest * update unitest
-
- 10 3月, 2023 1 次提交
-
-
由 QingshuChen 提交于
-
- 23 2月, 2023 1 次提交
-
-
由 jameszhang 提交于
* kunlun support c_softmax_with_cross_entropy * fix grad calc error * replace mutable_data() and ShareDataWith() * update xdnn * update xpu toolchain to 20230215 * remove fluid from test file
-
- 22 2月, 2023 1 次提交
-
-
由 TaoTao Li 提交于
-
- 21 2月, 2023 1 次提交
-
-
由 QingshuChen 提交于
-
- 16 2月, 2023 1 次提交
-
-
由 shentanyue 提交于
* support xpu multi-card infer * add ut * clean code * clean code * fix * fix * fix * fix
-
- 13 1月, 2023 1 次提交
-
-
由 jameszhang 提交于
* kunlun add support for c_concat and c_split * replace mutable_data() and ShareDataWith()
-
- 10 1月, 2023 1 次提交
-
-
由 Wen Sun 提交于
* refactor: gloo comm context migration * fix: headers & avoid mutable_data usage * fix: cmake gloo dep * style: rename funcs * refactor: move to new files * fix: gloo deps * refactor: simplify create device
-
- 09 1月, 2023 1 次提交
-
-
由 LiYuRio 提交于
* comm_context and static init * refactor: move to phi/core/distributed * refactor: avoid mutable_data usage * fix: windows sock * fix: device without nccl Co-authored-by: Wen Sun <syl1887415157@126.com>
-
- 04 1月, 2023 1 次提交
-
-
由 HongyuJia 提交于
* execute use kernel_key first * change OpKernelType->KernelKey * fix py3 compile error, remove redundant header files * fix build_strategy_test * fix DataType::RAW * fix custom_type test: operator_test.cc * fix transform place * fix backends_are_same_class * try fix place TransDataDevice * support all KernelKey * fix TransformData * fix place_are_same_class * fix merge * fix test_params_no_grad * fix specific place of GetExpectedKernelType * fix specific place of GetExpectedKernelType * fix GetKernelTypeForVar * fix dtype error * fix fetch_v2 * change GetKernelTypeForVar * fix interpreter * fix typo error * polish codes * polish codes * polish codes * fix conflict
-
- 29 12月, 2022 1 次提交
-
-
由 xu98bin 提交于
* auto parallel bf16
-
- 19 12月, 2022 1 次提交
-
-
由 Wen Sun 提交于
-
- 15 12月, 2022 1 次提交
-
-
由 huangjiyi 提交于
-
- 07 12月, 2022 1 次提交
-
-
由 张春乔 提交于
-
- 29 11月, 2022 1 次提交
-
-
由 JZ-LIANG 提交于
* get default calc stream from execution ctx instead of global dev ctx pool.
-
- 28 11月, 2022 1 次提交
-
-
由 张春乔 提交于
-
- 24 11月, 2022 1 次提交
-
-
由 huangjiyi 提交于
* rm dependence to "convert_utils.h" in some files * fix bugs * replace DataType2String with DataTypeToString * replace framework::DataTypeSize with phi::SizeOf * mv convert_function from fluid to phi and rm old map * recommit with pre-commit * repalce ProtoVarType with ProtoDataType and update comment. * fix error about include "dnnl.hpp" * revert add dep mkldnn to convert_utils in phi * add mkldnn deps in convert_utils.h in phi * move deps to convert_utils.h in phi
-
- 21 11月, 2022 2 次提交
- 18 11月, 2022 1 次提交
-
-
由 Wang Xin 提交于
* remove "gpu_primitives.h" in fluid namespace * fix PR-CI-GpuPS fail * fix PR-CI-GpuPS fail
-