- 02 7月, 2021 2 次提交
-
-
由 Jacek Czaja 提交于
-
由 TTerror 提交于
-
- 29 6月, 2021 2 次提交
-
-
由 taixiurong 提交于
-
由 Zhou Wei 提交于
* support Ninja and establish dependencies relationship between paddle with third_party * fix CI * support Ninja
-
- 24 6月, 2021 1 次提交
-
-
由 Zhou Wei 提交于
-
- 22 6月, 2021 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 21 6月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 18 6月, 2021 2 次提交
- 17 6月, 2021 1 次提交
-
-
由 Leo Chen 提交于
-
- 16 6月, 2021 1 次提交
-
-
由 Zhou Wei 提交于
-
- 15 6月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 09 6月, 2021 1 次提交
-
-
由 Leo Chen 提交于
-
- 08 6月, 2021 1 次提交
-
-
由 TTerror 提交于
-
- 07 6月, 2021 1 次提交
-
-
由 lidanqing 提交于
-
- 02 6月, 2021 2 次提交
- 01 6月, 2021 2 次提交
-
-
由 Zhou Wei 提交于
-
由 chentianyu03 提交于
* replace and remove complex64/128 types in custom OP and other files * fix custom_tensor_test fail bug * fix custom_conj_test fail bug * fix dispatch_test_op build fail bug
-
- 28 5月, 2021 1 次提交
-
-
由 Zhou Wei 提交于
-
- 27 5月, 2021 2 次提交
-
-
由 Thunderbrook 提交于
* support ssd in PsCore * remove log * remove bz2 * defalut value * code style * parse table class * code style * add define
-
由 Zhou Wei 提交于
* Unify all external API error message mechanism and enhance third-party API error msg * fix some comment * fix some comment
-
- 26 5月, 2021 1 次提交
-
-
由 Zhou Wei 提交于
* fix ninja compilation bug on windows * polish windows ci * polish windows ci
-
- 24 5月, 2021 1 次提交
-
-
由 Jacek Czaja 提交于
* - bump up oneDNN to 2.2.2 (should reduce perf drops of mobilenet) * - more recnet onednn 2.2.2 (some more bugfixes)
-
- 20 5月, 2021 1 次提交
-
-
由 TTerror 提交于
* fix gather op and add logsumexp op on kunlun * update xpu depence * update tests and fix elementwise_add
-
- 19 5月, 2021 1 次提交
-
-
由 wuhuanzhou 提交于
* CI skip inference test if only python files modified, test=develop * fix compilation error on ROCM, test=develop * fix cmake error on PR-CI-ROCM-Compile, test=develop
-
- 18 5月, 2021 1 次提交
-
-
由 QingshuChen 提交于
-
- 12 5月, 2021 1 次提交
-
-
由 Zhou Wei 提交于
* fix windows CI * fix windows CI
-
- 11 5月, 2021 1 次提交
-
-
由 wuhuanzhou 提交于
-
- 10 5月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* pslib with cmake * heter util * vlog * heter server test * add dtor * cmake
-
- 07 5月, 2021 1 次提交
-
-
由 LielinJiang 提交于
* fix compile error on jetson platform
-
- 06 5月, 2021 2 次提交
-
-
由 zhulei 提交于
* [Rocm] fix test_expand_as_op * [Rocm] fix test_expand_as_op * [Rocm] fix test_expand_as_op * [Rocm] fix test_expand_as_op * [Rocm] fix test_expand_as_op * [Rocm] fix test_expand_as_op
-
由 ronnywang 提交于
* fix test_unpool_op * fix test_inplace_addto_strategy * fix test_conv2d_fusion_op * fix test_imperative_lod_tensor_to_selected_rows, test_imperative_selected_rows_to_lod_tensor * fix test_dot_op * fix test_correlation_op * fix tracer * fix test_memcpy_op
-
- 29 4月, 2021 1 次提交
-
-
由 LielinJiang 提交于
* add op read_file and decode_jpeg
-
- 24 4月, 2021 1 次提交
-
-
由 winter-wang 提交于
-
- 23 4月, 2021 1 次提交
-
-
由 Zhou Wei 提交于
* fix Windows CI MP compile and environment install script * clear Windows CI environment * clear Windows CI environment * clear Windows CI environment
-
- 22 4月, 2021 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 21 4月, 2021 1 次提交
-
-
由 zhang wenhui 提交于
* add allreduce and broadcast without test (#31024) add allreduce and broadcast without test * Refactor HCCLCommContext to be compatible with Paddle (#31359) Refactor HCCLCommContext to be compatible with Paddle (#31359) * [NPU] add npu kernel for communication op (#31437) * add allreduce and broadcast without test * add c_broadcast_test case * build c_comm_init and c_create_group operators * make the whole thing compile * add broadcast and init op test case but run failed * make unit test compile * fix broadcast test bug and change into hcom for ccl * change c_comm_init and c_create_group ops accordingly * make tests compile * transfer code to 27 * compiled successfully in 28, but run failed * test broadcast in 28, but failed * make hcom primitives work * change hccl data type for base.h * fix broadcast bug * make attributes work * fix group name bug * add allreduce but test failed * allreduce bug for qiuliang * allreduce finished * add allgather and reducescatter * merge all op code * add allgather test * finish run all ccl op test exclude send/recv * all all op and test exclude send/recv * send_v2_npu.cc recv_v2_npiu.cc compiled * fix ccl core dump bug and test allgather, reducescatter, broadcast op * fix allreduce bug just for test * hcom send&recv test pass, without hcom_destroy * for qiuliang test * Ascend Send&Recv Test Pass * all op (ex send/recv) ok * fix bug * merge all ccl op * style merge to PaddlePaddle * merge style * new merge style * merge style 2 * insert an empty at the end * disable ctest for hcom to pass ci Co-authored-by: Nvoid-main <voidmain1313113@gmail.com> Co-authored-by: Nf2hkop <f2huestc@outlook.com> * Add auto-increasing tag id for Hcom OPs (#31702) * add c_reduce_sum op (#31793) add c_reduce_sum op * update Ascendrc hccl to 20.3 (#32126) update Ascendrc hccl to 20.3 (#32126) * fix merge code * change cmake.txt1 * [NPU] Support npu kernel for c sync stream op (#31386) * sync stream npu op * add with_ascend_acl * update c++ unittest * compile all failed * try to pre commit * after pre commit * merge&compile&test hccl successfully! * fix code style * fix code style * fix bugs about hccl * fix some bugs * fix code style * fix style * fix style * fix * fixed * merge develop Co-authored-by: Nlw921014 <liuwei921014@yeah.net> Co-authored-by: NVoid Main <voidmain1313113@gmail.com> Co-authored-by: Nf2hkop <f2huestc@outlook.com> Co-authored-by: Nxiayanming <41795079@qq.com>
-
- 19 4月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* [NPU] support GarbageCollector for npu (#31874) * support GarbageCollector for npu * fix typo * fix gather_grad * disable NPUDefaultStreamGarbageCollector on NPU * [NPU] support npu for memcpy op (#31808) * support npu for memcpy op * add ut * fix ut * fix typo * 【NPU】fix bug of using temp vector (#31963) * fix bug when beta1_pow on cpu (#31995) * [NPU] support npu profiler (#31684) * support npu profiler * add python api * fix bugs * add wrapper for incomplete type * update profile proto * record npu wait * add xpu placeholder * fix adam (#32016) * [NPU] enable async copy and add wait before sync operation (#31956) * enable async copy and add wait before sync operation * remove unneccessary wait * add FillNpuTensorWithConstant * refine * fix fill_constant * make TensorFromVector/TensorToVector sync * [NPU] Support dataloader on npu place. (#31867) * [NPU] Wait on NPUPlace (#32086) * [NPU] fix cast op (#32121) * fix npu kernel of cast op to handle casting to same dtype * add comments * [NPU] support cann 20.3 (#32044) * fix compile problem on cann 20.3 * fix ut * fix test_mul * fix check_finite_and_scale * fix lookup_table_v2_grad * fix cmake * support print op * [NPU] Support npu save load (#31893) * support save load for NPU * add save load npu unittest * support np.array transform in NPU * fix errors * delete dygraph in unittest * add Wait * fix unittest * fix review comment * fix unittest problem * fix little problem * change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performance (#32196) * change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performace * refine code * fix NPUDeviceContext in all c++ unittest (#32198) * fix NPUDeviceContext in all c++ unittest * refine log Co-authored-by: Npangyoki <pangyoki@126.com> * [NPU] Remove TensorFromVector and avoid sync copy in npu op kernel for better performance (#31994) * enable async copy and add wait before sync operation * remove unneccessary wait * add FillNpuTensorWithConstant * refine * fix fill_constant * change TensorFromVector to FillNpuTensorWithConstant * fix ignored api * delete extra unittest * fix little error * fix update_loss_scaling_op_npu and check_finite_and_unscale_op_npu * change TensorCopySync to TensorCopy * delete useless Wait and add StreamWait * fix npu_stream error * fix check_finite_and_unscale_op_npu TensorCopy * only save stream wait * fix NPUDeviceContext in all c++ unittest * delete wait Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com> * delete useless unittest file (#32206) * Fix op test (#32231) * fix conditional block (#32243) * fix adam bug again (#32246) * fix compile * fix ut * fix ut Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com> Co-authored-by: Npangyoki <pangyoki@126.com>
-
- 15 4月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* pscore support heterps * fleet cmake * fleet wrapper * macro * solve conflict * solve conflict * add unitest * paddle enforce * unitest * unitest * unitest
-