- 13 Aug, 2021 1 commit
-
-
Committed by Baibaifan
-
- 30 Jul, 2021 1 commit
-
-
Committed by Leo Chen
-
- 30 Apr, 2021 1 commit
-
-
Committed by XiangGao
-
- 09 Apr, 2021 1 commit
-
-
Committed by Leo Chen
* [feature] support npu allocator (#30840)
* [feature] support npu operator (#30951)
* [feature] support npu allocator, part 2 (#30972)
* support npu allocator
* add npu device context
* fix some compile problem
* fix some compile problem
* add npu info
* compile ok
* fix include dir
* support naive_best_fit_allocator
* run ut ok, but failed to exit
* call aclrtResetDevice before exit
* fix aclFinalize
* add system allocator test
* add selected_gpus in gtest
* add tensor_test for npu
* support npu op, initial commit
* add npu stream
* add elementwise_add_op
* compile ok
* fix typo
* fix elementwise_add_op_npu_test
* support op run
* test can run but failed
* change aclopExecuteV2 to aclopCompileAndExecute
* support parsing ascend rank table file (#31000)
* Fix reshape on GE graph. (#31084)
* add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
* add npu sub op
* fix typo
* rename test
* fix bug
* fix bug
* add fp16 kernel
* fix typo
* support sub grad op
* support elementwise_sub_grad op
Co-authored-by: frankwhzhang <frankwhzhang@126.com>
* Fix compilation problem (#31100)
* fix compile
* fix code style
* remove const_cast
* support adding correct npu op in pybind.h (#31143)
* support adding correct npu op in pybind.h
* refine code
* [NPU] Support executor with NPU (#31057)
* [NPU] Support executor with NPU
* Fix code according to reviews
* Fix code
* Add unittest for sub op npu
* refactor npu device manager (#31154)
* fix selected npus
* fix compile
* fix reading flags from env
* format
Co-authored-by: xiayanming <41795079@qq.com>
Co-authored-by: gongweibao <weibao.gong@gmail.com>
Co-authored-by: frankwhzhang <frankwhzhang@126.com>
Co-authored-by: liym27 <33742067+liym27@users.noreply.github.com>
-
- 29 Mar, 2021 1 commit
-
-
Committed by ronnywang
-
- 22 Feb, 2021 1 commit
-
-
Committed by Qi Li
-
- 24 Dec, 2020 1 commit
-
-
Committed by Wilber
-
- 25 Nov, 2020 1 commit
-
-
Committed by Chen Weihang
* do not show cpp stack by default & add hint
* fix failed unittest
* fix failed unittests
-
- 28 Sep, 2020 1 commit
-
-
Committed by arlesniak
* Add support for mkldnn ops types selection with FLAGS in dygraph
* use regex to match DNNL verbose
* python3 encoding fix
-
- 21 Sep, 2020 1 commit
-
-
Committed by Leo Chen
* support using add instead of sum to do gradient accumulation
* add inplace addto pass
* add grad_add op and inplace addto pass
* remove debug code
* code refine
* fix bug when several sum ops are inserted at the same op_idx
* fix Flags type
* add addto attribute for conv3d
* fix ut
* code clean
* fix type
-
- 28 Aug, 2020 1 commit
-
-
Committed by Zhen Wang
* update the demo code and the doc of varbase.backward.
* update the doc of the fake interface `paddle.fluid.Variable`.
* remove BackwardStrategy.
-
- 07 Aug, 2020 1 commit
-
-
Committed by Leo Chen
* add flags_call_stack_level
* update
* refine code
-
- 28 Jul, 2020 1 commit
-
-
Committed by arlesniak
* Added DNNL cache management for DyGraph
* move FLAGS_use_mkldnn to more general CMakeLists, make use of the flag in ClearGradients
* missing file
* Fixes after review
* Bringing back original idea of place for 'use_mkldnn' flag to be accessible from platform and imperative.
* Removed duplicate and added docs
* Fixes for CI
-
- 21 Apr, 2020 1 commit
-
-
Committed by 石晓伟
* add the thread_local_allocator, test=develop
* refactor the thread_local_allocator, test=develop
* provides option setting strategy, test=develop
-
- 15 Apr, 2020 1 commit
-
-
Committed by guofei
Correct the name [`FLAGS_sync_nccl_allreduce`](https://www.paddlepaddle.org.cn/documentation/docs/zh/advanced_guide/flags/others_cn.html#flags-sync-nccl-allreduce) based on the information from our official website.
-
- 04 Mar, 2020 1 commit
-
-
Committed by Zeng Jinle
* add recorded cuda memory apis, fix typo, test=develop
* add more ut, test=develop
* follow comments, test=develop
* fix py35 incompatible issues, test=develop
-
- 08 Jan, 2020 1 commit
-
-
Committed by Zeng Jinle
-
- 06 Jan, 2020 2 commits
-
-
Committed by Zeng Jinle
-
Committed by Zeng Jinle
-
- 20 Oct, 2019 1 commit
-
-
Committed by 123malin
* test=develop, communicator_is_sgd_optimizer flags
-
- 11 Oct, 2019 1 commit
-
-
Committed by Zeng Jinle
-
- 23 Sep, 2019 1 commit
-
-
Committed by chengduo
* Add RecordHistoryLocalExecScopes test=develop
-
- 18 Sep, 2019 1 commit
-
-
Committed by Zeng Jinle
-
- 10 Sep, 2019 1 commit
-
-
Committed by XiaoguangHu
Add document annotations for FLAGS that need to be open to external developers test=develop (#19692)
-
- 16 Aug, 2019 1 commit
-
-
Committed by Zeng Jinle
-