- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 29 11月, 2021 1 次提交
-
-
由 piotrekobiIntel 提交于
-
- 27 11月, 2021 1 次提交
-
-
由 Aganlengzi 提交于
* [NPU] reorganization for device API abstraction * [NPU] delete old files * [NPU] fix npu_collective_helper * [NPU] fix collective_helper * [NPU] fix ut * [NPU] mod memory allocation and hccl_helper * [NPU] fix place_type * [NPU] split enfoce.h * move acl* call into npu_info * merge conflict * fix merge * merge conflict * merge conflict
-
- 03 9月, 2021 1 次提交
-
-
由 Leo Chen 提交于
-
- 28 6月, 2021 1 次提交
-
-
由 Qi Li 提交于
* [ROCM] fix RNN miopen as weight need to permuted, test=develop * [ROCM] fix data share when is_test, test=develop * update, test=develop
-
- 12 5月, 2021 1 次提交
-
-
由 liym27 提交于
-
- 26 4月, 2021 1 次提交
-
-
由 Leo Chen 提交于
-
- 09 4月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* [feature] support npu allocator (#30840) [feature] support npu allocator * [feature] support npu operator (#30951) [feature] support npu operator * [feature] support npu allocator, part 2 (#30972) * support npu allocator * add npu device context * fix some compile problem * fix some compile problem * add npu info * compile ok * fix include dir * support naive_best_fit_allocator * run ut ok, bug failed to exit * call aclrtResetDevice before exit * fix aclFinilize * add system allocatot test * add selected_gpus in gtest * add tensor_test for npu * support npu op, initial commit * add npu stream * add elementwise_add_op * compile ok * fix typo * fix elementwise_add_op_npu_test * support op run * test can run but failed * change aclopExecuteV2 to aclopCompileAndExecute * support parsing ascend rank table file (#31000) support parsing ascend rank table file * Fix reshape on GE graph. (#31084) Fix reshape on GE graph * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973) * add npu sub op * fix typo * rename test * fix bug * fix bug * add fp16 kernel * fix typo * support sub grad op * support elementwise_sub_grad op Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com> * Fix compilation problem (#31100) Fix compilation problem (#31100) * fix compile * fix code stype * remove const_cast * support adding correct npu op in pybind.h (#31143) * support adding correct npu op in pybind.h * refine code * [NPU] Support executor with NPU (#31057) * [NPU] Support executor with NPU * Fix code according to reviews * Fix code * Add unittest for sub op npu * refactor npu device manager (#31154) refactor npu device manager (#31154) * fix selected npus * fix compile * fix reading flags from env * format Co-authored-by: Nxiayanming <41795079@qq.com> Co-authored-by: Ngongweibao <weibao.gong@gmail.com> Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com> Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
-
- 04 2月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include second time, test=develop
-
- 01 2月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 15 1月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 11 12月, 2020 1 次提交
-
-
由 LoveAn 提交于
* Add the strategy of skipping cc/cu test compilation and execution in CI, test=develop * fix if error with CI_SKIP_TEST, test=develop * fix add properties to test error on Linux/MAC, test=develop * fix set test properties of test_code_generator error, test=develop * remove test codes and advance judgment of file modification on Linux, test=develop * rename CI_SKIP_TEST to CI_SKIP_CPP_TEST, test=document_fix * Add branch judgement on Linux, test=develop
-
- 06 11月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 04 11月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 24 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 23 7月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
test=develop
-
- 24 6月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* refine PADDLE_THROW, test=develop * refine error msg, test=develop * refine cuda error, test=develop * follow comments, test=develop * fix compile problem, test=develop * fix bug, test=develop
-
- 19 6月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 20 5月, 2020 1 次提交
-
-
由 Huihuang Zheng 提交于
The background is that buddy_allocator_test depends on some big GPU allocations and current machine may not have it. It can cause random failure on cloud machine. After discussion with Shuo Tian, we decided to make buddy_allocator_test exclusive.
-
- 28 4月, 2020 1 次提交
-
-
由 Huihuang Zheng 提交于
test=develop
-
- 04 3月, 2020 1 次提交
-
-
由 Zeng Jinle 提交于
* add recorded cuda memory apis, fix typo, test=develop * add more ut, test=develop * follow comments, test=develop * fix py35 incompatible issues, test=develop
-
- 13 11月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
-
- 06 11月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 05 11月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 30 10月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 24 10月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 17 10月, 2019 1 次提交
-
-
由 liuwei1031 提交于
* improve save and load behaviour, test=develop * code cleaning, test=develop * disable check_guards and update_guards in release version, test=develop * fix compilation issue, test=develop * add buddy_allocator speed test data, test=develop * fix compilation issue, test=develop * fix comment, test=develop * update function names according to the google C++ style guide, test=develop * tweak the test data format, test=develop * move buddy_allocator_test_data to paddle/fluid/testdata, test=develop * add accessor and mutator for Desc, test=develop
-
- 25 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 24 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 20 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* refine err msg of out of gpu memory, test=develop * refine err msg again, test=develop * refine errog message again, test=develop * follow reviewer's comments, test=develop
-
- 18 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 16 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 03 9月, 2019 2 次提交
-
-
由 Zeng Jinle 提交于
-
由 Zeng Jinle 提交于
-
- 01 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* add retry_allocator for gpu, test=develop * follow chengduoZH's comments, test=develop * follow huihuang's comments,test=develop * change f,l in enforce.h to be file,line, test=develop * increase code coverage by adding unittests, test=develop * fix CMakeLists.txt, test=develop
-
- 30 8月, 2019 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 20 8月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* replace part of PADDLE_ASSERT to PADDLE_ENFORCE test=develop * remove unused fallback_alloc_size_ * add unit-test of CUDAPinnedAllocator test=develop
-
- 01 8月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* fix gpu_info, test=develop * fix reserving gpu memory calculation bug, add fraction=1 unittest, test=develop * fix bug again for reserving size, test=develop
-
- 31 7月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
GPU allocation uses fraction of available memory, also fix the GetUsed without lock
-
- 05 5月, 2019 1 次提交
-
-
由 chengduo 提交于
* remove unnecessary set_devices
-