- 07 Apr, 2021 1 commit
-
-
Committed by zhang wenhui
* Ascend rc (#30483) * Fix compilation on CANN20.1 and older (#30494) Fix compilation on CANN20.1 and older * Add distribution supported (#30578) Add distribution supported * Build parser for Hcom* operators (#30627) Build parser for Hcom* operators * Pass device_ids info from launch to trainer. (#30632) Pass device_ids info from launch to trainer * Add Hccl program group (#30642) Add Hccl program group * Add startup bash files of test_ascend_group. (#30645) Add startup bash files of test_ascend_group * cleanup (#30646) cleanup test_ascend_group.py * [Feature] Build parser to support distributed training (#30658) [Feature] Build parser to support distributed training * fix compilation on ascend-20.1 (#30722) fix compilation on ascend-20.1 * Dev/fix ascend string (#30749) Dev/fix ascend string * code style (#30781) code style * Merge ascend_optimizer and ascend_parser. (#30776) Merge ascend_optimizer and ascend_parser. * Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug (#30797) Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug * Add paddle ascend distribution training supported (#30796) Add paddle ascend distribution training supported * pass cxx_flags to gloo cmake (#30857) * Destroy session first. (#30954) Destroy session first.
* merge * fix, test=develop * fix, test=develop * fix style, test=develop * fix, test=develop * fix * fix log fatal, test=develop * fix enforce style, test=develop * fix, test=develop * fix, test=develop * fix rccl, test=develop * fix test, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix node_num, test=develop * fix ids str, test=develop * fix ids str, test=develop * fix ids str, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix style code, test=develop * fix style code, test=develop * fix style code, test=develop * fix style code, test=develop Co-authored-by: hutuxian <hutuxian2011@sina.cn> Co-authored-by: gongweibao <weibao.gong@gmail.com> Co-authored-by: Void Main <voidmain1313113@gmail.com> Co-authored-by: Leo Chen <chenqiuliang@baidu.com> Co-authored-by: dingsiyu <18369187719@163.com> Co-authored-by: OleNet <olenet@126.com>
-
- 31 Mar, 2021 1 commit
-
-
Committed by taixiurong
-
- 03 Mar, 2021 1 commit
-
-
Committed by Qi Li
* [ROCM] update fluid elementwise op for rocm (part10), test=develop * update, test=develop * address review comments, test=develop
-
- 22 Feb, 2021 1 commit
-
-
Committed by Qi Li
-
- 04 Feb, 2021 1 commit
-
-
Committed by wanghuancoder
* use iwyu clean include second time, test=develop
-
- 03 Feb, 2021 2 commits
-
-
Committed by QingshuChen
* try again if kunlun memory malloc failed * minor
-
Committed by 石晓伟
* support xpu inference with analysis predictor, test=develop * merge the cmake of the xpu toolchain, test=develop * add c-apis, test=develop * fix a bug in extern_xpu, test=develop
-
- 01 Feb, 2021 1 commit
-
-
Committed by Qi Li
-
- 19 Jan, 2021 1 commit
-
-
Committed by Leo Chen
* unify calling cudaSetDevice * fix compile
-
- 15 Jan, 2021 1 commit
-
-
Committed by 石晓伟
-
- 14 Jan, 2021 1 commit
-
-
Committed by QingshuChen
-
- 13 Jan, 2021 1 commit
-
-
Committed by QingshuChen
* optimize memcpy perf for kunlun * remove useless unittest for kunlun mean * minor
-
- 12 Jan, 2021 1 commit
-
-
Committed by 石晓伟
-
- 25 Dec, 2020 1 commit
-
-
Committed by Chen Weihang
* try to add fwd op input dtypes * refactor base impl * return tmp_ins after dygraph prepare data * fix typo found in debug * polish comment & add complex net test * revert detail change * fix unittest failed * add complex kernel condition control * fix xpu test failed & polish comment * polish details by review comments
-
- 11 Dec, 2020 1 commit
-
-
Committed by LoveAn
* Add the strategy of skipping cc/cu test compilation and execution in CI, test=develop * fix if error with CI_SKIP_TEST, test=develop * fix add properties to test error on Linux/MAC, test=develop * fix set test properties of test_code_generator error, test=develop * remove test codes and advance judgment of file modification on Linux, test=develop * rename CI_SKIP_TEST to CI_SKIP_CPP_TEST, test=document_fix * Add branch judgement on Linux, test=develop
-
- 02 Dec, 2020 1 commit
-
-
Committed by Wojciech Uss
-
- 20 Nov, 2020 1 commit
-
-
Committed by gongweibao
-
- 06 Nov, 2020 1 commit
-
-
Committed by Wilber
-
- 04 Nov, 2020 1 commit
-
-
Committed by Wilber
-
- 23 Oct, 2020 1 commit
-
-
Committed by Chen Weihang
* add compile limit for paddle enforce * polish elementwise_op_function.cu.h * fix failed unittest * fix windows compile failed * detail polish * revert no type constructor
-
- 22 Oct, 2020 1 commit
-
-
Committed by Leo Chen
* fix bug of fetch_async_op_handle * revert some changes of test_buffer_shared_memory_reuse_pass * revert some changes of test_buffer_shared_memory_reuse_pass
-
- 27 Sep, 2020 1 commit
-
-
Committed by QingshuChen
* support elementwise add, activation, matmul on Baidu Kunlun * test=kunlun * minor * test=kunlun * reconstruct the xpu directory * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun
-
- 25 Sep, 2020 1 commit
-
-
Committed by Leo Chen
-
- 24 Sep, 2020 1 commit
-
-
Committed by wanghuancoder
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 16 Sep, 2020 1 commit
-
-
Committed by Leo Chen
-
- 21 Aug, 2020 1 commit
-
-
Committed by QingshuChen
* support Baidu AI Accelerator * test=kunlun * minor * test=kunlun * support xpu op in separate file * test=kunlun * update XPU error message and remove duplicated code * test=kunlun * minor * test=kunlun * minor * test=kunlun
-
- 23 Jul, 2020 1 commit
-
-
Committed by joanna.wozna.intel
test=develop
-
- 22 Jul, 2020 1 commit
-
-
Committed by Leo Chen
* fix best_fit_allocator_test on windows, test=develop * enable best_fit_allocator_test and test_math_op_patch_var_base, test=develop
-
- 24 Jun, 2020 1 commit
-
-
Committed by Leo Chen
* refine PADDLE_THROW, test=develop * refine error msg, test=develop * refine cuda error, test=develop * follow comments, test=develop * fix compile problem, test=develop * fix bug, test=develop
-
- 19 Jun, 2020 1 commit
-
-
Committed by tianshuo78520a
-
- 09 Jun, 2020 1 commit
-
-
Committed by liuwei1031
-
- 08 Jun, 2020 1 commit
-
-
Committed by Zhou Wei
-
- 03 Jun, 2020 1 commit
-
-
Committed by Chen Weihang
* remove REPLACE_ENFORCE_GLOG compile option & add ci rule prohibiting use of LOG(FATAL), test=develop * remove ci test case, test=develop * replace all LOG(FATAL) & polish message, test=develop * fix typo, test=develop * polish error info detail, test=develop
-
- 20 May, 2020 1 commit
-
-
Committed by Huihuang Zheng
The background is that buddy_allocator_test depends on some large GPU allocations that the current machine may not be able to provide, which can cause random failures on cloud machines. After discussion with Shuo Tian, we decided to make buddy_allocator_test exclusive.
-
- 11 May, 2020 1 commit
-
-
Committed by Chen Weihang
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 28 Apr, 2020 1 commit
-
-
Committed by Huihuang Zheng
test=develop
-
- 21 Apr, 2020 1 commit
-
-
Committed by 石晓伟
* add the thread_local_allocator, test=develop * refactor the thread_local_allocator, test=develop * provides option setting strategy, test=develop
-
- 20 Apr, 2020 1 commit
-
-
Committed by Zhou Wei
* Optimize the error messages of paddle CUDA API, test=develop * fix the error messages of paddle CUDA API, test=develop * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop * remove build_ex_string,test=develop * merge conflict,test=develop
-
- 04 Mar, 2020 1 commit
-
-
Committed by Zeng Jinle
* add recorded cuda memory apis, fix typo, test=develop * add more ut, test=develop * follow comments, test=develop * fix py35 incompatible issues, test=develop
-
- 02 Mar, 2020 1 commit
-
-
Committed by Chen Weihang
* add lodtensor share memory & serialization, test=develop * fix windows compile error, test=develop * deal vartype pickle & fix unittest matching error message, test=develop * update timeout variable name, test=develop * refactor memory map implement, test=develop * clear mmap file descriptor when exiting unexpectedly, test=develop * remove the child process fd in advance, test=develop * remove mmap fds after Queue.put in child process, test=develop * add hard unittests for register exit func, test=develop * fix python2 compatibility problem in unittest, test=develop * fix exception unittest error, test=develop * polish code based on review comment, test=develop
-