- 09 Feb, 2021 3 commits
-
-
Committed by Leo Chen
* support npu allocator
* add npu device context
* fix some compile problem
* fix some compile problem
* add npu info
* compile ok
* fix include dir
* support naive_best_fit_allocator
* run ut ok, but failed to exit
* call aclrtResetDevice before exit
* fix aclFinalize
* add system allocator test
* add selected_gpus in gtest
* add tensor_test for npu
* support npu op, initial commit
* add npu stream
* add elementwise_add_op
* compile ok
* fix typo
* fix elementwise_add_op_npu_test
* support op run
* test can run but failed
* change aclopExecuteV2 to aclopCompileAndExecute
-
Committed by Leo Chen
[feature] support npu operator
-
Committed by Leo Chen
[feature] support npu allocator
-
- 21 Jan, 2021 1 commit
-
-
Committed by gongweibao
Add distribution support
-
- 15 Jan, 2021 1 commit
-
-
Committed by 石晓伟
-
- 13 Jan, 2021 2 commits
-
-
Committed by Huihuang Zheng
The usleep function in <unistd.h> only accepts an argument less than 1,000,000. The current call can exceed this limit, so it has to be fixed. This PR fixes a random CI error.
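
The limit described in this commit can be illustrated with a small sketch. It assumes one possible workaround: splitting a long delay into several pieces that each stay below 1,000,000 microseconds. The helper name `split_usleep_chunks` and the constant `USLEEP_LIMIT_US` are hypothetical, and the commit does not say that this is how the call was actually fixed.

```python
USLEEP_LIMIT_US = 1_000_000  # per the commit message, usleep arguments must stay below this


def split_usleep_chunks(total_us, limit_us=USLEEP_LIMIT_US):
    """Split a delay in microseconds into chunks that are each below limit_us.

    Hypothetical helper for illustration only; it is not part of Paddle.
    """
    if total_us < 0:
        raise ValueError("delay must be non-negative")
    max_chunk = limit_us - 1  # largest value strictly below the limit
    chunks = []
    remaining = total_us
    while remaining > 0:
        step = min(remaining, max_chunk)
        chunks.append(step)
        remaining -= step
    return chunks


if __name__ == "__main__":
    # A 2.5 second delay expressed as several usleep-sized pieces.
    print(split_usleep_chunks(2_500_000))
```

Each returned value could then be passed to one usleep call in turn, so no single call receives an out-of-range argument.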
-
Committed by QingshuChen
* optimize memcpy perf for kunlun
* remove useless unittest for kunlun mean
* minor
-
- 12 Jan, 2021 1 commit
-
-
Committed by Chen Weihang
-
- 11 Jan, 2021 1 commit
-
-
Committed by AshburnLee
-
- 09 Jan, 2021 1 commit
-
-
Committed by Jacek Czaja
* - Added UT for testing elementwise_mul caching
* lint fixes
-
- 07 Jan, 2021 1 commit
-
-
Committed by WeiXin
-
- 06 Jan, 2021 1 commit
-
-
Committed by Zhou Wei
* Polish and optimize the print/repr message of all layers
* fix some code format
-
- 29 Dec, 2020 2 commits
-
-
Committed by 石晓伟
-
Committed by Huihuang Zheng
PADDLE_RETRY_CUDA_SUCCESS used the wrong sleep time, which could cause timeouts in unit tests. This PR fixes it. According to https://pubs.opengroup.org/onlinepubs/7908799/xsh/unistd.h.html, sleep in unistd.h takes seconds, usleep takes microseconds, and Sleep in windows.h takes milliseconds.
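
The unit mismatch named in this commit can be made concrete with a short sketch that expresses one delay in the unit each API expects. The function name `sleep_arguments` and its dictionary keys are hypothetical, and rounding the whole-second value up is an assumption made here for illustration, not something the commit states.

```python
import math


def sleep_arguments(duration_ms):
    """Express one delay, given in milliseconds, in each API's native unit.

    Hypothetical helper for illustration only; it is not part of Paddle.
    """
    if duration_ms < 0:
        raise ValueError("duration must be non-negative")
    return {
        # sleep() in unistd.h takes whole seconds; rounding up is an assumption.
        "sleep_seconds": math.ceil(duration_ms / 1000),
        # usleep() in unistd.h takes microseconds.
        "usleep_microseconds": duration_ms * 1000,
        # Sleep() in windows.h takes milliseconds.
        "Sleep_milliseconds": duration_ms,
    }


if __name__ == "__main__":
    print(sleep_arguments(250))
```

Passing the same number to all three functions would give delays that differ by factors of a thousand, which is the kind of error that can turn a short retry pause into a test timeout.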
-
- 28 Dec, 2020 3 commits
- 26 Dec, 2020 1 commit
-
-
Committed by liuyuhui
-
- 25 Dec, 2020 2 commits
-
-
Committed by LielinJiang
* enable bilateral_slice unittest on windows platform
* reduce max threads
-
Committed by Chen Weihang
* add support for complex grad accumulation
* add unittest for coverage
* update test dtype
* remove useless blank line
-
- 24 Dec, 2020 1 commit
-
-
Committed by Wilber
-
- 23 Dec, 2020 1 commit
-
-
Committed by Jacek Czaja
-
- 21 Dec, 2020 1 commit
-
-
Committed by Huihuang Zheng
Add retry logic to CublasHandleHolder to avoid random unittest failure.
-
- 19 Dec, 2020 1 commit
-
-
Committed by Jacek Czaja
* - Reimplemented elementwise_add grad - lint
* - fix after review
* - Fix to fix after review
-
- 18 Dec, 2020 1 commit
-
-
Committed by Aurelius84
-
- 17 Dec, 2020 3 commits
-
-
Committed by wanghuancoder
* Windows generate pdb and dump, for debug
* fix code style, test=develop
* modify cmakelist
-
Committed by Huihuang Zheng
Modify CublasHandleHolder to use PADDLE_RETRY_CUDA_SUCCESS instead of PADDLE_ENFORCE_CUDA_SUCCESS to fix a random unittest failure. The unittest log showed a CUDA allocation error in this file, which may be due to insufficient GPU resources. We fixed a similar failure in the past, so we applied PADDLE_RETRY_CUDA_SUCCESS here.
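
The general retry-with-sleep pattern behind this change can be sketched as follows. This is a minimal illustration, not the PADDLE_RETRY_CUDA_SUCCESS macro itself: the function `retry_call`, its parameters, and the use of `RuntimeError` as a stand-in for a transient allocation failure are all assumptions.

```python
import time


def retry_call(func, max_attempts=3, sleep_seconds=0.1, sleep=time.sleep):
    """Call func until it succeeds or max_attempts is exhausted.

    Hypothetical helper for illustration only; it is not Paddle's macro.
    The sleep callable is injectable so the pause can be observed in tests.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    last_error = None
    for attempt in range(1, max_attempts + 1):
        try:
            return func()
        except RuntimeError as err:  # stand-in for a transient allocation error
            last_error = err
            if attempt < max_attempts:
                sleep(sleep_seconds)  # pause before trying again
    raise last_error


if __name__ == "__main__":
    attempts = []

    def flaky_allocation():
        # Fails twice, then succeeds, to mimic a resource that frees up later.
        attempts.append(1)
        if len(attempts) < 3:
            raise RuntimeError("allocation failed")
        return "handle"

    print(retry_call(flaky_allocation, max_attempts=5, sleep_seconds=0.01))
```

The design choice is that a failure raises immediately only after the last attempt, so a resource that becomes available a moment later no longer fails the whole test.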
-
Committed by Jacek Czaja
-
- 16 Dec, 2020 2 commits
- 15 Dec, 2020 1 commit
-
-
Committed by AshburnLee
-
- 14 Dec, 2020 2 commits
-
-
Committed by arlesniak
-
Committed by Jacek Czaja
-
- 11 Dec, 2020 1 commit
-
-
Committed by taixiurong
* 1. fix matmul bug 2. add one hot
* add xpu error msg
-
- 09 Dec, 2020 1 commit
-
-
Committed by Huihuang Zheng
Add sleep time for CUDA retry, similar to our GPU retry logic. This is an attempt to avoid random GPU allocation failures during initialization in unit tests.
-
- 08 Dec, 2020 1 commit
-
-
Committed by jakpiase
* added external reorder to profiler
* added external and internal reorders to profiler
* added internal and external reorder to profiler
* added formatting to int/ext reorder commit
* removed unnecessary comment
-
- 07 Dec, 2020 1 commit
-
-
Committed by Jack Zhou
-
- 04 Dec, 2020 3 commits
-
-
Committed by chentianyu03
* add complex64 and complex128 type; add +-*/@ and slice operator for complex types
* add test cases for complex elementwise, matmul and getitem unittest
* add test cases for complex types
* add test cases for complex matmul unittest
* kron, reshape, transpose support complex types
* sum and trace op support complex types
* add test case of sum and trace op
* fix the bug of imag part of complex not initialized
* format file
* format code style
* kron support type promotion; modify test cases
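
To show what two of the operations listed in this commit compute on complex inputs, here is a small sketch of matrix multiplication (the `@` operator) and trace, using Python's built-in `complex` type and nested lists. It does not use or reproduce Paddle's API; the functions `matmul` and `trace` are hypothetical stand-ins written for illustration.

```python
def matmul(a, b):
    """Multiply two matrices given as nested lists; entries may be complex.

    Hypothetical stand-in for illustration only; it is not Paddle's operator.
    """
    inner = len(b)
    if any(len(row) != inner for row in a):
        raise ValueError("inner dimensions do not match")
    cols = len(b[0])
    return [
        [sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for i in range(len(a))
    ]


def trace(m):
    """Sum the diagonal entries of a square matrix."""
    return sum(m[i][i] for i in range(len(m)))


if __name__ == "__main__":
    a = [[1 + 2j, 0j], [0j, 3 - 1j]]
    b = [[2j, 1 + 0j], [1 + 0j, 1j]]
    product = matmul(a, b)
    print(product)
    print(trace(product))
```

Both the real and imaginary parts take part in every product term, which is why an uninitialized imaginary part, as mentioned in the commit, would corrupt results.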
-
Committed by 卖鱼的哲学
* fix expand && concat/transpose to new api
* update uniform_random_op
* update xpu_header
-
Committed by lilong12
-