- 04 Feb, 2021 2 commits
-
-
Committed by joanna.wozna.intel
* Update Xbyak and add bf16 fast performance verification * Fix formatting * Change LOG message * Trigger an update of a new tag
-
Committed by wanghuancoder
* use iwyu clean include second time, test=develop
-
- 03 Feb, 2021 2 commits
- 01 Feb, 2021 1 commit
-
-
Committed by QingshuChen
* fix malloc L3 failed bug for kunlun * minor
-
- 28 Jan, 2021 1 commit
-
-
Committed by Qi Li
* [ROCM] update fluid platform for rocm35 (part1), test=develop * address review comments, test=develop
-
- 25 Jan, 2021 2 commits
-
-
Committed by Jacek Czaja
-
Committed by chentianyu03
* add abs test case * use std::abs to fix abs bug * fix the abs bug * fix abs bug
-
- 20 Jan, 2021 4 commits
-
-
Committed by Jacek Czaja
-
Committed by wanghuancoder
* delete empty line of pybing.cc, test=develop * use nvtx push pop in timeline, test=develop * change year, test=develop * add #ifdef PADDLE_WITH_CUDA, test=develop * add #ifndef WIN32, test=develop * is_pushed to is_pushed_, test=develop
-
Committed by chentianyu03
* rewrite abs op * rewrite abs op and remove abs in activation * remove abs register in old codes * fix abs_grad type error * fix abs double_grad output name error * modify abs_grad, abs_grad_grad functor for windows building * format code style * fix the bug where the result is NaN when the divisor is zero * add missing abs attr and add abs for float16
-
Committed by Wilber
-
- 19 Jan, 2021 2 commits
- 18 Jan, 2021 2 commits
-
-
Committed by liuyuhui
-
Committed by QingshuChen
-
- 15 Jan, 2021 1 commit
-
-
Committed by 石晓伟
-
- 13 Jan, 2021 2 commits
-
-
Committed by Huihuang Zheng
The usleep function in <unistd.h> only accepts arguments less than 1,000,000. The current call can exceed this limit, so it has to be fixed. This PR fixes a random CI error.
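
One way to stay under the usleep limit is to split a long wait into several shorter calls. The following is a minimal sketch of that idea, not the code from this PR; `SplitMicros`, `SleepMicros`, and `kMaxUsleepMicros` are hypothetical names introduced here for illustration, and the sketch assumes a POSIX platform.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>
#include <unistd.h>  // usleep (POSIX)

// Largest value passed to a single usleep call in this sketch
// (hypothetical constant; keeps every argument below 1,000,000).
constexpr std::uint64_t kMaxUsleepMicros = 999999;

// Splits a total wait in microseconds into chunks that each fit the limit.
std::vector<std::uint32_t> SplitMicros(std::uint64_t total_us) {
  std::vector<std::uint32_t> chunks;
  while (total_us > 0) {
    const std::uint64_t step =
        total_us < kMaxUsleepMicros ? total_us : kMaxUsleepMicros;
    chunks.push_back(static_cast<std::uint32_t>(step));
    total_us -= step;
  }
  return chunks;
}

// Sleeps for roughly total_us microseconds using several usleep calls.
void SleepMicros(std::uint64_t total_us) {
  for (std::uint32_t chunk : SplitMicros(total_us)) {
    assert(chunk <= kMaxUsleepMicros);
    usleep(chunk);
  }
}
```

Under these assumptions, `SleepMicros(2500000)` would issue three calls instead of one call with an out-of-range argument.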
-
Committed by QingshuChen
* optimize memcpy perf for kunlun * remove useless unittest for kunlun mean * minor
-
- 12 Jan, 2021 1 commit
-
-
Committed by Chen Weihang
-
- 11 Jan, 2021 1 commit
-
-
Committed by AshburnLee
-
- 09 Jan, 2021 1 commit
-
-
Committed by Jacek Czaja
* - Added UT for testing elementwise_mul caching * lint fixes
-
- 07 Jan, 2021 1 commit
-
-
Committed by WeiXin
-
- 06 Jan, 2021 1 commit
-
-
Committed by Zhou Wei
* Polish and Optimize the print/repr message of all layer * fix some code format
-
- 29 Dec, 2020 2 commits
-
-
Committed by 石晓伟
-
Committed by Huihuang Zheng
PADDLE_RETRY_CUDA_SUCCESS used the wrong sleep time, which could cause timeouts in unittests. This PR fixes it. According to the documentation at https://pubs.opengroup.org/onlinepubs/7908799/xsh/unistd.h.html, sleep in unistd.h takes seconds and usleep takes microseconds, while Sleep in windows.h takes milliseconds.
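
The unit mismatch described above can be illustrated with a small wrapper that accepts milliseconds and converts them for each call. This is a sketch under assumptions, not Paddle's implementation; `SleepMillis`, `WholeSeconds`, and `RemainderMicros` are hypothetical helpers introduced for illustration.

```cpp
#include <cassert>
#include <cstdint>
#ifdef _WIN32
#include <windows.h>  // Sleep takes milliseconds
#else
#include <unistd.h>   // sleep takes seconds, usleep takes microseconds
#endif

// Whole seconds contained in a millisecond duration.
inline std::uint32_t WholeSeconds(std::uint32_t ms) { return ms / 1000; }

// Sub-second remainder of a millisecond duration, in microseconds.
inline std::uint32_t RemainderMicros(std::uint32_t ms) {
  return (ms % 1000) * 1000;
}

// Sleeps for about ms milliseconds using the native unit of each call.
inline void SleepMillis(std::uint32_t ms) {
#ifdef _WIN32
  Sleep(ms);  // already in milliseconds
#else
  // The remainder is always below 1,000,000 microseconds.
  assert(RemainderMicros(ms) < 1000000);
  if (WholeSeconds(ms) > 0) {
    sleep(WholeSeconds(ms));  // may return early if a signal arrives
  }
  usleep(RemainderMicros(ms));
#endif
}
```

Keeping the conversion in one place means callers never pass a millisecond count to a function that expects seconds or microseconds.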
-
- 28 Dec, 2020 3 commits
- 26 Dec, 2020 1 commit
-
-
Committed by liuyuhui
-
- 25 Dec, 2020 2 commits
-
-
Committed by LielinJiang
* enable bilateral_slice unittest on windows platform * reduce max threads
-
Committed by Chen Weihang
* add support for complex grad accumulated * add unittest for coverage * update test dtype * remove useless blank line
-
- 24 Dec, 2020 1 commit
-
-
Committed by Wilber
-
- 23 Dec, 2020 1 commit
-
-
Committed by Jacek Czaja
-
- 21 Dec, 2020 1 commit
-
-
Committed by Huihuang Zheng
Add retry logic to CublasHandleHolder to avoid random unittest failures.
-
- 19 Dec, 2020 1 commit
-
-
Committed by Jacek Czaja
* - Reimplemented elementwise_add grad - lint * - fix after review * - Fix to fix after review
-
- 18 Dec, 2020 1 commit
-
-
Committed by Aurelius84
-
- 17 Dec, 2020 3 commits
-
-
Committed by wanghuancoder
* Windows generate pdb and dump, for debug * fix code style, test=develop * modify cmakelist
-
Committed by Huihuang Zheng
Change CublasHandleHolder from using PADDLE_ENFORCE_CUDA_SUCCESS to PADDLE_RETRY_CUDA_SUCCESS to fix a random unittest failure. The unittest log showed a CUDA allocation error in this file, which may be due to insufficient GPU resources. We fixed a similar failure in the past, so we applied PADDLE_RETRY_CUDA_SUCCESS here.
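
The general retry-instead-of-fail pattern behind this change can be sketched as follows. This is an illustration only and does not reproduce the PADDLE_RETRY_CUDA_SUCCESS macro; `RetryUntilSuccess` and its parameters are hypothetical, and the callable stands in for any operation that may fail transiently.

```cpp
#include <cassert>
#include <chrono>
#include <thread>

// Calls `call` until it returns true or max_attempts is reached,
// pausing between attempts. Returns whether any attempt succeeded.
template <typename Fn>
bool RetryUntilSuccess(Fn call, int max_attempts,
                       std::chrono::milliseconds pause) {
  assert(max_attempts > 0);
  for (int attempt = 1; attempt <= max_attempts; ++attempt) {
    if (call()) {
      return true;  // succeeded, stop retrying
    }
    if (attempt < max_attempts) {
      // Give a transient condition (for example, a busy device) time to clear.
      std::this_thread::sleep_for(pause);
    }
  }
  return false;  // every attempt failed; the caller decides how to report it
}
```

A caller would wrap the fallible operation in a lambda that returns true on success, and raise the original error only if the helper returns false.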
-
Committed by Jacek Czaja
-