- 18 6月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
* Start implementing int8 eltwise add test=develop * - Fix to Michal PR * - Fix test=develop * - Lint fixes test=develop * - Added checking if elementwise_mul can be used test=develop * - Added attribs to skip_attrs_set test=develop * - Improved broadcasting test=develop - fixes to compilation - fix - fix - Lint fixes test=develop * - removed redundant condition test=develop Co-authored-by: NMichal Gallus <michal.gallus@intel.com>
-
- 16 6月, 2020 1 次提交
-
-
由 hutuxian 提交于
* Add a StatValue class in the backend to represent a stat. * Add a singleton StatRegistry to maintain the collection of stats. * For the sake of code neatness, we only support type of int and float, which can cover most of the scenarios.
-
- 09 6月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 08 6月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
-
- 05 6月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* support selectedrows allreduce in multi-cards dygraph, test=develop * remove useless import modules in unittests, test=develop * add nccl cmake to get nccl version, test=develop * add if-condition to compiled correctly, test=develop * add detail version parseing for old nccl, test=develop * polish camke details, test=develop * fix remove test cmake error, test=develop * fix cmake condition, test=develop * change unittest camke list, test=develop * fix unittest cmake rule, test=develop, test=framep0
-
- 03 6月, 2020 2 次提交
-
-
由 Chen Weihang 提交于
* remove REPLACE_ENFORCE_GLOG compile option & add ci rule prohibit LOG(FATAL) using, test=develop * remove ci test case, test=develop * replace all LOG(FATAL) & polish message, test=develop * fix typo, test=develop * polish error info detail, test=develop
-
由 Zhang Ting 提交于
* fix negative framework overhead, test=develop * use overhead summary, test=develop
-
- 02 6月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* simply C++ error stack once again, test=develop * refactor code remove string pointer and recursive, test=develop
-
- 01 6月, 2020 2 次提交
- 28 5月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
-
- 26 5月, 2020 3 次提交
-
-
由 wangchaochaohu 提交于
-
由 Adam 提交于
-
由 lidanqing 提交于
* Update PADDLE_ENFORCE in DNNL related ops test=develop * Abstract macro of OP_GET_PLACE_CHECK test=develop * update according to reviews * update GET_PLACE_CPU_CHECK * fix typo test=develop * revert macro test=develop
-
- 25 5月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 21 5月, 2020 1 次提交
-
-
由 Adam 提交于
* Update isCached() to be thread freindly test=develop * Add isCached() check inside Softmax handler test=develop * Fix PaddleEnforce() message test=develop
-
- 19 5月, 2020 1 次提交
-
-
由 Leo Chen 提交于
-
- 18 5月, 2020 2 次提交
-
-
由 Jacek Czaja 提交于
-
由 Yiqun Liu 提交于
* Add the check for whether CUDA Driver and NVRTC is available for the runtime system. * Call cuInit to initialize the CUDA Driver API before all CUDA callings. test=develop * Change the behavior when libnvrtc.so can not be found, printing a warning instead of exiting. test=develop * Do not initialize CUDA Driver API for windows and macos. test=develop * Remove the call of cuInit when entering paddle and enable the test_code_generator. test=develop * Add some built-in functions for __half. test=develop * Change save_intermediate_out to false in unittest. test=develop * Fix error reference to tempropary variable when seting including path for device_code. test=develop
-
- 15 5月, 2020 1 次提交
-
-
由 Adam 提交于
* Add isCached() mechanism to elementwise_add test=develop * Hide code inside handler test=develop
-
- 14 5月, 2020 1 次提交
-
-
由 pawelpiotrowicz 提交于
test=develop
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 08 5月, 2020 2 次提交
- 30 4月, 2020 1 次提交
-
-
由 Guo Sheng 提交于
* Fix cusolver loader for Windows in dynamic_loader.cc. test=develop * Fix missing CUSOLVER_ROUTINE_EACH_R1. test=gpu test=develop * Add unsupprot for cusolver on Windows temporarily. test=develop * Fix GetCusolverDsoHandle error message. test=develop
-
- 29 4月, 2020 1 次提交
-
-
由 石晓伟 提交于
* update the analysis predictor, test=develop * update the unit test, test=develop * no priority set before the inferface determined, test=develop * interface name generalization, test=develop
-
- 28 4月, 2020 1 次提交
-
-
由 Sylwester Fraczek 提交于
-
- 27 4月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 25 4月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 24 4月, 2020 2 次提交
-
-
由 Guo Sheng 提交于
* Add cholesky_op forward part. test=develop * Complete cholesky_op forward part. test=develop * Add cholesky_op backward part. test=develop * Complete cholesky_op backward part. test=develop * Refine cholesky_op error check and docs. test=develop * Add grad_check unit test for cholesky_op. test=develop * Fix sample code in cholesky doc. test=develop * Refine some error messages of cholesky_op. test=develop * Refine some error messages of cholesky_op. test=develop * Remove unused input in cholesky_grad. test=develop * Remove unused input in cholesky_grad. test=develop * Fix stream for cusolverDnSetStream. test=develop * Update PADDLE_ENFORCE_CUDA_SUCCESS from cholesky_op to adapt to latest code. test=develop * Add CUSOLVER ERROR in enforce.h test=develop * Fix the missing return value in cholesky. test=develop
-
由 wangchaochaohu 提交于
-
- 23 4月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 22 4月, 2020 2 次提交
-
-
由 Jacek Czaja 提交于
-
由 石晓伟 提交于
-
- 21 4月, 2020 1 次提交
-
-
由 石晓伟 提交于
* add the thread_local_allocator, test=develop * refactor the thread_local_allocator, test=develop * provides option setting strategy, test=develop
-
- 20 4月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
* Optimize the error messages of paddle CUDA API, test=develop * fix the error messages of paddle CUDA API, test=develop * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop * remove build_ex_string,test=develop * merge conflict,test=develop
-
- 18 4月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
* update eigen, test=develop * remove patches, test=develop * add definition of -fabi-version, test=develop * add patch for TensorBlock.h, test=develop * test windows, test=develop * only update eigen for Linux, test=develop * add code comments, test=develop
-
- 17 4月, 2020 1 次提交
-
-
由 石晓伟 提交于
* supports thread-binding stream, test=develop * avoid using thread_local variables in dtor, test=develop * modify the stream priority enum, test=develop
-
- 15 4月, 2020 1 次提交
-
-
由 guofei 提交于
Correct the name [`FLAGS_sync_nccl_allreduce`](https://www.paddlepaddle.org.cn/documentation/docs/zh/advanced_guide/flags/others_cn.html#flags-sync-nccl-allreduce) based on the information from our official website.
-
- 14 4月, 2020 1 次提交
-
-
由 Yi Liu 提交于
eagerly release cuda resources before cuda enviroment destroying test=develop
-