- 23 10月, 2020 1 次提交
-
-
由 Huihuang Zheng 提交于
Recently, test_parallel_executor_test_while_train randomly failed on CI. On all CI logs, it showed NCCL initialization failed or cusolver initialization failed. I found online that those failure is usually caused by GPU shortage. Those API calls CUDA APIs directly so it shouldn't be the problem of allocator. It may be somewhere in PaddlePaddle increases GPU usage. However, I run this test for 1000 times on my machine and the CI machine, either of them can reproduce the random failure. Maybe there is something related to the environment only happened in test env. To verify my assumption that somewhere in PaddlePaddle increases GPU usage and also fix this CI, I decreased the batch_size to see whether the random failure disappears in test env.
-
- 22 10月, 2020 4 次提交
-
-
由 Feiyu Chan 提交于
* fix strided_slice_op's GetExpectedKernelType when input tensor is at CUDAPinnedPlace * add unittest for tensors in cuda pinned place * skip test for cuda pinned place on cpu machines
-
由 Leo Chen 提交于
* fix bug of fetch_async_op_handle * revert some changes of test_buffer_shared_memory_reuse_pass * revert some changes of test_buffer_shared_memory_reuse_pass
-
由 Aurelius84 提交于
* remove some judgement * fix len(outputs) == 1
-
由 LielinJiang 提交于
* update hapi predict interface * fix code style * fix docs * fix docs * fix docs * update docs * fix codes style * fix unittest * fix unittest * fix coverage
-
- 21 10月, 2020 7 次提交
-
-
由 Chen Weihang 提交于
-
由 lilong12 提交于
* modify ut cmakefile, test=develop
-
由 cnn 提交于
* rename manual_seed to seed * rename xxx1d-->xxx1D, xxx2d-->xxx2D, xxx3d-->xxx3D * rename manual_seed --> seed * do not rename .cc, .cu and .h file * rename manual_seed --> seed * rename manual_seed --> seed * rename manual_seed --> seed * rename manual_seed --> seed * disable_static on doc example code * donot change manual_seed on generator * add enable_static on sample code * convert python/paddle/fluid/layers/nn.py to bak * fix typo * fix code style * fix seed to manual_seed when call functions of Generator() * fix bug
-
由 liu zhengxi 提交于
-
由 pangyoki 提交于
* add static_mode_white_list * add Mac CI static list * add Win CI white_list * add Coverage and Py3 CI white_list, add test_unittest
-
由 wangguanzhong 提交于
-
由 zhulei 提交于
* Add new api: is_tensor * Add new api: is_tensor * Add new api: is_tensor * Add new api: is_tensor
-
- 20 10月, 2020 8 次提交
-
-
由 lilong12 提交于
-
由 wangguanzhong 提交于
* fix generate_proposal_labels in cascade-rcnn series model, test=develop * fix example code & unittest, test=develop * update code from review comments, test=develop
-
由 Leo Chen 提交于
* fill_constant supports nan and inf * add ut
-
由 Aurelius84 提交于
* refine code of DygraphToStaticAst * add __init__ function
-
由 hong 提交于
-
由 zhang wenhui 提交于
-
由 Double_V 提交于
* add stack pool2d roi_align xpu op,test=kunlun * error message opt, test=kunlun * add xpu unittest,test=kunlun * skip check grad,test=kunlun * fix boostget , test=kunlun * error message opt for XPU, test=kunlun * add rois_num for roi_align xpu OP, test=develop
-
由 LielinJiang 提交于
* fix dataloader
-
- 19 10月, 2020 10 次提交
-
-
由 yinhaofeng 提交于
* lookup_table_xpu op report errors;test=kunlun * add adam xpu op;test=kunlun * reset lookup * change adam wrong;test=kunlun
-
由 TeslaZhao 提交于
-
由 LutaoChu 提交于
Fix diag OP bug on Windows Python3.8 ,remove the std::min
-
由 MRXLT 提交于
fleet support paddle.optimzier * bug fix * fix fleet_base * bug fix * fix coverage
-
由 liuyuhui 提交于
* transfer from paddle.fluid.layers.assign() into creation.py,test=develop * fix ut fail,add support for paddle.assign,test=develop * fix,test=develop * fix UT coverage,test=coverage * fix UT fail,test=coverage * fix doc,test=develop
-
由 huangxu96 提交于
* Still has bugs. * Fixed allclose_op bug, which cannot deal with some cases of fp64 inputs. * improved CUDA kernel performance. * Changed CUDA code. * Fixed a bug in cuda kernel which cannot deal with large dimension input, and added an unittest for it. * Add a test case for float32 input.
-
由 pangyoki 提交于
* fix multinomial doc * fix multinomial error message * little doc change * fix Categorical class doc * optimize format of error message * fix CPU Kernel error message format * fix isinf and isnan error in WindowsOPENBLAS CI * delete inf and nan * add manual_seed in sample code * little error message change * change error message to InvalidArgument * add full point for error message and add manual_seed in CPU environment
-
由 pangyoki 提交于
* Add truncated_gaussian_random_op XPU kernel * Add truncated_gaussian_random_op XPU kernel, test=kunlun * little change, test=kunlun * change boost_get to BOOST_GET_CONST * change boost_get to BOOST_GET_CONST, test=kunlun * little change, test=kunlun * use Generator to generate random number and optimize format, test=kunlun * little change, test=kunlun * add TODO, test=kunlun
-
由 pangyoki 提交于
* Add gaussian_random XPU kernels * commit kunlun, test=kunlun * new version, test=kunlun * change boost_get to BOOST_GET_CONST, test=kunlun * use Generator to generate random number and optimize format, test=kunlun * add TODO, test=kunlun
-
由 pangyoki 提交于
* support uniform_random op on Baidu Kunlun * change dtype of attr shape from int to int64_t * kunlun ci, test=kunlun * new version, test=kunlun * change boost_get to BOOST_GET_CONST * change boost_get to BOOST_GET_CONST, test=kunlun * use Generator to generate random number and optimize format * run Kunlun CI, test=kunlun * add TODO, test=kunlun
-
- 18 10月, 2020 1 次提交
-
-
由 liuyuhui 提交于
* addd * add cast_op_xpu, test=kunlun * fix bug for cast_op_xpu,test=kunlun * add concat_op_xpu, test=kunlun * slove conflicts, test=kunlun * fix bug,test=kunlun * add assign_op_xpu, test=kunlun * fix bug,test=kunlun * test=kunlun;test=develop * fix concat bug,test=kunlun * fix check_dygraph set in test_concat_op_xpu.py,test=kunlun * fix error message,test=kunlun Co-authored-by: Nmapingshuo <mps2012@yeah.net>
-
- 17 10月, 2020 2 次提交
-
-
由 littletomatodonkey 提交于
* add pad apis * rm pad2d test_layer * fix code example
-
由 Aurelius84 提交于
* fix test_lstm unittest failed * add more unittest * modify cmakelist * fix judgement
-
- 16 10月, 2020 7 次提交
-
-
由 YUNSHEN XIE 提交于
* disable test_lstm,test=document_fix * fix some error,test=document_fix
-
由 WangXi 提交于
-
由 Guo Sheng 提交于
* Incorporate cudnn_lstm into LSTM api. test=develop * Make coalesce_tensor support alignment optionally. test=develop * Reorganize RNN apis. test=develop * Fix cudnn rnn layout conversion. test=develop * Add sequence_length support for RNN cudnn implement. Add optional init_h and init_c gradient for cudnn_lstm_op. test=develop * Use create_parameter for rnn cudnn impl. test=develop * Move `self._flat_weight = self.create_parameter()` in RNNBase to main_program. test=develop * Update RNN api unittest to use set_device. test=develop * Fix set_place for unit tests of RNN apis. test=develop * Fix use_align in coalesce_tensor_op. test=develop * Adjust RNN apis arguments according to comments. test=develop * Polish documents for SimpleRNN apis. test=develop * Refine random seed in cudnn_lstm_op. Expose rnn params from sublayers to RNN. test=develop * Fix RNN saving for jit.save. Refine cudnn_lstm dropout behavior. test=develop * Fix doc of GRU. test=develop * Use ShareDataWith to avoid copying for cudnn_lstm_op test. test=develop * Remove updates on cudnn_lstm temporarily. test=develop * Use ShareDataWith to avoid copying for cudnn_lstm_op test. test=develop * Refine random seed in cudnn_lstm_op. test=develop * Fix test_lstm by adjust ConcreteProgram buffer getter. test=develop * Use create_parameter instead of create_var for rnn._flat_weight for static graph usage. test=develop * Remove W input for cudnn_lstm to pass unused_var_check. test=develop * Add test_predict for RNN unit tests coverage. test=develop * Fix code style of rnn. test=develop * Fix F.rnn usage in rnn.py. test=develop
-
由 Leo Chen 提交于
-
由 Aurelius84 提交于
* Fix save in eval mode * remove assert statement * fix test_partial_program failed * add more test * modify back into _train_program
-
由 chentianyu03 提交于
-
由 Jack Zhou 提交于
* test=kunlun; Add elementwise XPU OP kernel for KUNLUN core, including (but still cannot process common broadcast): * elementwise_div op * elementwise_max op * elementwise_mul op (with grad op) * elementwise_sub op (with grad op) * 0.05->0.01 * add xpu error message description;test=kunlun
-