- 23 10月, 2020 2 次提交
-
-
由 Huihuang Zheng 提交于
Recently, test_parallel_executor_test_while_train randomly failed on CI. On all CI logs, it showed NCCL initialization failed or cusolver initialization failed. I found online that those failure is usually caused by GPU shortage. Those API calls CUDA APIs directly so it shouldn't be the problem of allocator. It may be somewhere in PaddlePaddle increases GPU usage. However, I run this test for 1000 times on my machine and the CI machine, either of them can reproduce the random failure. Maybe there is something related to the environment only happened in test env. To verify my assumption that somewhere in PaddlePaddle increases GPU usage and also fix this CI, I decreased the batch_size to see whether the random failure disappears in test env.
-
由 tianshuo78520a 提交于
-
- 22 10月, 2020 11 次提交
-
-
由 pangyoki 提交于
-
由 WangXi 提交于
-
由 Double_V 提交于
-
由 Feiyu Chan 提交于
* fix strided_slice_op's GetExpectedKernelType when input tensor is at CUDAPinnedPlace * add unittest for tensors in cuda pinned place * skip test for cuda pinned place on cpu machines
-
由 Zhou Wei 提交于
* fix optimizer init * Enhance the detection of whether to keep the build directory * Enhance the detection of whether to keep the build directory
-
由 Leo Chen 提交于
* fix bug of fetch_async_op_handle * revert some changes of test_buffer_shared_memory_reuse_pass * revert some changes of test_buffer_shared_memory_reuse_pass
-
由 Aurelius84 提交于
* remove some judgement * fix len(outputs) == 1
-
由 LielinJiang 提交于
* update hapi predict interface * fix code style * fix docs * fix docs * fix docs * update docs * fix codes style * fix unittest * fix unittest * fix coverage
-
由 WangXi 提交于
-
由 gongweibao 提交于
test=develop
-
由 tianshuo78520a 提交于
* update dockerfile * update dockerfile * update dockerfile * update dockerfile * add opencv in ci * update cidockerfile * test nccl * fix diff * fix dockerfile * update ubuntu nccl2.7.8 * update ubuntu nccl2.7.8
-
- 21 10月, 2020 15 次提交
-
-
由 Wilber 提交于
-
由 Chen Weihang 提交于
-
由 lilong12 提交于
* modify ut cmakefile, test=develop
-
由 swtkiwi 提交于
-
由 cnn 提交于
* rename manual_seed to seed * rename xxx1d-->xxx1D, xxx2d-->xxx2D, xxx3d-->xxx3D * rename manual_seed --> seed * do not rename .cc, .cu and .h file * rename manual_seed --> seed * rename manual_seed --> seed * rename manual_seed --> seed * rename manual_seed --> seed * disable_static on doc example code * donot change manual_seed on generator * add enable_static on sample code * convert python/paddle/fluid/layers/nn.py to bak * fix typo * fix code style * fix seed to manual_seed when call functions of Generator() * fix bug
-
由 Zhou Wei 提交于
-
由 liu zhengxi 提交于
-
由 pangyoki 提交于
* add static_mode_white_list * add Mac CI static list * add Win CI white_list * add Coverage and Py3 CI white_list, add test_unittest
-
由 danleifeng 提交于
* nccl init support hostname and ip; test=develop
-
由 wangguanzhong 提交于
-
由 Pei Yang 提交于
-
由 Double_V 提交于
-
由 李灿 提交于
-
由 Zhou Wei 提交于
-
由 zhulei 提交于
* Add new api: is_tensor * Add new api: is_tensor * Add new api: is_tensor * Add new api: is_tensor
-
- 20 10月, 2020 12 次提交
-
-
由 lilong12 提交于
-
由 tianshuo78520a 提交于
* Add build paddle inference;test=document_fix * Add build paddle inference;test=document_fix
-
由 wangguanzhong 提交于
* fix generate_proposal_labels in cascade-rcnn series model, test=develop * fix example code & unittest, test=develop * update code from review comments, test=develop
-
由 YUNSHEN XIE 提交于
-
由 LoveAn 提交于
-
由 Leo Chen 提交于
* fill_constant supports nan and inf * add ut
-
由 LielinJiang 提交于
* add pil backend
-
由 Aurelius84 提交于
* refine code of DygraphToStaticAst * add __init__ function
-
由 zhupengyang 提交于
-
由 tianshuo78520a 提交于
-
由 hong 提交于
-
由 zhang wenhui 提交于
-