- 30 8月, 2019 2 次提交
- 27 8月, 2019 1 次提交
-
-
由 Yi Liu 提交于
* supports multiple NCCL communicators preserved in NCCLCommContext test=develop * add ut for c_comm_init_all operator and fix cuda resource release problem test=develop
-
- 22 8月, 2019 1 次提交
-
-
由 chengduo 提交于
* increase test_parallel_executor_seresnext time limit test=develop * split test_parallel_executor_seresnext test=develop * temporally disable reduce_and_allreduce test because of the random failure. test=develop * split gpu and cpu test=develop
-
- 19 8月, 2019 2 次提交
-
-
由 Aurelius84 提交于
* add matrch_matrix_tensor op test=develop * fix ignore unittest if with_mkl=off test=develop * clean code and rm is_test param test=develop * modify API.spec test=develop * rm useless code in search_compute.h test=develop * modify api.spec test=develop * modify default_grad.spec test=develop * Add API test code test=develop * clean code in search_computer.h * modify PADDLE_ENFORCE and clean search_compute.h test=develop * fix code style test=develop
-
由 Zeng Jinle 提交于
-
- 18 8月, 2019 1 次提交
-
-
由 gongweibao 提交于
Unset unittests http_proxy env to avoid timeout.
-
- 12 8月, 2019 1 次提交
-
-
由 gongweibao 提交于
Polish fleet API to support cuda collective mode and nccl2 mode
-
- 06 8月, 2019 1 次提交
-
-
由 Kevin 提交于
* fix overflow by int32 mul test=develop * fix reference nullptr * fix codestyle test=develop * modify to point in ContextProjectFunctor test=develop * modify to point in ContextProjectFunctor test=develop * modify . to -> test=develop * add var_conv_2d op test=develop * edit api.spec test=develop * ignore unittest if with_mkl=off test=develop * fix python3 division test=develop * fix ignore unittest bug test=develop * remove useless code test=develop * modify api.spec test=develop * modify default_grad.spec test=develop
-
- 04 8月, 2019 1 次提交
-
-
由 Dong Daxiang 提交于
make listen and server as exclusive run
-
- 31 7月, 2019 1 次提交
-
-
由 Dong Daxiang 提交于
make dist unit test exclusive run
-
- 30 7月, 2019 1 次提交
-
-
由 chengduo 提交于
test=develop
-
- 28 7月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 26 7月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* first version memory optimize pass, test=develop * remove move_tensor_sharing_pass, test=develop * refine code comments, add unittests, test=develop * turn off memory_optimize by default, test=develop * follow huihuang's comments, test=develop * follow chengduoZH's comments, test=develop * fix grammar error, add const qualifier, fix pass_test exception message, test=develop * follow chengduoZH's comments 2nd, test=develop
-
- 25 7月, 2019 1 次提交
-
-
由 guru4elephant 提交于
* split test_dist_se_resnext.py into 4 testcases
-
- 24 7月, 2019 1 次提交
-
-
由 Bob Zhu 提交于
* extend matmul op to support multiple head multiplication With the support of multiple head, the multiplication of two big matrixes is split into multiplication of several (head_number) small matrixes. e.g. if Mat A is [3, 24] and Mat B is [24, 4], when multiple A and B with head_number as 4, Mat A will be split as 4 matrix of [3, 6] and Mat B will be 4 matrix of [6, 4]. The result of final matrix will be 4 matrix of [3, 4], i.e. [3, 16].
-
- 22 7月, 2019 1 次提交
-
-
由 guru4elephant 提交于
* split different comm method for mnist distributed training
-
- 18 7月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* feature/auto_growth_allocator, test=develop * add unittest of AlignedAllocator, test=develop * try to turn on auto_growth to test on CI, test=develop * fix segmentation fault in mixed_vector.h, test=develop * add unittests, test=develop
-
- 15 7月, 2019 1 次提交
-
-
由 guru4elephant 提交于
test=develop
-
- 12 7月, 2019 1 次提交
-
-
由 kh2se2013 提交于
2)add pip install coverage in Dockerfile.tmp test=develop
-
- 11 7月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* feature/buffer_shared_inplace, test=develop * refine code, test=develop * fix elementwise_add op cpu inplace and sum inplace bug, test=develop * add unittest and debug log, test=develop * fix parallel_executor scope bug, polish code, test=develop * fix sum op, activation op, single_in_place_inference bug, test=develop * remove kLocalExecScopeName, test=develop * fix unittest,test=develop * fix out_var first version bug, test=develop * follow comments,test=develop
-
- 27 6月, 2019 2 次提交
-
-
由 kh2se2013 提交于
* add WITH_COVERAGE option, default OFF test=develop * add coverage for python sdk test=develop * fix code style * fix COVERAGE_FILE path test=develop * remove coverage package test=develop * test = develop, run coverage as module
-
由 HaoRen 提交于
* fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop * supports collective training in executor * make fetch_list runable with variables, add more unittest for use_program_cache test=develop * fix comment test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plugin more strategy * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop * supports collective training in executor * make fetch_list runable with variables, add more unittest for use_program_cache test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * fix comment test=develop * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plugin more strategy * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * test=develop add collective op unittest standard * test=develop remove the test_collective directory * test=develop remove the test_collective directory * remove slicegather test * code format for reducescatter * update attr of shard_index_op * Modify macro nccl_helper * remove test without distribute * macro collective_helper * marcro update * test=develop update support python3.5 * test=develop change gpu memory use to 0.1 when test * test=develop update ut equal func * test=develop set flags to 1.5 * test=develop fix pickle dumple py35 * test=develop fix divide in slice and add sync_comm_stream update atol and rtol to 1e-05 rm shard_index op and test modify read input from file to read from memory remove origin_program in framework and add i/o in c_sync_calc_stream * test=develop update unittest sync operator I/O
-
- 26 6月, 2019 3 次提交
-
-
由 hutuxian 提交于
-
由 Jiabin Yang 提交于
-
由 Jiabin Yang 提交于
-
- 25 6月, 2019 1 次提交
-
-
由 Hongyu Liu 提交于
* sequnce mask support max length tensor input; test=develop * add rnn_impl.py; test=develop * add basic gru lstm unittest; test=develop * fix api spec; test=develop * fix sequence_mask op bug; test=develop test=document_preview * change +-*x to elmentwise_op; test=develop * add mkl flag; test=develop * fix rnn impl bug; test=develop * update api spec; test=develop * fix doc bug; test=develop * fix lstm bugs; test=develop
-
- 21 6月, 2019 1 次提交
-
-
由 guru4elephant 提交于
* add more print function for timeout issue, make timeout value larger
-
- 20 6月, 2019 1 次提交
-
-
由 Jiabin Yang 提交于
* test=develop, fix test_imperative_transformer and ocr * test=develop, remove ocr recovery part
-
- 19 6月, 2019 2 次提交
-
-
由 Qiao Longfei 提交于
-
由 Jiabin Yang 提交于
* test=develop, add add_multi_gpu_install_check * test=develop, refine warning doc * test=develop, refine warning doc * test=develop, refine warning doc * test=develop, support multi cpu
-
- 18 6月, 2019 1 次提交
-
-
由 chengduo 提交于
* remove nccl dep when the number of GPU is 1 test=develop
-
- 12 6月, 2019 1 次提交
-
-
由 tangwei12 提交于
* fix save/load in Fleet * add UT framework of Fleet
-
- 06 6月, 2019 3 次提交
-
-
由 gongweibao 提交于
-
由 Huihuang Zheng 提交于
test=develop
-
由 gongweibao 提交于
-
- 31 5月, 2019 1 次提交
-
-
由 lilong12 提交于
Split the unittest test_dist_mmist into multiple unittests (test_dist_mnist, test_dist_mnist_nccl and test_dist_mnist_lars) to avoid timeout (#17707)
-
- 30 5月, 2019 2 次提交
-
-
由 Hongyu Liu 提交于
-
由 Hongyu Liu 提交于
-
- 29 5月, 2019 1 次提交
-
-
由 Jiabin Yang 提交于
* test=develop, add ocr in dygraph test * test=develop, add cudnn determinist * test=develop, remove useless code * test=develop, fix cmake error
-