- 28 Jun, 2019 7 commits
-
-
Committed by Leo Zhao
1. Some key generation methods are not aligned with PR#17965. 2. Extend the ptr lifetime so that memory is not released if SetBlob fails; otherwise it causes a core dump. test=develop
-
Committed by tianshuo78520a
* change nproc 8
-
Committed by Zeng Jinle
* add_elementwise_add_inplace_test,test=develop * rename file, test=develop
-
Committed by Jiabin Yang
* test=develop, add some comments for Program.clone * test=develop, add API.spec * test=develop, refine comments * refine Program doc and clone doc * test=develop, refine doc
-
Committed by Jiabin Yang
-
Committed by chengduo
* add cuda_is_available test=develop * Fix api.spec test=develop * fix api doc test=develop
-
Committed by Wojciech Uss
test=develop
-
- 27 Jun, 2019 12 commits
-
-
Committed by lujun
Fix dygraph show style for FluidDoc.
-
Committed by HaoRen
* add dependency of collective_helper * test=develop fix dependency of collective_helper
-
Committed by 翟飞跃
-
Committed by chengduo
* update pe reduce config test=develop * drop the local_exe_scopes of the previous parallel_executor test=develop
-
Committed by tangwei12
* add is_runnning in communicator, test=develop
-
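Committed by tianshuo78520a
A dockerfile needs to be generated during the avx_noavx build. After generating the whl with the combine_avx_noavx parameter, the image could not be built because no dockerfile had been generated. An option to generate the dockerfile needs to be added.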
-
Committed by kh2se2013
* add WITH_COVERAGE option, default OFF test=develop * add coverage for python sdk test=develop * fix code style * fix COVERAGE_FILE path test=develop * remove coverage package test=develop * test = develop, run coverage as module
-
Committed by Michał Gallus
test=develop
-
Committed by HaoRen
* fix prepare context redundant code problem, optimize executor by caching create_variables test=develop * supports collective training in executor * make fetch_list runnable with variables, add more unittest for use_program_cache test=develop * fix comment test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plug in more strategies * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * fix prepare context redundant code problem, optimize executor by caching create_variables test=develop * supports collective training in executor * make fetch_list runnable with variables, add more unittest for use_program_cache test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * fix comment test=develop * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plug in more strategies * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * test=develop add collective op unittest standard * test=develop remove the test_collective directory * test=develop remove the test_collective directory * remove slicegather test * code format for reducescatter * update attr of shard_index_op * Modify macro nccl_helper * remove test without distribute * macro collective_helper * macro update * test=develop update support python3.5 * test=develop change gpu memory use to 0.1 when test * test=develop update ut equal func * test=develop set flags to 1.5 * test=develop fix pickle dumple py35 * test=develop fix divide in slice and add sync_comm_stream update atol and rtol to 1e-05 rm shard_index op and test modify read input from file to read from memory remove origin_program in framework and add i/o in c_sync_calc_stream * test=develop update unittest sync operator I/O
-
Committed by Sylwester Fraczek
add prior_box quantization code add scale algo rules for prior box test=develop
-
Committed by lidanqing
* some fixes for int8 mobilenet_ssd tester test=develop * change wrong data file name test=develop * change test images bin file from 200 images to 100 images * change directory existence to file existence during downloading test=develop * reuse download_data test=develop * run full dataset when iterations=0 test=develop
-
Committed by Jacek Czaja
* - Reusing of reorder used in elementwise_add_mkldnn - Added MKL-DNN sum prim reusing test=develop - Compilation fixes test=develop - Yet another compilation fix test=develop - Yet another compilation fix test=develop - Yet another linking fix test=develop - Final compilation fix test=develop - lint fixes test=develop - Lint fixes test=develop * - Fixes after review test=develop
-
- 26 Jun, 2019 9 commits
-
-
Committed by qingqing01
* Simplify multi_box_head API in detection.py and remove assign op.
-
Committed by hutuxian
-
Committed by Zeng Jinle
* refine cuda place error msg, test=develop * use LOG(ERROR)+exit(-1), test=develop
-
Committed by Tao Luo
test=develop
-
Committed by Jiabin Yang
-
Committed by Yibing Liu
* Update lamb optimizer test=develop, test=document_preview * Regenerate api spec test=develop, test=document_preview
-
Committed by chengduo
test=develop
-
Committed by whs
Socket can't be pickled. test=develop
-
Committed by Jiabin Yang
-
- 25 Jun, 2019 9 commits
-
-
Committed by tensor-tang
* fix softrelu doc test=develop * update API doc test=develop
-
Committed by Jiabin Yang
* test=develop, add_install_check_for_multigpu * test=develop, refine code to use cuda_devices
-
Committed by Zeng Jinle
-
Committed by Hongyu Liu
* sequence mask support max length tensor input; test=develop * add rnn_impl.py; test=develop * add basic gru lstm unittest; test=develop * fix api spec; test=develop * fix sequence_mask op bug; test=develop test=document_preview * change +-*x to elementwise_op; test=develop * add mkl flag; test=develop * fix rnn impl bug; test=develop * update api spec; test=develop * fix doc bug; test=develop * fix lstm bugs; test=develop
-
Committed by Jiabin Yang
* Revert "Add multi gpu install check (#18229)" This reverts commit 61ed06b2. * test=develop, start ci
-
Committed by Qiao Longfei
* optimize communicator merge sparse gradient test=develop * revert multithread selected rows merge add test=develop * follow comment test=develop
-
Committed by Jie Fang
test=develop
-
Committed by chengduo
* fix default value of fluid.memory_optimize test=develop * fix api.spec test=develop
-
Committed by Zhaolong Xing
test=develop
-
- 24 Jun, 2019 3 commits
-
-
Committed by Yibing Liu
* Use TensorCopySync for sequence_unpad op test=develop * Fix the tensor memory alloc bug test=develop
-
Committed by hutuxian
-
Committed by chengduo
* clean build_strategy test=develop * DataBalanceOpHandle has been removed test=develop * debug * update build_strategy. test=develop
-