- 27 Jun, 2019 1 commit
-
-
Committed by HaoRen
* fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop * supports collective training in executor * make fetch_list runable with variables, add more unittest for use_program_cache test=develop * fix comment test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plugin more strategy * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop * supports collective training in executor * make fetch_list runable with variables, add more unittest for use_program_cache test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * fix comment test=develop * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plugin more strategy * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * test=develop add collective op unittest standard * test=develop remove the test_collective directory * test=develop remove the test_collective directory * remove slicegather test * code format for 
reducescatter * update attr of shard_index_op * Modify macro nccl_helper * remove test without distribute * macro collective_helper * marcro update * test=develop update support python3.5 * test=develop change gpu memory use to 0.1 when test * test=develop update ut equal func * test=develop set flags to 1.5 * test=develop fix pickle dumple py35 * test=develop fix divide in slice and add sync_comm_stream update atol and rtol to 1e-05 rm shard_index op and test modify read input from file to read from memory remove origin_program in framework and add i/o in c_sync_calc_stream * test=develop update unittest sync operator I/O
-
- 25 Jun, 2019 1 commit
-
-
Committed by chengduo
* fix default value of fluid.memory_optimize test=develop * fix api.spec test=develop
-
- 31 May, 2019 1 commit
-
-
Committed by tangwei12
* add example to get_startup_program() * fix example to get_startup_program()
-
- 30 May, 2019 1 commit
-
-
Committed by yaoxuefeng
-
- 29 May, 2019 2 commits
-
-
Committed by gongweibao
-
Committed by tangwei12
* fix doc in transpiler, test=develop
-
- 27 May, 2019 1 commit
-
-
Committed by gongweibao
-
- 24 May, 2019 1 commit
-
-
Committed by Michał Gallus
* fuse mul and elementwise add to fc * Reimplement the FC forward operator * Fix FC MKLDNN integration by transposing weights * Add FC MKLDNN Pass test=develop * FC MKLDNN Pass: change memcpy to std::copy * Fix MKLDNN FC handling of mismatch input and weights dims * Lower tolerance for MKL-DNN in resnet50 test test=develop * Adjust FC to support MKLDNN Op placement test=develop * Adjust Placement Op to set use_mkldnn attribute for graph test=develop * MKLDNN FC: fix weights format so that gemm version is called test=develop * FC MKLDNN: Remove tolerance decrease from tester_helper * FC MKL-DNN: Refactor the code, change input reorder to weight reorder * MKL-DNN FC: Introduce operator caching test=develop * FC MKL-DNN: Fix the tensor type in ExpectedKernelType test=develop * FC MKL-DNN: fix style changes test=develop * FC MKL-DNN: fallback to native on non-supported dim sizes test=develop * FC MKLDNN: fix CMake paths test=develop * FC MKLDNN: Refine placement pass graph mkldnn attribute test=develop * Fix Transpiler error for fuse_conv_eltwise test=develop * Fix missing STL includes in files test=develop * FC MKL-DNN: Enable new output size computation Also, refine pass to comply with newest interface. 
test=develop * FC MKL-DNN: enable only when fc_mkldnn_pass is enabled * FC MKL-DNN: Allow Weights to use oi or io format * FC MKL-DNN: Adjust UT to work with correct dims test=develop * Enable MKL DEBUG for resnet50 analyzer test=develop * FC MKL-DNN: Improve Hashing function test=develop * FC MKL-DNN: Fix shape for fc weights in transpiler * FC MKL-DNN: Update input pointer in re-used fc primitive * Add log for not handling fc fuse for unsupported dims test=develop * FC MKL-DNN: Move transpose from pass to Op Kernel test=develop * FC MKL-DNN: Disable transpose in unit test test=develop * FC MKL-DNN: Remove fc_mkldnn_pass from default list * Correct Flag for fake data analyzer tests test=develop * FC MKL-DNN: Add comment about fc mkldnn pass disablement test=develop * FC MKL-DNN: Disable fc in int8 tests test=develop
-
- 23 May, 2019 2 commits
-
-
Committed by Qiao Longfei
* fix distribute doc
-
Committed by Qiao Longfei
Async exe support communicator
-
- 20 May, 2019 1 commit
-
-
Committed by liuwei1031
* improve the doc of paddle.fluid.memory_optimize, test=develop * fix typo, test=develop
-
- 16 May, 2019 1 commit
-
-
Committed by liuwei1031
* improve the API Sample of DataFeeder, memory_optimize and release_memory, test=develop * update API.spec, test=develop, test=document_preview * tweak the code format of feed API, test=develop * update API.spec, test=develop * improve doc for DataFeeder and default_main_program, test=develop
-
- 26 Apr, 2019 1 commit
-
-
Committed by tangwei12
-
- 25 Apr, 2019 1 commit
-
-
Committed by tangwei12
* implement distributed transpiler with fleet
-
- 27 Mar, 2019 1 commit
-
-
Committed by Qiao Longfei
-
- 25 Mar, 2019 1 commit
-
-
Committed by Qiao Longfei
-
- 23 Mar, 2019 1 commit
-
-
Committed by Qiao Longfei
-
- 04 Mar, 2019 2 commits
- 27 Feb, 2019 2 commits
- 20 Feb, 2019 1 commit
-
-
Committed by tangwei12
* fix params with only 1 dim * test=develop
-
- 14 Feb, 2019 2 commits
- 08 Feb, 2019 2 commits
-
-
Committed by Qiao Longfei
-
Committed by Qiao Longfei
-
- 06 Feb, 2019 1 commit
-
-
Committed by Qiao Longfei
-
- 31 Jan, 2019 1 commit
-
-
Committed by dzhwinter
-
- 30 Jan, 2019 2 commits
- 29 Jan, 2019 1 commit
-
-
Committed by dzhwinter
-
- 25 Jan, 2019 1 commit
-
-
Committed by gongweibao
-
- 24 Jan, 2019 1 commit
-
-
Committed by Wu Yi
-
- 23 Jan, 2019 1 commit
-
-
Committed by tangwei12
checkpoint for distributed training.
-
- 08 Jan, 2019 1 commit
-
-
Committed by Qiao Longfei
-
- 28 Dec, 2018 1 commit
-
-
Committed by Qiao Longfei
test=develop
-
- 27 Dec, 2018 1 commit
-
-
Committed by haowang101779990
test=develop
-
- 26 Dec, 2018 1 commit
-
-
Committed by sneaxiy
add module cleanup test=develop
-
- 18 Dec, 2018 2 commits
-
-
Committed by JiabinYang
-
Committed by dzhwinter
* follow comments. test=develop * Fix typo * fix compile error. test=develop * merge develop branch. test=develop * Remove set_equal * Polish code * Delete unused functions test=develop * polish code. test=develop * follow comment * polish code. * fix windows compile error. test=develop * fix op handle. * rerun ci. test=develop * rerun ci. test=develop * rerun macci. test=develop * polish code. test=develop * rewrite sort code. test=develop * remove unused code. test=develop * fix tests. test=develop * fix conflict. test=develop * follow comment. test=develop * merge develop branch. test=develop * fix tests. test=develop * remove ToTypeIndex. test=develop * rerun ci. test=develop
-