- 08 Nov, 2022 1 commit

Committed by Nyakku Shigure:
* [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition
* fix an increment
 
- 03 Nov, 2022 1 commit

Committed by Nyakku Shigure:
* [CodeStyle][py2][U008] remove unnecessary args in `super()`
* remove remained args
* revert changes in test_pylayer_op
* Revert "revert changes in test_pylayer_op" This reverts commit ff185a9ae738afac3b0264f61bde6c6b7f72e7c4.
* revert some changes in example code
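
The U008 change above replaces the Python 2 style `super(Class, self)` call with the zero-argument form available in Python 3. A minimal sketch of the two spellings follows; the `Base` and `Child` classes are hypothetical and do not come from the repository.

```python
class Base:
    def __init__(self, name):
        self.name = name

    def describe(self):
        return f"Base({self.name})"


class Child(Base):
    def __init__(self, name, size):
        # Python 2 compatible spelling: super(Child, self).__init__(name)
        # Python 3 zero-argument form, equivalent inside a method body:
        super().__init__(name)
        self.size = size

    def describe(self):
        # The zero-argument form also works for ordinary method calls.
        return f"Child of {super().describe()} with size {self.size}"
```

The zero-argument form only works inside a function defined in a class body, which may be one reason some call sites are left in the explicit form.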
 
- 01 Nov, 2022 1 commit

Committed by Nyakku Shigure:
* [CodeStyle][E711] use `is`/`is not` for comparison with `None`
* `self.assertTrue($A is None)` -> `self.assertIsNone($A)`
* `self.assertTrue($A is not None)` -> `self.assertIsNotNone($A)`
* `self.assertFalse($A is None)` -> `self.assertIsNotNone($A)`
* `self.assertEqual($A, None)` -> `self.assertIsNone($A)`
* `self.assertNotEqual($A, None)` -> `self.assertIsNotNone($A)`
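
The E711 rewrites listed above matter because `==` dispatches to `__eq__`, which a class can override, while `is` compares identity with the `None` singleton. A short sketch of the difference, using a hypothetical `AlwaysEqual` class and `is_missing` helper that are not part of the repository:

```python
import unittest


class AlwaysEqual:
    """Hypothetical class whose __eq__ claims equality with everything."""

    def __eq__(self, other):
        return True


def is_missing(value):
    # Identity check: true only for the None singleton itself.
    return value is None


class NoneChecks(unittest.TestCase):
    def test_identity_vs_equality(self):
        obj = AlwaysEqual()
        # Equality is fooled by the overridden __eq__ ...
        self.assertTrue(obj == None)  # noqa: E711
        # ... while the identity-based assertions are not.
        self.assertIsNotNone(obj)
        self.assertIsNone(None)
```

`assertIsNone` and `assertIsNotNone` also produce clearer failure messages than `assertTrue(x is None)`, since they report the offending value.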
 
- 23 Oct, 2022 1 commit

Committed by Nyakku Shigure:
* update config
* re-blacken python code
* temporarily disable date and diff_py_file
* skip a format
 
- 27 Sep, 2022 1 commit

Committed by Nyakku Shigure:
* [CodeStyle] remove all future import
* revert test_error.py
* restore future import in example code
 
- 05 Jun, 2022 1 commit

Committed by Sing_chan:
* use yapf to format all python file
* yapf exclude two unittests file for they rely on writing and reading file, and format will break them
* disable diff_py_file because too many diff files cause command following failed
 
- 28 Jan, 2022 1 commit

Committed by Fan Zhang:
* [PSLIB] Add Metrics Module, Support User-defined Add Metric
* [PSLIB] Modify According to CI
* [PSLIB] Modify According to CI
* [PSLIB] Modify According to CI
* [PSLIB] Modify According to CI Coverage
* [PSLIB] Modify According to CI
* [PSLIB] Modify According to CI
* [PSLIB] Modify According to CI
* [PSLIB] Modify According to CI
* [PSLIB] Modify According to CI
* [PSLIB] Modify According to CI Coverage
* [PSLIB] Modify According to CI Coverage
* [PSLIB] Modify According to CI Coverage
* modify role_maker
* update CMakeLists.txt
 
- 18 Jan, 2022 1 commit

Committed by kuizhiqing
 
- 28 Sep, 2021 1 commit

Committed by kuizhiqing
 
- 16 Jun, 2021 1 commit

Committed by Jiangxinz
 
- 11 Mar, 2021 1 commit

Committed by Thunderbrook:
* heter bug
* format
* format
 
- 24 Feb, 2021 1 commit

Committed by Thunderbrook:
* push multi node
* multi node
* MultiThread
* remove log
* solve bug in 30829
 
- 29 Sep, 2020 1 commit

Committed by lilong12:
* add gloo initializer, test=develop
 
- 28 Sep, 2020 3 commits
- 30 Aug, 2020 1 commit

Committed by Chengmo:
* Support Heter Parameter Server
 
- 06 Aug, 2020 1 commit

Committed by Thunderbrook:
* add heter ps mode
* code style test=develop
* add with_pslib test=develop
* unitest test=develop
* code style test=develop
* code style test=develop
* code style test=develop
* code style test=develop
* code style test=develop
* code style test=develop
* code style test=develop
* code style test=develop
* test monitor test=develop
* prepare trainer test=develop
* code style test=develop
 
- 30 Jul, 2020 1 commit

Committed by tangwei12: Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957)
* Integrated Trainer of Parameter Server
 
- 04 Jun, 2020 1 commit

Committed by xujiaqi01:
* add mock barrier all test=develop
* fix test=develop
* fix test=develop
* fix test=develop
 
- 30 Apr, 2020 1 commit

Committed by xujiaqi01:
* add timeout and http store in communication, add revert and confirm in fleet
* test=develop
 
- 18 Mar, 2020 1 commit
- 17 Mar, 2020 1 commit

Committed by tangwei12: Fleet Parameter Server API Integrated
 
- 23 Feb, 2020 1 commit

Committed by tianshuo78520a
 
- 02 Feb, 2020 1 commit

Committed by xujiaqi01:
* add GeneralRoleMaker which is for general usage
* test=develop
 
- 20 Nov, 2019 1 commit

Committed by Dong Daxiang: test=develop
 
- 31 Oct, 2019 1 commit

Committed by Chengmo:
* fix PaddleCloud Role maker & add warning in distribute transpiler & change rpc_retry_times
 
- 23 Sep, 2019 1 commit

Committed by tangwei12:
* optimize cloud rolemaker, test=develop
 
- 06 Sep, 2019 1 commit

Committed by 123malin:
* fleet api add input check, test=develop
 
- 16 Aug, 2019 1 commit

Committed by gongweibao: node_num is not needed for users, so remove them and fix the bugs about it!
 
- 12 Aug, 2019 1 commit

Committed by gongweibao: Polish fleet API to support cuda collective mode and nccl2 mode
 
- 25 Jul, 2019 1 commit

Committed by guru4elephant: refine launch_ps and role_maker
 
- 22 Jul, 2019 1 commit

Committed by tangwei12: do some odd jobs, test=develop
 
- 10 Jul, 2019 1 commit

Committed by guru4elephant:
* upgrade collective fleet api
 
- 08 Jul, 2019 1 commit

Committed by guru4elephant:
* add random port
 
- 02 Jul, 2019 1 commit

Committed by guru4elephant: make fleet support mpi job submit directly.
 
- 27 Jun, 2019 1 commit

Committed by HaoRen:
* fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop
* supports collective training in executor
* make fetch_list runable with variables, add more unittest for use_program_cache test=develop
* fix comment test=develop
* use unique name for nccl_id
* supports output to stream in program_to_code
* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code
* set op role in collective training
* add collective op role
* remove orig file
* add build optimizer by strategy
* add collective strategy
* refine collective strategy
* add multi-process role maker
* refine strategy building factory so that we can easily plugin more strategy
* scale loss grad in collective sgd transpiler
* add support for distributed fc
* code format
* revert some features for dist fc
* add support for distributed fc training
* fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop
* supports collective training in executor
* make fetch_list runable with variables, add more unittest for use_program_cache test=develop
* use unique name for nccl_id
* supports output to stream in program_to_code
* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code
* set op role in collective training
* add collective op role
* fix comment test=develop
* remove orig file
* add build optimizer by strategy
* add collective strategy
* refine collective strategy
* add multi-process role maker
* refine strategy building factory so that we can easily plugin more strategy
* scale loss grad in collective sgd transpiler
* add support for distributed fc
* code format
* revert some features for dist fc
* add support for distributed fc training
* test=develop add collective op unittest standard
* test=develop remove the test_collective directory
* test=develop remove the test_collective directory
* remove slicegather test
* code format for reducescatter
* update attr of shard_index_op
* Modify macro nccl_helper
* remove test without distribute
* macro collective_helper
* marcro update
* test=develop update support python3.5
* test=develop change gpu memory use to 0.1 when test
* test=develop update ut equal func
* test=develop set flags to 1.5
* test=develop fix pickle dumple py35
* test=develop fix divide in slice and add sync_comm_stream update atol and rtol to 1e-05 rm shard_index op and test modify read input from file to read from memory remove origin_program in framework and add i/o in c_sync_calc_stream
* test=develop update unittest sync operator I/O
 
- 23 Jun, 2019 1 commit

Committed by guru4elephant:
* fix paddle cloud role maker bug
 
- 17 Jun, 2019 1 commit

Committed by guru4elephant: add paddle cloud role maker for customized usage, note this is only for industrial users that have cloud environment pre-configuration (#18121)
add paddle cloud role maker for specific cloud usage. This pr will simplifies user's configuration in distributed training.
 
- 12 Jun, 2019 1 commit

Committed by tangwei12:
* fix save/load in Fleet
* add UT framework of Fleet
 
