- 17 Dec, 2020 1 commit
Committed by ShenLiang
* Fix the download bug in the case of multiple machines (#29551)
* fix the download bug
* add sort for ips
* Fix bug of matmul_v2 for broadcast case (#29599)
* fix bug of matmul_v2 for broadcast
* Rebuild group automatically in dynamic graph distributed (#29255)
* add tensor_indices in AssignGroupBySize
* add rebuild group in reducer
* fix error message of gather nd (#29521)
-
- 08 Dec, 2020 1 commit
Committed by ShenLiang
* fix the bug of reducer in embedding
-
- 04 Dec, 2020 1 commit
Committed by ShenLiang
-
- 03 Dec, 2020 1 commit
Committed by ShenLiang
* Change the api of DataParallel and Fleet (#29224)
-
- 27 Nov, 2020 1 commit
Committed by ShenLiang
* add reducer
* refine event for memorycopy
* add concat&split for allreduce
* apply concat & split for fuse tensor
* fix nccl dep
* fix the unittest, compile problem and ddp initialize problem
* fix unittest for mac & add some comments & solve the repeated param in sublayers
* fix unittest for windows & fix document
-
- 26 Nov, 2020 1 commit
Committed by gongweibao
-
- 24 Nov, 2020 1 commit
Committed by Chen Weihang
* polish parallel api impl & doc details
* add unittest for coverage
* remove spawn test in py2.7
* add parallel api into white list
-
- 14 Oct, 2020 1 commit
Committed by Chen Weihang
-
- 29 Sep, 2020 2 commits
Committed by Chen Weihang
* remove data parallel scale loss & apply collective_grads
* move apply in minimize
* fix failed unittests
-
Committed by Chen Weihang
* polish Program api doc & example
* polish CompiledProgram api doc & example
* polish ParallelEnv api doc & examples
* polish details, test=document_fix
* polish program doc details, test=document_fix
* polish details, test=document_fix
* fix note format error, test=document_fix
* add lost example, test=document_fix
* fix lost example, test=document_fix
-
- 28 Sep, 2020 1 commit
Committed by Leo Chen
* remove inplace argument
* fix sample code
* fix sample code
-
- 23 Sep, 2020 1 commit
Committed by Chen Weihang
* init commit of new save/load
* fix failed unittests
* fix save_load_v2 unittest failed
* fix failed unittest & polish doc
* add tests for coverage
* add more tests & move static apis
* fix example code error
* polish example code
* fix detail example code problem
-
- 03 Sep, 2020 1 commit
Committed by Chen Weihang
* update set_dict method name & add aliases
* fix var name error
* fix alias formats
* use set_state_dict in unittest
* add decorator solve compatible problem
* polish decorator
* replace layer set_state_dict by patched method
* remove import monkey patch layer
* fix import function error
* add unittest for coverage
-
- 02 Sep, 2020 1 commit
Committed by Yang Zhang
-
- 31 Aug, 2020 1 commit
Committed by Chen Weihang
* remove backend argument of init_parallel_env
* remove keep name table in transformer
* add cpu version check
* add skip unittest for init_parallel_env
* polish doc: remove func use & update example
-
- 28 Aug, 2020 1 commit
Committed by Chen Weihang
* add dygraph parallel run interface
* polish implement & unified env property name
* add print config arg
* refactor init_parallel_env function
* Compatible with multiprocessing and launch modes
* set default trainer start port
* support run in python 2
* polish python2 support code
* remove python2 support
* refine launch import
* polish dome design details
* refactor api implementation & path
* use new method _set_expected_place
* add spawn unittest framework & mnist test
* add more unittests & doc
* fix unittest failed
* polish english doc
* self review and polish details
* refactor code by reviewer's comments
* fix unittest failed
* fix parallel_env unittest
* fix several typos
* fix error introduced when fixing typos
* add unpublic note for start_processes
* polish details by xiaoguang's comment
* verify correctly when spawn nprocs=-1
* refactor spawn & init_parallel_env design
* polish doc details
* open spawn unittests
* try to fix doc compile error
* try to fix unknown doc format error
* add skip unittest when not gpu
-
- 19 Aug, 2020 1 commit
Committed by Chen Weihang
-
- 14 Aug, 2020 1 commit
Committed by Yang Zhang
* Upgrade `no_grad` decorator test=develop
  - match torch decorator usage (i.e., with parentheses)
  - handle generator functions
  - add `paddle.no_grad` alias
* Switch from `functools` to `decorator`, which preserves the signature
* Reword decorator usage note
-
- 05 Jun, 2020 1 commit
Committed by Chen Weihang
* support selectedrows allreduce in multi-cards dygraph, test=develop
* remove useless import modules in unittests, test=develop
* add nccl cmake to get nccl version, test=develop
* add if-condition to compile correctly, test=develop
* add detail version parsing for old nccl, test=develop
* polish cmake details, test=develop
* fix remove test cmake error, test=develop
* fix cmake condition, test=develop
* change unittest cmake list, test=develop
* fix unittest cmake rule, test=develop, test=framep0
-
- 14 May, 2020 1 commit
Committed by Cindy Cai
* test=develop, test=document_fix
* test=develop, test=document_fix
Co-authored-by: Nswtkiwi <1208425345@qq.com>
-
- 03 Apr, 2020 1 commit
Committed by zhongpu
* support if logic for Variable in dygraph, test=develop
* fix test_learning_rate_scheduler.py, test=develop
* fix optest, test=develop
* fix error message, test=develop
* fix optest,test=develop
* fix comment, test=develop
-
- 12 Mar, 2020 1 commit
Committed by Chen Weihang
* add dygraph parallel env doc, test=develop
* polish details, test=develop, test=document_fix
* fix examples error in other apis, test=develop
* fix more example error in other api, test=develop
* add white list for gpu examples, test=develop, test=document_fix
-
- 15 Jan, 2020 1 commit
Committed by hong
* DataParallel state dict doesn't include _layers.; test=develop
* add unittest of data parallel; test=develop
* add load state test; test=develop
-
- 08 Jan, 2020 1 commit
Committed by zhongpu
* modify fc to linear in sample code, test=develop
* remove FC, test=develop
* remove warnings, test=develop
* drop fluid/imperative/README.md , test=develop
* change fc to linear, test=develop
* polish code style, test=develop
-
- 05 Dec, 2019 1 commit
Committed by Leo Chen
* test=develop, fix docker with paddle nccl problem
* don't expose numerous Tensor.set(), test=develop
* fix condition, test=develop
* fix float16 bug, test=develop
* feed should be Tensor or np.array, not Variable or number, test=develop
* use forcecast to copy numpy slice to new array, test=develop
* remove float16-uint16 hacking, test=develop
* add variable method to varbase and refactor to_variable to support return varbase
* support kwargs in varbase constructor
* add VarBase constructor to support default python args
* refine varbase initial method
* reset branch
* fix ut for change VarBase error info to PaddleEnforce
* cherry is parameter change before
* overload isinstance to replace too many change of is_variable
* rm useless files
* rm useless code merged by git
* test=develop, fix some ut failed error
* test=develop, fix test_graph_wrapper
* add some tests, test=develop
* refine __getitem__, test=develop
* add tests, test=develop
* fix err_msg, test=develop
-
- 29 Nov, 2019 1 commit
Committed by Chen Weihang
* add param & grad shape check for sgd op
* add _reshape_inplece interface for dygraph parallel
* refine unittest based paddle/models scripts, test=develop
* add unittest for parallel grad fuse, test=develop
-
- 05 Sep, 2019 1 commit
Committed by Jiabin Yang
* refactor dygraph,test=develop
* fix failed unittest,test=develop
* polish code,test=develop
* check windows ci error,test=develop try to fix windows ci error by np.allclose,test=develop
* polish vlog and profiler, test=develop
* try to fix preceding ops order,test=develop
* test transformer in windows ci, test=develop
* use python c-api to speed up tracer.trace,test=develop
* test=develop, fix docker with paddle nccl problem
* test=develop, add ut for debug string and gradient_accumulator
* test=develop, add tests for layer/gradient_accumulator/prepared_op
* test=develop, fix compile error for test_prepared_op
* test=develop, add more ut for dygraph
* test=develop, create API.spec for dygraph api change
* test=develop, refactor name to make it easier to understand
* test=develop, refactor name to make it easier to understand
* test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ
* test=develop, fix ut failed on parallel se-resnext
* test=develop, change one more PADDLE_ENFORCE
-
- 22 Aug, 2019 1 commit
Committed by chengduo
* update parallel.py test=develop
-
- 31 Jul, 2019 1 commit
Committed by chengduo
* update parallel.py test=develop
-
- 24 May, 2019 1 commit
Committed by chengduo
* This PR adds broadcast for multi-process. It can be used in dynamic graph to broadcast parameters.
-
- 17 May, 2019 1 commit
Committed by Yan Xu
* add var grad hook test=develop
-
- 25 Apr, 2019 1 commit
Committed by Yan Xu
implement dygraph.parallel.DataParallel to hook reduce op.
-
- 04 Apr, 2019 1 commit
Committed by Yan Xu
add NCCLParallelContext for parallel dygraph
-