- 11 7月, 2019 2 次提交
-
-
由 gongweibao 提交于
-
由 Zeng Jinle 提交于
* feature/buffer_shared_inplace, test=develop * refine code, test=develop * fix elementwise_add op cpu inplace and sum inplace bug, test=develop * add unittest and debug log, test=develop * fix parallel_executor scope bug, polish code, test=develop * fix sum op, activation op, single_in_place_inference bug, test=develop * remove kLocalExecScopeName, test=develop * fix unittest,test=develop * fix out_var first version bug, test=develop * follow comments,test=develop
-
- 10 7月, 2019 1 次提交
-
-
由 lujun 提交于
remove dygraph.enable from __all__ hidden dygraph. profiler add doc to dygraph. no_grad
-
- 09 7月, 2019 2 次提交
-
-
由 Physher 提交于
-
由 LielinJiang 提交于
* fix transform matrix bug, test=develop * modify API.spec
-
- 05 7月, 2019 3 次提交
-
-
由 zhaoyuchen2018 提交于
* Fix topk cannot handle 1D vector bug Add path to handle 1D vector test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com> * refine code test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
由 Jiabin Yang 提交于
* test=develop, fix docker with paddle nccl problem * test=develop, hide no_support api and add ut for it
-
由 LielinJiang 提交于
* add_distributions_of_normal_and_uniform * paddle/fluid/API.spec * modify API.spec * modified paddle/fluid/API.spec, test=develop * modify paddle/fluid/API.spec, test=develop * modify paddle/fluid/API.spec, test=develop * fix some comment, test=develop * modify API.spec, test=develop * add comment for init function, modify hard code, test=develop * modify API.spec, test=develop * modify API.spec, test=develop * make unit test function shorter, test=develop * modify paddle/fluid/API.spec
-
- 04 7月, 2019 2 次提交
-
-
由 qingqing01 提交于
* make it support float/int learning as input.
-
由 chengduo 提交于
-
- 03 7月, 2019 7 次提交
-
-
由 zhoukunsheng 提交于
-
由 zhoukunsheng 提交于
* test=develop support Tensor input for chunk_eval op * test=develop fix testcase for chunk_eval op * test=develop fix typos in nn.py
-
由 zhoukunsheng 提交于
-
由 zhoukunsheng 提交于
-
由 zhoukunsheng 提交于
-
由 zhoukunsheng 提交于
-
由 hutuxian 提交于
move the thread-check code from train_from_dataset to a single function add UT for the thread check function
-
- 02 7月, 2019 2 次提交
-
-
由 Yi Liu 提交于
1. Since allreduce op has 4 reduce types, We split these four reduce types into four ops 2. We also refined the collective op code, e.g. we separated the collective op kernel into CPUKernel and CUDAKernel, and remove the device specified DeviceContext parameter in template as we already knew the target DeviceContext 3. We remove the newly added Collective op role to reduce the complexity of program and graph analysis
-
由 chengduo 提交于
* add not_been_used_vars to no_grad_set test=develop
-
- 01 7月, 2019 1 次提交
-
-
由 LielinJiang 提交于
* modify roi_perspective_transform_op to output mask and transform matrix * modify comment * modify comment * modify API.spec * update API.spec * remove no use header, test=develop * resolve conflict
-
- 27 6月, 2019 2 次提交
-
-
由 kh2se2013 提交于
* add WITH_COVERAGE option, default OFF test=develop * add coverage for python sdk test=develop * fix code style * fix COVERAGE_FILE path test=develop * remove coverage package test=develop * test = develop, run coverage as module
-
由 HaoRen 提交于
* fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop * supports collective training in executor * make fetch_list runable with variables, add more unittest for use_program_cache test=develop * fix comment test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plugin more strategy * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop * supports collective training in executor * make fetch_list runable with variables, add more unittest for use_program_cache test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * fix comment test=develop * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plugin more strategy * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * test=develop add collective op unittest standard * test=develop remove the test_collective directory * test=develop remove the test_collective directory * remove slicegather test * code format for reducescatter * update attr of shard_index_op * Modify macro nccl_helper * remove test without distribute * macro collective_helper * marcro update * test=develop update support python3.5 * test=develop change gpu memory use to 0.1 when test * test=develop update ut equal func * test=develop set flags to 1.5 * test=develop fix pickle dumple py35 * test=develop fix divide in slice and add sync_comm_stream update atol and rtol to 1e-05 rm shard_index op and test modify read input from file to read from memory remove origin_program in framework and add i/o in c_sync_calc_stream * test=develop update unittest sync operator I/O
-
- 26 6月, 2019 4 次提交
-
-
由 hutuxian 提交于
-
由 Jiabin Yang 提交于
-
由 Yibing Liu 提交于
* Update lamb optimizer test=develop, test=document_preview * Regenerate api spec test=develop, test=document_preview
-
由 Jiabin Yang 提交于
-
- 25 6月, 2019 1 次提交
-
-
由 Hongyu Liu 提交于
* sequnce mask support max length tensor input; test=develop * add rnn_impl.py; test=develop * add basic gru lstm unittest; test=develop * fix api spec; test=develop * fix sequence_mask op bug; test=develop test=document_preview * change +-*x to elmentwise_op; test=develop * add mkl flag; test=develop * fix rnn impl bug; test=develop * update api spec; test=develop * fix doc bug; test=develop * fix lstm bugs; test=develop
-
- 23 6月, 2019 1 次提交
-
-
由 chengduo 提交于
test=develop
-
- 21 6月, 2019 2 次提交
-
-
由 xiaoting 提交于
* set src_idx > 0, test=develop * add unittest and cu, test=develop
-
由 guru4elephant 提交于
* add more print function for timeout issue, make timeout value larger
-
- 20 6月, 2019 2 次提交
-
-
由 Hongyu Liu 提交于
* fix slice op bug; test=develop * fix variabel test bug; test=develop * remove slice while true; test=develop
-
由 Jiabin Yang 提交于
* test=develop, fix test_imperative_transformer and ocr * test=develop, remove ocr recovery part
-
- 19 6月, 2019 3 次提交
-
-
由 Qiao Longfei 提交于
-
由 翟飞跃 提交于
* fix spelling errors; test=develop * Update API.spec update md5 * Update API.spec * change the order of api;test=develop
-
由 Jiabin Yang 提交于
* test=develop, add add_multi_gpu_install_check * test=develop, refine warning doc * test=develop, refine warning doc * test=develop, refine warning doc * test=develop, support multi cpu
-
- 18 6月, 2019 1 次提交
-
-
由 chengduo 提交于
* remove nccl dep when the number of GPU is 1 test=develop
-
- 17 6月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* fix py_reader iterable bug, test=develop * move data from buffered_reader,test=develop
-
- 16 6月, 2019 3 次提交
-
-
由 qingqing01 提交于
* Update backward.py: - If there is no input grad var in all outputs of previous ops, do not append this op into graph. - Only apply this stragety when double backward. * Update some double backward op. * Update sum_op to judge whether a tensor is empty by numel or IsInitialized().
-
由 FlyingQianMM 提交于
* test=develop add detection output for supporting retinanet * test=develop add test_layers.py * test=develop add API.spec * test=develop alter test_retinanet_detection_output.py * test=develop alter round 2 * test=develop alter retinanet_detection_output * test=develop alter paddle/fluid/API.spec * test=devlop alter detection.py * test=develop alter retinanet_detection_output * test=develop alter paddle/fluid/API.spec * test=develop alter detection.py * test=develop alter API.spec * test=develop alter retinanet_detection_output * test=develop alter paddle/fluid/API.spec * test=develop alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py * test=develop alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py * test=develop fix grammer error * test=develop fix grammer error * test=develop fix grammer error * test=develop alter python/paddle/fluid/tests/unittests/test_layers.py * test=develop alter paddle/fluid/API.spec
-
由 guru4elephant 提交于
* add class name and timeline for test_dist_base.py
-