1. 21 Sep, 2019 1 commit
    • Feature/auto prune in dygraph (#19757) · 45425411
      Jiabin Yang committed
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix compile error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * test=develop, refactor name to make it easier to understand

      * test=develop, refactor name to make it easier to understand

      * test=develop, fix multi-gpu failed problem, add Tracer tests, change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ
      
      * test=develop, fix ut failed on parallel se-resnext
      
      * test=develop, change one more PADDLE_ENFORCE
      
      * support auto prune in dygraph mode
      
      * test=develop, support auto prune
      
      * test=develop, merge develop conflict
      
      * test=develop, fix test_layer and test_tracer ut
      
      * test=develop, fix bug which may cause stop_gradient disabled with a list of backward inputs
  2. 17 Sep, 2019 2 commits
    • add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208) · 88628016
      liym27 committed
      add support parameter inference when arguments starts or ends is a list containing integer and tensor variable;
      test=develop,test=document_preview
      
      improve slice op according to review(from hongyu). test=develop
      
      fix slice op according to review: infer_flags, test=develop
      
      fix slice op: improve overload operator __getitem__ to support attrs(starts and ends) are Variable.
      test=develop,test=document_preview
      
      fix test_slice_op: add TestSliceOp_decs_dim_6 to resolve conflict with test_slice_ngraph_op. test=develop
      
      add stop_gradient=True when attr(starts) or attr(ends) is tensor Variable.
      test=develop,test=document_preview
    • Fix example error of Variable and Operator (#19821) · 2729c174
      chengduo committed
      * fix example error
      test=develop
      
      * Remove set_desc
      test=develop
  3. 16 Sep, 2019 1 commit
  4. 09 Sep, 2019 1 commit
    • Fix train error when test_program.clone is executed after optimizer.minimize (#19397) · c78a4781
      Chen Weihang committed
      * add prune when test_program.clone is executed after optimizer.minimize
      
      * add unittest, test=develop
      
      * add resnet and transformer test case, test=develop
      
      * add regularization for optimizer & program compare function, test=develop
      
      * add lstm unittest, test=develop
      
      * polish code based on review comment, test=develop
      
      * adapt to interface change in framework._prune, test=develop
      
      * update API.spec, test=develop
  5. 05 Sep, 2019 2 commits
    • Refactor dygraph (#19107) · e9233d1c
      Jiabin Yang committed
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix compile error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * test=develop, refactor name to make it easier to understand

      * test=develop, refactor name to make it easier to understand

      * test=develop, fix multi-gpu failed problem, add Tracer tests, change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ
      
      * test=develop, fix ut failed on parallel se-resnext
      
      * test=develop, change one more PADDLE_ENFORCE
    • add feed_var_names to Prune interface (#19589) · dca9b6c5
      mapingshuo committed
      * Fix bug: add feed_vars to the prune function
  6. 04 Sep, 2019 1 commit
    • add detach API for Variable in dygraph mode, test=develop (#19477) · 4d26274d
      zhongpu committed
      * add to and detach for Variable in dygraph, test=develop
      
      * add detach for Variable in dygraph, test=develop
      
      * add detach for Variable in dygraph, test=develop
      
      * add detach for Variable in dygraph, test=develop
      
      * add detach for Variable in dygraph, test=develop
      
      * add detach for Variable in dygraph, test=develop
      
      * add exception check, test=develop
  7. 23 Aug, 2019 1 commit
  8. 21 Aug, 2019 1 commit
  9. 19 Aug, 2019 1 commit
  10. 12 Aug, 2019 1 commit
  11. 05 Aug, 2019 1 commit
  12. 27 Jul, 2019 1 commit
  13. 23 Jul, 2019 1 commit
    • supports distributed classification (#18690) · 157211c4
      Yi Liu committed
      * supports distributed classification training
      * update API.spec
      * fix evenly division in python3
      * change "index_range" to "index_num" in shard_index operator
      test=document_preview
      test=develop
  14. 11 Jul, 2019 1 commit
  15. 01 Jul, 2019 1 commit
  16. 28 Jun, 2019 2 commits
    • Fix/program doc (#17908) · 43f64a17
      Jiabin Yang committed
      * test=develop, add some comments for Program.clone
      
      * test=develop, add API.spec
      
      * test=develop, refine comments
      
      * refine Program doc and clone doc
      
      * test=develop, refine doc
    • Add is_compiled_with_cuda (#18356) · 871cc15e
      chengduo committed
      * add cuda_is_available
      test=develop
      
      * Fix api.spec
      test=develop
      
      * fix api doc
      test=develop
  17. 27 Jun, 2019 1 commit
    • supports collective communicated training (#18175) · b7128bac
      HaoRen committed
      * fix prepare context redundant code problem, optimize executor by caching create_variables
      test=develop
      
      * supports collective training in executor
      
      * make fetch_list runnable with variables, add more unittest for use_program_cache
      test=develop
      
      * fix comment
      test=develop
      
      * use unique name for nccl_id
      
      * supports output to stream in program_to_code
      
      * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code
      
      * set op role in collective training
      
      * add collective op role
      
      * remove orig file
      
      * add build optimizer by strategy
      
      * add collective strategy
      
      * refine collective strategy
      
      * add multi-process role maker
      
      * refine strategy building factory so that we can easily plugin more strategy
      
      * scale loss grad in collective sgd transpiler
      
      * add support for distributed fc
      
      * code format
      
      * revert some features for dist fc
      
      * add support for distributed fc training
      
      * fix prepare context redundant code problem, optimize executor by caching create_variables
      test=develop
      
      * supports collective training in executor
      
      * make fetch_list runnable with variables, add more unittest for use_program_cache
      test=develop
      
      * use unique name for nccl_id
      
      * supports output to stream in program_to_code
      
      * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code
      
      * set op role in collective training
      
      * add collective op role
      
      * fix comment
      test=develop
      
      * remove orig file
      
      * add build optimizer by strategy
      
      * add collective strategy
      
      * refine collective strategy
      
      * add multi-process role maker
      
      * refine strategy building factory so that we can easily plugin more strategy
      
      * scale loss grad in collective sgd transpiler
      
      * add support for distributed fc
      
      * code format
      
      * revert some features for dist fc
      
      * add support for distributed fc training
      
      * test=develop
      add collective op unittest standard
      
      * test=develop
      remove the test_collective directory
      
      * test=develop
      remove the test_collective directory
      
      * remove slicegather test
      
      * code format for reducescatter
      
      * update attr of shard_index_op
      
      * Modify macro nccl_helper
      
      * remove test without distribute
      
      * macro collective_helper
      
      * macro update
      
      * test=develop
      update support python3.5
      
      * test=develop change gpu memory use to 0.1 when test
      
      * test=develop
      update ut equal func
      
      * test=develop
      set flags to 1.5
      
      * test=develop fix pickle dump py35
      
      * test=develop
      fix divide in slice and add sync_comm_stream
      update atol and rtol to 1e-05
      rm shard_index op and test
      modify read input from file to read from memory
      remove origin_program in framework and add i/o in c_sync_calc_stream
      
      * test=develop update unittest sync operator I/O
  18. 21 Jun, 2019 1 commit
  19. 20 Jun, 2019 1 commit
  20. 16 Jun, 2019 1 commit
  21. 15 Jun, 2019 1 commit
  22. 14 Jun, 2019 1 commit
  23. 13 Jun, 2019 1 commit
  24. 12 Jun, 2019 1 commit
    • combine noavx and avx package (#17889) · 5c06bff2
      tensor-tang committed
      * support avx and noavx core
      
      * add catch and give some log
      
      test=develop
      
      * fix build
      
      test=develop
      
      * add missing package
      
      test=develop
      
      * fix pybind name
      
      test=develop
      
      * fix import error
      
      test=develop
      
      * combine noavx core
      
      test=develop
      
      * add requirements
      
      test=develop
      
      * fix unknown message
      
      test=develop
      
      * fix api spec
      
      test=develop
      
      * refine and clean
      
      test=develop
      
      * update
      
      * pass dist ut
      
      * follow comments
      
      test=develop
      
      * refine scripts
      
      test=develop
  25. 11 Jun, 2019 1 commit
    • Pipeline Concurrency (#17402) · 969e6378
      hutuxian committed
      Add Pipeline Concurrency Train Mode:
      - Cpp: pipeline_trainer & section_worker
      - Python: PipelineOptimizer
      - Add a new data_feed type: PrivateInstantDataFeed
      - Add a test demo of pipeline trainer and the test model is gnn
      - Do not support win32 now
  26. 10 Jun, 2019 1 commit
  27. 06 Jun, 2019 1 commit
  28. 04 Jun, 2019 1 commit
    • one hot support tensor depth (#16972) · 82358bfd
      Hongyu Liu committed
      * support some input tensor remain on cpu; test=develop
      
      * fix input = none; test=develop
      
      * fix unfound bug; test=develop
      
      * fix proto None case; test=develop
      
      * fix bug; test=develop
      
      * fix proto null bug; test=develop
      
      * remove conv check; test=develop
      
      * fix test bug; test=develop
      
      * move fill constant; test=develop
      
      * no change in proto; test=develop
      
      * fix bug; test=develop
      
      * change attr depth name; test=develop
      
      * remove remain cpu; test=develop
      
      * fix bug; test=develop
      
      * merge develop; test=develop
      
      * fix one_hot bug; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * fix python api bug; test=develop
  29. 03 Jun, 2019 1 commit
  30. 27 May, 2019 1 commit
  31. 24 May, 2019 1 commit
  32. 21 May, 2019 1 commit
  33. 16 May, 2019 1 commit
  34. 15 May, 2019 1 commit
  35. 13 May, 2019 2 commits
  36. 08 May, 2019 1 commit
    • Repair api example (#17221) · e388a1fb
      lujun committed
      Fix the following API examples:
      
      paddle.fluid.scope_guard
      paddle.fluid.backward.append_backward
      paddle.fluid.cpu_places
      paddle.fluid.cuda_pinned_places
      paddle.fluid.cuda_places
      paddle.fluid.in_dygraph_mode
      paddle.fluid.CUDAPlace
      paddle.fluid.CPUPlace
      paddle.fluid.CUDAPinnedPlace