1. 08 1月, 2021 2 次提交
    • L
      [cherry-pick 2.0] Fix bug: In dynamic mode, if start or end is negetive,... · 5fe3da39
      liym27 提交于
      [cherry-pick 2.0] Fix bug: In dynamic mode, if start or end is negetive, __getitem__  return wrong result(#30003) (#30146)
      
      1. when slice_item is a slice:
       1) the start of __getitem__ should be std::max(start, 0) if slice
       2) the start of __getitem__ should be std::min(end, dim)
      2. when slice_item is an integer, it should be in [-dim_len, dim_len)
      3. Fix error message to use accurate data
      5fe3da39
    • 1
      【2.0API CherryPick】LookAhead, ModelAverage, IndexSelect (#30205) · 3ce4d34d
      123malin 提交于
      * Add Lookahead and ModelAverage Optimizer (#30004)
      
      * test=develop, add model_average and lookahead
      
      * Improve Index select cuda kernel (#30139)
      
      * test=develop, add index_select_cuda kernel
      3ce4d34d
  2. 06 1月, 2021 1 次提交
    • L
      [Cherry-Pick 2.0][Dynamic Inplace] Support ShareInplaceVersionCounterWith for... · 743649b5
      liym27 提交于
      [Cherry-Pick 2.0][Dynamic Inplace] Support ShareInplaceVersionCounterWith for C++ Tensor (#29842) (#30105)
      
      Before this PR, SharePlaceHolderWith share Tensor between different C++ Variable, which meas sharing the data, shape, and inplace_version_counter_ of Tensor.
      But in some cases, Sharing data and inplace_version_counter_ but not sharing shape is needed. For example, inplace op reshape, can't share shape.
      
      This PR, discard SharePlaceHolderWith, and expose ShareInplaceVersionCounterWith for C++ Tensor.
      This reverts commit b10ecd9d.
      
      * Support ShareInplaceVersionCounterWith to share the same inplace version counter for VarBase
      743649b5
  3. 05 1月, 2021 1 次提交
  4. 04 1月, 2021 1 次提交
  5. 29 12月, 2020 2 次提交
  6. 25 12月, 2020 1 次提交
    • T
      2 0 ps core 2 (#29894) · f781ab08
      tangwei12 提交于
      * add ps table (#29463)
      
      * add ps table
      
      Change-Id: I468a04bd071d21ff52654926fcf4d5f3da19e178
      
      * add service (#29560)
      
      * add service, remove ut on mac
      
      * fix heter_profiler & add heter stop method
      
      * fix code style
      
      * merge pscore
      
      Change-Id: Ie7f60d1cdde6755a0c29db26863c6283e9843d57
      
      * fix cmake
      
      Change-Id: I6773509a7b4ca79139ecc40b7bf3eb318ceff8bb
      
      * fix conflit
      
      Change-Id: I35575be0c96a8520f9d756ea7f1ff0b904a165ba
      
      * fix conflit
      
      Change-Id: Ic926ea0b0d67803226d51241397ba3b510226bfa
      f781ab08
  7. 22 12月, 2020 1 次提交
  8. 21 12月, 2020 1 次提交
  9. 17 12月, 2020 1 次提交
    • S
      [cherry-pick]fix matmulv2 bug & add rebuild group & fix bug of download (#29726) · df0430dc
      ShenLiang 提交于
      * Fix the dowanload bug in the case of multiple machines (#29551)
      
      * fix the dowanload bug
      * add sort for ips
      
      * Fix bug of matmul_v2 for broadcast case (#29599)
      
      * fix bug of matmul_v2 for broadcast
      
      * Rebuild group automatically in dynamic graph distributed (#29255)
      
      * add tensor_indices in AssignGroupBySize
      
      * add rebuild group in reducer
      
      * fix error message of gather nd (#29521)
      df0430dc
  10. 05 12月, 2020 1 次提交
    • myq406450149's avatar
      update unbind norm add CUDAPlace api doc information (#29322) (#29391) · 7e322b3c
      myq406450149 提交于
      * enhance array_to_lod_tensor_op lod_tensor_to_array_op errors information. test=develop
      
      * fix format. test=develop
      
      * format fix. test=develop
      
      * add lod_rank_table. test=develop
      
      * fix format. test=develop
      
      * fix doc info. test=develop
      
      * fix np error
      
      * add unbind dygraph api. test=develop
      
      * fix unbind doc.test=develop
      7e322b3c
  11. 04 12月, 2020 2 次提交
  12. 03 12月, 2020 1 次提交
    • Z
      [Cherry-pick] Add pure fp16 training with master weights. (#29301) · d8ea8a06
      Zhen Wang 提交于
      * Add pure fp16 training with master weights. (#27712)
      
      * add the weight decay func for the momentum op
      
      * Add the multi_precision function in Momentum Optimizer.
      
      * Make sure that the initial value of master weights are same with the fp16 weights.
      
      * add static loss scaling.
      
      * add the rescale_grad function in the pure fp16 training.
      
      * use the original momentum updating method.
      
      * Polish some codes, such as variable names.
      
      * add docstring for apis.
      
      * update the var creation details of _create_master_weight.
      
      * not modify codes about imperative momentum updating.
      
      * Fix the error of test_dist_sparse_tensor_load_momentum UT.
      
      * add unit test for multi precision fp16 training.
      
      * add more unit tests for CI.
      
      * Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.
      d8ea8a06
  13. 01 12月, 2020 2 次提交
  14. 30 11月, 2020 1 次提交
    • L
      Check whether there is any inplace operation affecting gradient calculation. (#27901) · 865a4598
      liym27 提交于
      * Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable.
      
      * Add a new attribute `_inplace_version` for VarBase.
      
      * Raise exception if an inplace operation can result in incorrect gradient computation.
      
      * Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation.
      
      * For api assign, call _bump_inplace_version() when it's an inplace operation inn dynamic mode.
      
      * Use original var_wrapper if the inplace_version is not changed.
      
      * Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performane.
      865a4598
  15. 27 11月, 2020 1 次提交
    • S
      Support dynamic graph distributed (#28997) · e2d01eb6
      ShenLiang 提交于
      * add reducer
      
      * refine envent for memorycopy
      
      * add concat&split for allreduce
      
      * apply concat & split for fuse tensor
      
      * fix nccl dep
      
      * fix the untest, compile problem and ddp initialize problem
      
      * fix untest for mac & add some comments & solve the repeated param in sublayers
      
      * fix untest for windows & fix document
      e2d01eb6
  16. 26 11月, 2020 1 次提交
  17. 25 11月, 2020 1 次提交
  18. 23 11月, 2020 1 次提交
  19. 20 11月, 2020 2 次提交
  20. 18 11月, 2020 1 次提交
  21. 13 11月, 2020 1 次提交
  22. 11 11月, 2020 1 次提交
  23. 05 11月, 2020 1 次提交
  24. 04 11月, 2020 2 次提交
    • L
      Add broadcast_shape api (#28257) · 8b2436a7
      Leo Chen 提交于
      * add broadcast_shape api
      
      * add ut
      
      * follow comments
      
      * add example code, test=dodument_fix
      
      * update example code, test=document_fix
      8b2436a7
    • enhance the op_version_registry, test=develop (#28347) · 21a63f6f
      石晓伟 提交于
      * enhance the op_version_registry, test=develop
      
      * add unittests, test=develop
      
      * enhance the op_version_registry, test=develop
      
      * fix bugs, test=develop
      
      * revert pybind_boost_headers.h, test=develop
      
      * fix a attribute bug, test=develop
      21a63f6f
  25. 03 11月, 2020 2 次提交
  26. 02 11月, 2020 1 次提交
  27. 30 10月, 2020 1 次提交
  28. 29 10月, 2020 2 次提交
  29. 28 10月, 2020 1 次提交
  30. 27 10月, 2020 2 次提交
    • Z
      add Fuse bn add act pass (#28196) · fdc06f21
      Zhang Ting 提交于
      * add fuse_bn_add_act pass
      fdc06f21
    • C
      Enrich the python error types of paddle & polish format (#28124) · 813b2ade
      Chen Weihang 提交于
      * add multiple exception type
      
      * define all exception & polish compile pystack
      
      * mapping paddle error to python exception
      
      * polish static mode error format
      
      * fix failed unittests
      
      * fix dytostatic test_error
      
      * fix check_nan_inf failed
      
      * add unittest for coverage
      
      * revert some code try to solve compile error
      
      * refactor enforce & error change
      
      * polish code & add unittest
      813b2ade
  31. 26 10月, 2020 1 次提交