1. 09 Jan, 2021 1 commit
    • add View(reuse allocation) strategy on squeeze, unsqueeze, reshape, flatten op (#29913) · da16b33f
      pangyoki committed
      * add view strategy on squeeze,unsqueeze,reshape,flatten
      
      * add squeeze unittest
      
      * add unittests
      
      * use View strategy as name rather than Reuse Allocation
      
      * fix view api doc
      
      * fix format
      
      * use core.ops when input of reshape2 is Tensor
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * delete selected_rows
      
      * change op_function
      
      * little change
      
      * solve HandleViewBetweenInputAndOutput
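The "View (reuse allocation)" idea named in this commit can be illustrated with a toy sketch: shape-only ops return a new tensor object that shares the input's storage instead of copying it. `ToyTensor` and its methods are hypothetical names for illustration, not Paddle's implementation.

```python
class ToyTensor:
    """Hypothetical tensor: a flat storage list plus shape metadata."""

    def __init__(self, storage, shape):
        numel = 1
        for dim in shape:
            numel *= dim
        if numel != len(storage):
            raise ValueError("shape does not match number of elements")
        self.storage = storage  # shared by all views, never copied here
        self.shape = tuple(shape)

    def reshape(self, shape):
        # Only the metadata changes; the allocation is reused.
        return ToyTensor(self.storage, shape)

    def unsqueeze(self, axis):
        shape = list(self.shape)
        shape.insert(axis, 1)
        return ToyTensor(self.storage, shape)

    def squeeze(self):
        return ToyTensor(self.storage, [d for d in self.shape if d != 1])

    def flatten(self):
        return ToyTensor(self.storage, [len(self.storage)])


x = ToyTensor([0, 1, 2, 3, 4, 5], [2, 3])
y = x.reshape([3, 2])
y.storage[0] = 42  # a write through the view is visible in x
print(x.storage[0], y.shape)
```

Because the output aliases the input, a write through one is visible through the other, which is the trade-off a view strategy accepts in exchange for skipping the copy.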
  2. 08 Jan, 2021 3 commits
  3. 07 Jan, 2021 1 commit
  4. 06 Jan, 2021 2 commits
  5. 05 Jan, 2021 1 commit
  6. 04 Jan, 2021 1 commit
  7. 27 Dec, 2020 1 commit
  8. 26 Dec, 2020 1 commit
  9. 24 Dec, 2020 1 commit
  10. 23 Dec, 2020 1 commit
  11. 22 Dec, 2020 1 commit
  12. 16 Dec, 2020 2 commits
  13. 15 Dec, 2020 2 commits
  14. 09 Dec, 2020 2 commits
  15. 05 Dec, 2020 1 commit
    • update unbind norm add CUDAPlace api doc information (#29322) · 7c508d86
      myq406450149 committed
      * enhance array_to_lod_tensor_op lod_tensor_to_array_op errors information. test=develop
      
      * fix format. test=develop
      
      * format fix. test=develop
      
      * add lod_rank_table. test=develop
      
      * fix format. test=develop
      
      * fix doc info. test=develop
      
      * fix np error
      
      * add unbind dygraph api. test=develop
      
      * fix unbind doc. test=develop
  16. 04 Dec, 2020 2 commits
  17. 02 Dec, 2020 1 commit
    • Add pure fp16 training with master weights. (#27712) · be3777a5
      Zhen Wang committed
      * add the weight decay func for the momentum op
      
      * Add the multi_precision function in Momentum Optimizer.
      
      * Make sure that the initial values of the master weights are the same as the fp16 weights.
      
      * add static loss scaling.
      
      * add the rescale_grad function in the pure fp16 training.
      
      * use the original momentum updating method.
      
      * Polish some code, such as variable names.
      
      * add docstring for apis.
      
      * update the var creation details of _create_master_weight.
      
      * do not modify the code for imperative momentum updating.
      
      * Fix the error of test_dist_sparse_tensor_load_momentum UT.
      
      * add unit test for multi precision fp16 training.
      
      * add more unit tests for CI.
      
      * Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.
      
      * For CI Coverage Checking.
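A minimal pure-Python sketch of why master weights matter in pure fp16 training with a momentum optimizer. It assumes static loss scaling with a `rescale_grad` factor of `1 / loss_scale`; all function names and hyperparameters here are illustrative and are not taken from the Paddle source. Python floats stand in for the fp32 master copy, and `struct` format `"e"` rounds values to IEEE half precision.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def momentum_step(weight, velocity, grad_fp16, lr, mu, rescale_grad):
    """One momentum update; the scaled fp16 gradient is unscaled first."""
    grad = grad_fp16 * rescale_grad
    velocity = mu * velocity + grad
    return weight - lr * velocity, velocity


def train(steps, use_master_weight, lr=1e-5, mu=0.9, loss_scale=128.0):
    fp16_weight = to_fp16(1.0)
    master = float(fp16_weight)  # master weight starts equal to the fp16 weight
    velocity = 0.0
    for _ in range(steps):
        # Assumed constant true gradient of 1.0, scaled by the static loss scale.
        grad_fp16 = to_fp16(1.0 * loss_scale)
        if use_master_weight:
            master, velocity = momentum_step(
                master, velocity, grad_fp16, lr, mu, 1.0 / loss_scale)
            fp16_weight = to_fp16(master)  # refresh the fp16 copy
        else:
            updated, velocity = momentum_step(
                fp16_weight, velocity, grad_fp16, lr, mu, 1.0 / loss_scale)
            fp16_weight = to_fp16(updated)  # small updates are rounded away
    return fp16_weight, master


print(train(100, use_master_weight=False)[0])
print(train(100, use_master_weight=True))
```

In this setup each update is smaller than half the fp16 spacing near 1.0, so the fp16-only weight never moves, while the higher-precision master copy accumulates the updates and the refreshed fp16 copy follows it.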
  18. 01 Dec, 2020 2 commits
  19. 30 Nov, 2020 1 commit
    • Check whether there is any inplace operation affecting gradient calculation. (#27901) · 865a4598
      liym27 committed
      * Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable.
      
      * Add a new attribute `_inplace_version` for VarBase.
      
      * Raise exception if an inplace operation can result in incorrect gradient computation.
      
      * Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation.
      
      * For api assign, call _bump_inplace_version() when it's an inplace operation in dynamic mode.
      
      * Use original var_wrapper if the inplace_version is not changed.
      
      * Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performance.
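The version-counter mechanism described in this commit can be sketched in a few lines of plain Python: every in-place modification bumps a counter, the forward pass records the counter value for each input it saves, and the backward pass raises if the value has changed. Only the names `TensorInplaceVersion`, `_inplace_version`, and `_bump_inplace_version` come from the commit message; everything else below is a hypothetical stand-in, not Paddle's code.

```python
class TensorInplaceVersion:
    """Counts how many times a tensor has been modified in place."""

    def __init__(self):
        self._version = 0

    def bump(self):
        self._version += 1

    @property
    def current(self):
        return self._version


class ToyTensor:
    def __init__(self, values):
        self.values = list(values)
        self._counter = TensorInplaceVersion()

    @property
    def _inplace_version(self):
        return self._counter.current

    def _bump_inplace_version(self):
        self._counter.bump()

    def scale_(self, factor):
        """In-place op: mutate the data, then bump the version."""
        for i in range(len(self.values)):
            self.values[i] *= factor
        self._bump_inplace_version()
        return self


def square_forward(x):
    """Forward of y = x**2; backward needs the original x."""
    snapshot_version = x._inplace_version
    out = [v * v for v in x.values]

    def backward(grad_out):
        if x._inplace_version != snapshot_version:
            raise RuntimeError(
                "input was modified by an inplace operation after forward; "
                "the gradient would be incorrect")
        return [2 * v * g for v, g in zip(x.values, grad_out)]

    return out, backward


x = ToyTensor([1.0, 2.0])
out, backward = square_forward(x)
print(backward([1.0, 1.0]))  # fine: x is unchanged since forward
x.scale_(3.0)                # in-place modification bumps the version
try:
    backward([1.0, 1.0])
except RuntimeError as err:
    print("caught:", err)
```

Comparing a single integer at backward time is cheap, which is presumably why a snapshot of the version is preferable to keeping a defensive copy of every saved input.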
  20. 27 Nov, 2020 1 commit
    • Support dynamic graph distributed (#28997) · e2d01eb6
      ShenLiang committed
      * add reducer
      
      * refine event for memorycopy
      
      * add concat&split for allreduce
      
      * apply concat & split for fuse tensor
      
      * fix nccl dep
      
      * fix the unittest, compile problem and ddp initialize problem
      
      * fix unittest for mac & add some comments & solve the repeated param in sublayers
      
      * fix unittest for windows & fix document
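The "concat & split for allreduce" items above suggest fusing many small gradient tensors into one buffer so that a single collective call replaces many. A rough single-process simulation follows; the helper names are hypothetical, and averaging across workers is an assumption here, since the commit message does not say whether the reduction is a sum or a mean.

```python
def concat(tensors):
    """Flatten a list of 1-D gradients into one buffer, remembering sizes."""
    flat, sizes = [], []
    for tensor in tensors:
        flat.extend(tensor)
        sizes.append(len(tensor))
    return flat, sizes


def split(flat, sizes):
    """Inverse of concat: cut the fused buffer back into per-parameter pieces."""
    pieces, offset = [], 0
    for size in sizes:
        pieces.append(flat[offset:offset + size])
        offset += size
    return pieces


def simulated_allreduce_mean(buffers):
    """Stand-in for a collective: element-wise mean across workers."""
    count = len(buffers)
    return [sum(values) / count for values in zip(*buffers)]


def fused_allreduce(per_worker_grads):
    fused, sizes = [], None
    for grads in per_worker_grads:
        flat, sizes = concat(grads)
        fused.append(flat)
    reduced = simulated_allreduce_mean(fused)  # one call for all parameters
    return split(reduced, sizes)


workers = [
    [[1.0, 2.0], [3.0]],  # worker 0: gradients of two parameters
    [[3.0, 4.0], [5.0]],  # worker 1
]
print(fused_allreduce(workers))
```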
  21. 26 Nov, 2020 1 commit
  22. 25 Nov, 2020 1 commit
  23. 23 Nov, 2020 1 commit
  24. 20 Nov, 2020 2 commits
  25. 18 Nov, 2020 1 commit
  26. 13 Nov, 2020 1 commit
  27. 11 Nov, 2020 1 commit
  28. 05 Nov, 2020 1 commit
  29. 04 Nov, 2020 2 commits
    • Add broadcast_shape api (#28257) · 8b2436a7
      Leo Chen committed
      * add broadcast_shape api
      
      * add ut
      
      * follow comments
      
      * add example code, test=document_fix
      
      * update example code, test=document_fix
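For readers unfamiliar with what a `broadcast_shape` API computes, here is a standalone sketch of the usual NumPy-style rule: shapes are right-aligned, and each pair of dimensions must be equal or contain a 1. This is an illustrative reimplementation, assuming the API follows that rule; it is not the code added by the commit.

```python
from itertools import zip_longest


def broadcast_shape(x_shape, y_shape):
    """Return the shape that results from broadcasting x_shape with y_shape."""
    result = []
    # Walk from the trailing dimension; a missing dimension counts as 1.
    for x_dim, y_dim in zip_longest(
            reversed(x_shape), reversed(y_shape), fillvalue=1):
        if x_dim == y_dim or y_dim == 1:
            result.append(x_dim)
        elif x_dim == 1:
            result.append(y_dim)
        else:
            raise ValueError(
                f"shapes {list(x_shape)} and {list(y_shape)} "
                "cannot be broadcast together")
    return list(reversed(result))


print(broadcast_shape([2, 1, 3], [1, 3, 1]))  # → [2, 3, 3]
print(broadcast_shape([3], [4, 1]))           # → [4, 3]
```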
    • enhance the op_version_registry, test=develop (#28347) · 21a63f6f
      石晓伟 committed
      * enhance the op_version_registry, test=develop
      
      * add unittests, test=develop
      
      * enhance the op_version_registry, test=develop
      
      * fix bugs, test=develop
      
      * revert pybind_boost_headers.h, test=develop
      
      * fix an attribute bug, test=develop
  30. 03 Nov, 2020 1 commit
    • Optimize ERNIE model inference performance in TensorRT and support variable-length input (#28367) · ea851796
      Shang Zhizhou committed
      * fp16 result ok
      
      * change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS
      
      * auto detect special slice op converter for ernie with trt oss
      
      * ernie oss only support fp16
      
      * fix special_slice_plugin serialize bug
      
      * matmul in tensorrt ok
      
      * ernie unittest ok
      
      * add matmul tensorrt unittest
      
      * remove demo code