1. 02 Apr, 2021 1 commit
    • [Cherry-Pick] logclean & embedding doc (#32009) · 8140485a
      tangwei12 committed
      * fix en doc for emb (#31980)
      
      * fix en doc for emb, test=document_fix;
      Change-Id: I4757e67caacd7189f068493ed45a7445f87ffb40
      
      * LOG CLEAN (#31819)
      
      * upgrade vlog
      
      * train from dataset fetch optimize
  2. 31 Mar, 2021 2 commits
  3. 02 Mar, 2021 1 commit
    • [CP] align fleet param (#31220) · d15e73b0
      lilong12 committed
      * update, test=develop (#30692)
      
      * align the default value of some configuration for fleet to that of single cards (#30740)
      
      * update, test=develop
  4. 01 Mar, 2021 5 commits
  5. 27 Feb, 2021 1 commit
    • [Cherry-Pick] Split Macros and Add modeling unittest (#31266) · 52f7e773
      Aurelius84 committed
      * [CustomOp] Add Modeling with Custom op unittest (#31218)
      
      * add unittest for static/dygraph/dy2stat
      
      * add PE unittest

      * remove useless code
      
      * add unittest in CMakeList.txt
      
      * [CustomOp] Split build op macro & polish details (#31229)

      * split build op macro & polish details
      
      * revert register api del
      
      * fix other unittest
      
      * [CustomOp] Support Incremental compilation and Add Version management (#31228)
      
      * Support Incremental compilation and Add Version management
      
      * replace hash with hashlib
      
      * fix test_op_num unittest
      
      * Revert "fix test_op_num unittest"
      
      This reverts commit 2f78de976e1d7ca60915b2310717b38a32ae204a.
      Co-authored-by: Chen Weihang <chenweihang@baidu.com>
  6. 26 Feb, 2021 2 commits
  7. 24 Feb, 2021 1 commit
  8. 23 Feb, 2021 2 commits
  9. 02 Feb, 2021 1 commit
    • Conv bn fuse fix (#30830) · b4be9717
      alncat committed
      * fixed compilation error on gcc 4.8.x due to the usage of isfinite (#30733)
      
      * modified conv+bn fuse pass to fix wrong mask in mask rcnn (#30704)
  10. 19 Jan, 2021 3 commits
  11. 18 Jan, 2021 1 commit
    • Cherry-pick PR 30103. Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103) (#30496) · 27c2f1ea
      pangyoki committed
      
      * add view strategy on squeeze,unsqueeze,reshape,flatten
      
      * add squeeze unittest
      
      * add unittests
      
      * use View strategy as name rather than Reuse Allocation
      
      * fix view api doc
      
      * fix format
      
      * use core.ops when input of reshape2 is Tensor
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * add inplace strategy
      
      * add elementwise_add sub
      
      * let backward op not use inplace
      
      * grad op do not use inplace
      
      * fix memory increase error and add leaf error message
      
      * delete selected_rows
      
      * change op_function
      
      * little change
      
      * solve HandleViewBetweenInputAndOutput
      
      * add unittest and leaf error message
      
      * merge view error
      
      * optimize op_function_generator format and support sum inplace op
      
      * fix format of basic_engine
      
      * fix format for framework
      
      * little change of variable wrapper
      
      * add reshape, squeeze, unsqueeze, scatter api
      
      * add relu elu tanh softmax inplace api
      
      * fix test_squeeze_op unittest
      
      * fix test_relu_op unittest
      
      * fix comment problems
      
      * delete sample code of inplace api
      
      * add reference of grad_pending_nodes in basic_engine
      
      * fix unittest name
      
      * add inplace apis into wlist
      
      * fix error message
      
      * add PADDLE_ENFORCE for set grad op twice
      
      * fix header file error
  12. 14 Jan, 2021 3 commits
  13. 13 Jan, 2021 3 commits
  14. 12 Jan, 2021 6 commits
  15. 11 Jan, 2021 4 commits
    • [Cherry-Pick] Support vector<double> as type of op attribute and op set_value supports vector<double> as value (#30126) (#30305) · d839761e
      liym27 committed
      
      Cherry-Pick #30126
      1. Support vector<float64> as type of op attribute.
      2. op set_value supports float64 numpy.array.
    • [cherry-pick] Async drop scope in executor (#29714) #30285 · 93ce7f69
      Leo Chen committed
      [cherry-pick] Async drop scope in executor (#29714)
    • [Cherry-Pick] Support pure fp16 training for AMP API. (#29544) (#30241) · d8dfef54
      Zhen Wang committed
      * Support pure fp16 training for AMP API. (#29544)
      
      * add cast ops before and after unsupported fp16 ops.
      
      * Keep partial net in FP32 pattern.
      
      * Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.
      
      * Add fp16 support for adam op.
      
      * add multi precision attr for adam.
      
      * Fix the bug of test_multi_precision_fp16_train UT.
      
      * Code format for CI.
      
      * Fix the redefine error about MPTypeTrait on windows.
      
      * fix bugs of the _create_accumulators func in Momentum.
      
      * fix bug when inserting post cast op.
      
      * Add the update_loss_scaling op in allow_set of UnusedVarCheck.
      
      * Update for ci coverage.
      
      * Add some doc for OptimizerWithMixedPrecision.
      
      * Fix the code style.
      
      * Improve the doc of `amp_init`.
      
      * Change for fp16 testing if users have the infer program defined in a separate way.
      
      * Remove tensor copy in the update_loss_scaling op. (#29426)
      
      * remove tensor copy in the update_loss_scaling op
      
      * not use thrust.
      
      * fix some cuda memory access error.
    • [cherry-pick 2.0] optimize gradient merge (#30185) · e283dc6f
      WangXi committed
      * Optimization grad merge performance (#29784)
      
      * [fleet] combine amp and gradient merge, test=develop (#30086)
      
      * fix assign_op_xpu concat_op_xpu warning (#30120)
      Co-authored-by: liuyuhui <liuyuhui@baidu.com>
  16. 08 Jan, 2021 1 commit
  17. 07 Jan, 2021 1 commit
  18. 06 Jan, 2021 1 commit
    • [Cherry-Pick 2.0][Dynamic Inplace] Support ShareInplaceVersionCounterWith for C++ Tensor (#29842) (#30105) · 743649b5
      liym27 committed
      
      Before this PR, SharePlaceHolderWith shared a Tensor between different C++ Variables, which means sharing the Tensor's data, shape, and inplace_version_counter_.
      In some cases, however, the data and inplace_version_counter_ must be shared but the shape must not. For example, the inplace op reshape cannot share shape.

      This PR discards SharePlaceHolderWith and exposes ShareInplaceVersionCounterWith for C++ Tensor.
      This reverts commit b10ecd9d.
      
      * Support ShareInplaceVersionCounterWith to share the same inplace version counter for VarBase
  19. 05 Jan, 2021 1 commit