1. 19 1月, 2021 6 次提交
  2. 18 1月, 2021 6 次提交
    • Z
      [cherry-pick] avoid calling cast twice #30528 · 2967624b
      Zhang Ting 提交于
       cherry-pick #30527 
      2967624b
    • G
      [cherry-pick]Modify the calculation logic of LambOptimizer (#29313) (#30510) · b3fa899b
      guofei 提交于
      * Modify the calculation logic of LambOptimizer (#29313)
      
      * Modify the calculation logic of LambOptimizer
      
      * Modify the calculation logic of LambOptimizer
      
      * Modify the calculation logic of LambOptimizer
      b3fa899b
    • C
      [cherry-pick] add pad and concat double grad #29549 (#30432) · 5e4d54a1
      ceci3 提交于
      * add pad and concat double grad
      
      * resolve conflict
      5e4d54a1
    • Z
      [cherry-pick] improve perfomance of cast and tril op (#30498) · de003cee
      Zhang Ting 提交于
      * add fp16 support for tril_triu op (#30186)
      
      * add VecCastCUDAKernel (#30296)
      Co-authored-by: Nfurnace <34057289+windstamp@users.noreply.github.com>
      de003cee
    • 1
      test=develop, fix fleet.metric (#30438) (#30473) · 2c3799d1
      123malin 提交于
      * test=develop, fix fleet.metrics(mse, rmse, mae)
      2c3799d1
    • P
      Cherry-pick PR 30103. Add Inplace strategy (Output reuse Input Varbase) in... · 27c2f1ea
      pangyoki 提交于
      Cherry-pick PR 30103. Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103) (#30496)
      
      * add view strategy on squeeze,unsqueeze,reshape,flatten
      
      * add squeeze unittest
      
      * add unittests
      
      * use View strategy as name rather than Reuse Allacation
      
      * fix view api doc
      
      * fix format
      
      * use core.ops when input of reshape2 is Tensor
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * add inplace strategy
      
      * add elementwise_add sub
      
      * let backward op not use inplace
      
      * grad op do not use inplace
      
      * fix memory increase error and add leaf error message
      
      * delete selected_rows
      
      * change op_function
      
      * little change
      
      * solve HandleViewBetweenInputAndOutput
      
      * add unittest and leaf error message
      
      * merge view error
      
      * optimize op_function_generator format and support sum inplace op
      
      * fix format of basic_engine
      
      * fix format for framework
      
      * little change of variable wrapper
      
      * add reshape, squeeze, unsqueeze, scatter api
      
      * add relu elu tanh softmax inplace api
      
      * fix test_squeeze_op unittest
      
      * fix test_relu_op unittest
      
      * fix comment problems
      
      * delete sample code of inplace api
      
      * add reference of grad_pending_nodes in basic_engine
      
      * fix unittest name
      
      * add inplace apis into wlist
      
      * fix error message
      
      * add PADDLE_ENFORCE for set grad op twice
      
      * fix head file error
      27c2f1ea
  3. 15 1月, 2021 6 次提交
  4. 14 1月, 2021 9 次提交
    • C
      skip quantizing ops in cpu inference (#30342) (#30405) · 2f16e0c6
      cc 提交于
      2f16e0c6
    • W
    • C
      [Cherry-pick] Fix prune input bug of jit.save #30425 · 2cdc36f4
      Chen Weihang 提交于
      [Cherry-pick] Fix prune input bug of jit.save
      
      cheryy-pick of #30384
      2cdc36f4
    • Q
      optimize memcpy perf for kunlun (#30291) (#30382) · 9de42be2
      QingshuChen 提交于
      * optimize memcpy perf for kunlun (#30291)
      
      * optimize memcpy perf for kunlun
      
      * remove useless unitest for kunlun mean
      
      * minor
      
      * fix bug that cann't find mkldnn(kunlun) (#30394)
      9de42be2
    • L
      [cherrypick 2.0] add double grad for conv_transpose and depthwise_conv (#30429) · 1552343a
      LielinJiang 提交于
      * Add double grad for conv_transpose (#29706)
      
      * add double grad for conv_transpose
      
      * register cudnn conv double grad for depthwise conv (#29807)
      1552343a
    • B
      cherry-pick 30354 (#30407) · 5d30d072
      Bai Yifan 提交于
      5d30d072
    • C
      fix bug of celoss when using ignore_index and reduction (#30395) · c22ee575
      chajchaj 提交于
      * fix bug of celoss when using ignore_index and reduction (#30180)
      
      * fix bug of using ignore_index and reduction,test=develop
      
      * fix bug of celoss when using ignore_index and reduction, test=develop
      
      * improve performance when ignore_index=-100, test=develop
      
      * add test in test_cross_entropy_loss.py for coverage rate, test=develop
      
      * rm comment in test_cross_entropy_loss.py, test=develop
      
      * del  hard code of "float64" in python/paddle/nn/functional/loss.py, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * del comment in python/paddle/nn/functional/loss.py, test=develop
      
      * del hard code and change mask to a more simplified implementation, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * fix bug of celoss when using ignore_index and reduction (#30180)
      
      * fix bug of using ignore_index and reduction,test=develop
      
      * fix bug of celoss when using ignore_index and reduction, test=develop
      
      * improve performance when ignore_index=-100, test=develop
      
      * add test in test_cross_entropy_loss.py for coverage rate, test=develop
      
      * rm comment in test_cross_entropy_loss.py, test=develop
      
      * del  hard code of "float64" in python/paddle/nn/functional/loss.py, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * del comment in python/paddle/nn/functional/loss.py, test=develop
      
      * del hard code and change mask to a more simplified implementation, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * change mask to a more simplified implementation, test=develop
      c22ee575
    • C
      fix (#30399) · e1bad4d7
      Chengmo 提交于
      Co-authored-by: NseiriosPlus <tangwei12@baidu.com>
      e1bad4d7
    • W
      fix compile error on ARM (#30390) · 14b60947
      Wilber 提交于
      14b60947
  5. 13 1月, 2021 7 次提交
  6. 12 1月, 2021 6 次提交