1. 18 1月, 2021 3 次提交
    • Z
      [cherry-pick] improve perfomance of cast and tril op (#30498) · de003cee
      Zhang Ting 提交于
      * add fp16 support for tril_triu op (#30186)
      
      * add VecCastCUDAKernel (#30296)
      Co-authored-by: Nfurnace <34057289+windstamp@users.noreply.github.com>
      de003cee
    • 1
      test=develop, fix fleet.metric (#30438) (#30473) · 2c3799d1
      123malin 提交于
      * test=develop, fix fleet.metrics(mse, rmse, mae)
      2c3799d1
    • P
      Cherry-pick PR 30103. Add Inplace strategy (Output reuse Input Varbase) in... · 27c2f1ea
      pangyoki 提交于
      Cherry-pick PR 30103. Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103) (#30496)
      
      * add view strategy on squeeze,unsqueeze,reshape,flatten
      
      * add squeeze unittest
      
      * add unittests
      
      * use View strategy as name rather than Reuse Allacation
      
      * fix view api doc
      
      * fix format
      
      * use core.ops when input of reshape2 is Tensor
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * add inplace strategy
      
      * add elementwise_add sub
      
      * let backward op not use inplace
      
      * grad op do not use inplace
      
      * fix memory increase error and add leaf error message
      
      * delete selected_rows
      
      * change op_function
      
      * little change
      
      * solve HandleViewBetweenInputAndOutput
      
      * add unittest and leaf error message
      
      * merge view error
      
      * optimize op_function_generator format and support sum inplace op
      
      * fix format of basic_engine
      
      * fix format for framework
      
      * little change of variable wrapper
      
      * add reshape, squeeze, unsqueeze, scatter api
      
      * add relu elu tanh softmax inplace api
      
      * fix test_squeeze_op unittest
      
      * fix test_relu_op unittest
      
      * fix comment problems
      
      * delete sample code of inplace api
      
      * add reference of grad_pending_nodes in basic_engine
      
      * fix unittest name
      
      * add inplace apis into wlist
      
      * fix error message
      
      * add PADDLE_ENFORCE for set grad op twice
      
      * fix head file error
      27c2f1ea
  2. 15 1月, 2021 6 次提交
  3. 14 1月, 2021 6 次提交
    • C
      skip quantizing ops in cpu inference (#30342) (#30405) · 2f16e0c6
      cc 提交于
      2f16e0c6
    • C
      [Cherry-pick] Fix prune input bug of jit.save #30425 · 2cdc36f4
      Chen Weihang 提交于
      [Cherry-pick] Fix prune input bug of jit.save
      
      cheryy-pick of #30384
      2cdc36f4
    • Q
      optimize memcpy perf for kunlun (#30291) (#30382) · 9de42be2
      QingshuChen 提交于
      * optimize memcpy perf for kunlun (#30291)
      
      * optimize memcpy perf for kunlun
      
      * remove useless unitest for kunlun mean
      
      * minor
      
      * fix bug that cann't find mkldnn(kunlun) (#30394)
      9de42be2
    • L
      [cherrypick 2.0] add double grad for conv_transpose and depthwise_conv (#30429) · 1552343a
      LielinJiang 提交于
      * Add double grad for conv_transpose (#29706)
      
      * add double grad for conv_transpose
      
      * register cudnn conv double grad for depthwise conv (#29807)
      1552343a
    • B
      cherry-pick 30354 (#30407) · 5d30d072
      Bai Yifan 提交于
      5d30d072
    • C
      fix bug of celoss when using ignore_index and reduction (#30395) · c22ee575
      chajchaj 提交于
      * fix bug of celoss when using ignore_index and reduction (#30180)
      
      * fix bug of using ignore_index and reduction,test=develop
      
      * fix bug of celoss when using ignore_index and reduction, test=develop
      
      * improve performance when ignore_index=-100, test=develop
      
      * add test in test_cross_entropy_loss.py for coverage rate, test=develop
      
      * rm comment in test_cross_entropy_loss.py, test=develop
      
      * del  hard code of "float64" in python/paddle/nn/functional/loss.py, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * del comment in python/paddle/nn/functional/loss.py, test=develop
      
      * del hard code and change mask to a more simplified implementation, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * fix bug of celoss when using ignore_index and reduction (#30180)
      
      * fix bug of using ignore_index and reduction,test=develop
      
      * fix bug of celoss when using ignore_index and reduction, test=develop
      
      * improve performance when ignore_index=-100, test=develop
      
      * add test in test_cross_entropy_loss.py for coverage rate, test=develop
      
      * rm comment in test_cross_entropy_loss.py, test=develop
      
      * del  hard code of "float64" in python/paddle/nn/functional/loss.py, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * del comment in python/paddle/nn/functional/loss.py, test=develop
      
      * del hard code and change mask to a more simplified implementation, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * change mask to a more simplified implementation, test=develop
      c22ee575
  4. 13 1月, 2021 4 次提交
  5. 12 1月, 2021 7 次提交
  6. 11 1月, 2021 12 次提交
  7. 10 1月, 2021 1 次提交
  8. 08 1月, 2021 1 次提交
    • L
      [cherry-pick] [Dy2Stat] Don't convert to paddle.shape if var_x.shape is not... · 2ba9bdd7
      liym27 提交于
      [cherry-pick] [Dy2Stat] Don't convert to paddle.shape if var_x.shape is not negetive #29965 (#30235)
      
      * [Cherry-Pick 2.0] [Dy2Stat] Don't convert to paddle.shape if var_x.shape is not negetive (#29965)
      
      1. When x is Variable, call nn.shape(x) only in following cases:
       1)The shape of x is used in control flow condition.
       2)The dim to be used is negetive
      2. When x is Variable, but x.shape or x.shape[idx] doesn't contain negetive value, don't convert to paddle.shape()
      
      * [Cherry-Pick 2.0] [Dy2Stat] Use Paddle2.0 api paddle.tensor.array_* (#30156)
      2ba9bdd7