  1. 18 Jan, 2021 1 commit
    • Cherry-pick PR 30103. Add Inplace strategy (Output reuse Input Varbase) in... · 27c2f1ea
      pangyoki committed
      Cherry-pick PR 30103. Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103) (#30496)
      
      * add view strategy on squeeze,unsqueeze,reshape,flatten
      
      * add squeeze unittest
      
      * add unittests
      
      * use View strategy as name rather than Reuse Allocation
      
      * fix view api doc
      
      * fix format
      
      * use core.ops when input of reshape2 is Tensor
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * add inplace strategy
      
      * add elementwise_add sub
      
      * let backward op not use inplace
      
      * grad op do not use inplace
      
      * fix memory increase error and add leaf error message
      
      * delete selected_rows
      
      * change op_function
      
      * little change
      
      * solve HandleViewBetweenInputAndOutput
      
      * add unittest and leaf error message
      
      * merge view error
      
      * optimize op_function_generator format and support sum inplace op
      
      * fix format of basic_engine
      
      * fix format for framework
      
      * little change of variable wrapper
      
      * add reshape, squeeze, unsqueeze, scatter api
      
      * add relu elu tanh softmax inplace api
      
      * fix test_squeeze_op unittest
      
      * fix test_relu_op unittest
      
      * fix comment problems
      
      * delete sample code of inplace api
      
      * add reference of grad_pending_nodes in basic_engine
      
      * fix unittest name
      
      * add inplace apis into wlist
      
      * fix error message
      
      * add PADDLE_ENFORCE for set grad op twice
      
      * fix head file error
  2. 15 Jan, 2021 1 commit
    • Cherry pick 30072 (#30499) · 590e718b
      pangyoki committed
      * Cherry-pick 30072, add dispensable input for core.ops.reshape2/expand/slice (#30072)
      
      * add dispensable input 'shape' for core.ops.reshape2
      
      * add dispensable inputs for core.ops.reshape2/expand/slice
      
      * add ut
      
      * save reshape update in pr 30180
      
      * save reshape update v2 in pr 30180
      Co-authored-by: Leo Chen <chenqiuliang@baidu.com>
  3. 14 Jan, 2021 1 commit
    • fix bug of celoss when using ignore_index and reduction (#30395) · c22ee575
      chajchaj committed
      * fix bug of celoss when using ignore_index and reduction (#30180)
      
      * fix bug of using ignore_index and reduction,test=develop
      
      * fix bug of celoss when using ignore_index and reduction, test=develop
      
      * improve performance when ignore_index=-100, test=develop
      
      * add test in test_cross_entropy_loss.py for coverage rate, test=develop
      
      * rm comment in test_cross_entropy_loss.py, test=develop
      
      * del hard code of "float64" in python/paddle/nn/functional/loss.py, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * del comment in python/paddle/nn/functional/loss.py, test=develop
      
      * del hard code and change mask to a more simplified implementation, test=develop
      
      * change mask to a more simplified implementation, test=develop
      
      * change mask to a more simplified implementation, test=develop
  4. 11 Jan, 2021 1 commit
  5. 09 Jan, 2021 1 commit
  6. 07 Jan, 2021 3 commits
    • [cherry pick] Some optimizations of elementwise_add, gelu and dropout for AMP (#30152) · 07f68fad
      Leo Chen committed
      * Improve performance of elementwise_add grad op (#29187)
      
      * pass stop_gradient for cast op
      
      * improve performance of elementwise_add grad
      
      * use tensor copy async
      
      * dygraph branch
      
      * fix dygraph branch
      
      * add ut
      
      * make gelu fp16 computing more robust (#29484)
      
      * Add fast path for dropout when p == 0  (#29553)
      
      * add fast path for p == 0 in dropout
      
      * add ut
    • [Cherry-pick] Layer norm fp16 and Nvidia optimize (#29169 #29434 #29522 #29576) (#30110) · 44b81e63
      furnace committed
      * Layer norm fp16 (#29169)
      
      * add fp16 for layer_norm op
      
      * revert layernorm api
      
      * fix forward
      
      * fix forward
      
      * fix backward for layernorm with fp16
      
      * fix unit test for layernorm with fp16
      
      * fix with_mkldnn compile error for layernorm with fp16
      
      * 1. revert to PADDLE_ENFORCE_NOT_NULL, 2. change static_cast<float> to static_cast<U>
      
      * fix with_mkldnn compile error for layernorm with fp16
      
      * fix with_mkldnn compile error for layernorm with fp16
      Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
      
      * fix layer_norm accuracy (#29434)
      
      * Layernorm opt (#29522)
      
      * layernorm fw opt
      
      * layernorm bw opt
      
      * fix typo, test=develop
      
      * remove const dim3 for windows CI compatibility
      
      * merge develop
      Co-authored-by: zlsh80826 <zlsh80826@gmail.com>
      
      * Fix compile problem when cuda_arch < 6000 (#29576)
      
      * fix compile problem when cuda_arch < 6000
      
      * refine code
      
      * refine code
      Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
      Co-authored-by: zlsh80826 <zlsh80826@gmail.com>
    • Cherry pick bn (#30136) · 157ff094
      ceci3 committed
      * fix bn docs (#30096)
      
      * add attribute for batch_norm (#29950)
      
      * add attribute for batch_norm
  7. 06 Jan, 2021 1 commit
    • [Cherry-pick]cherry-pick to Release/2.0 (#30076) · 1ad7fcbf
      huangxu96 committed
      * add fp16 check into max and avg pool (#29479)
      
      * Add ReserveSpace in dygraph batch_norm. (#29221)
      
      * Add ReserveSpace in dygraph batch_norm.
      
      * Add unittest for reservespace
      
      * add float16 into adaptive_avg_pool2d check list. (#29547)
  8. 31 Dec, 2020 1 commit
  9. 30 Dec, 2020 1 commit
    • [cherry-pick] Fix 2.0 bugs (#29992) · faf2bb39
      Chen Long committed
      * fix doc bugs test=document_fix
      
      * fix code bugs test=document_fix
      
      * fix code bugs test=document_fix
      
      * fix doc bugs test=document_fix
      
      * fix doc bugs test=document_fix
      
      * fix doc bugs test=document_fix
  10. 29 Dec, 2020 1 commit
    • [cherry-pick] clean redundant API alias in 2.0 - part 1 #29928 (#29960) · c9c835b5
      XiaoguangHu committed
      * [cherry-pick] cherry-pick of PR#29928
      
      * delete paddle.metric.chunk_eval and paddle.metric.mean_iou
      
      * delete paddle.nn.clip and paddle.nn.clip_by_norm
      
      * delete paddle.nn.functional.activation.hard_sigmoid and paddle.nn.functional.activation.hard_swish
      
      * [cherry-pick] cherry-pick of PR#29928
      
      * fix extension import error
  11. 07 Dec, 2020 3 commits
  12. 05 Dec, 2020 1 commit
  13. 04 Dec, 2020 1 commit
  14. 03 Dec, 2020 2 commits
  15. 01 Dec, 2020 1 commit
  16. 30 Nov, 2020 2 commits
  17. 29 Nov, 2020 1 commit
  18. 27 Nov, 2020 3 commits
    • fix doc of erf,rank,mm,cross_entropy,pixel_shuffle,kron... (#29126) · 3c2a46bd
      徐铭远 committed
      * fix doc example, test=develop, test=document_fix
    • Fix interpolate doc (#29104) · 9cc0e726
      xiaoting committed
      * fix interpolate example, test=develop;test=document_fix
      
      * fix format, test=develop, test=document_fix
      
      * update upsample doc, test=develop, test=document_fix
    • Fix docs in 2.0 API (#29081) · 9b39af3f
      whs committed
      * 1. grid_sample
           1.1 fix has_print
        2. conv1d_transpose
           2.1 fix code_example error
        3. conv1d
        4. affine_grid
           4.1 has_print
           4.2 has_disable_static
        5. Conv1DTranspose
           5.1 fix code_example error
           5.2 has_disable_static
        6. Conv1d
           6.1 code_example
           6.2 has_disable_static
  19. 26 Nov, 2020 3 commits
  20. 25 Nov, 2020 3 commits
  21. 24 Nov, 2020 4 commits
  22. 23 Nov, 2020 2 commits
  23. 20 Nov, 2020 2 commits