1. 09 Jan, 2021 1 commit
  2. 08 Jan, 2021 1 commit
  3. 07 Jan, 2021 4 commits
    • [cherry pick] Some optimizations of elementwise_add, gelu and dropout for AMP (#30152) · 07f68fad
      Committed by Leo Chen
      * Improve performance of elementwise_add grad op (#29187)
      
      * pass stop_gradient for cast op
      
      * improve performance of elementwise_add grad
      
      * use tensor copy async
      
      * dygraph branch
      
      * fix dygraph branch
      
      * add ut
      
      * make gelu fp16 computing more robust (#29484)
      
      * Add fast path for dropout when p == 0  (#29553)
      
      * add fast path for p == 0 in dropout
      
      * add ut
    • [Cherry-pick] Layer norm fp16 and Nvidia optimize (#29169 #29434 #29522 #29576) (#30110) · 44b81e63
      Committed by furnace
      * Layer norm fp16 (#29169)
      
      * add fp16 for layer_norm op
      
      * revert layernorm api
      
      * fix forward
      
      * fix forward
      
      * fix backward for layernorm with fp16
      
      * fix unit test for layernorm with fp16
      
      * fix with_mkldnn compile error for layernorm with fp16
      
      * 1. revert to PADDLE_ENFORCE_NOT_NULL, 2. change static_cast<float> to static_cast<U>
      
      * fix with_mkldnn compile error for layernorm with fp16
      
      * fix with_mkldnn compile error for layernorm with fp16
      Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
      
      * fix layer_norm accuracy (#29434)
      
      * Layernorm opt (#29522)
      
      * layernorm fw opt
      
      * layernorm bw opt
      
      * fix typo, test=develop
      
      * remove const dim3 for windows CI compatibility
      
      * merge develop
      Co-authored-by: zlsh80826 <zlsh80826@gmail.com>
      
      * Fix compile problem when cuda_arch < 6000 (#29576)
      
      * fix compile problem when cuda_arch < 6000
      
      * refine code
      
      * refine code
      Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
      Co-authored-by: zlsh80826 <zlsh80826@gmail.com>
    • pre padding in dygraph (#30179) · a2b0357d
      Committed by tangwei12
      Change-Id: Ia5279b0cbb6a5b3970aff66e9510e0d85efa70ce
    • Cherry pick bn (#30136) · 157ff094
      Committed by ceci3
      * fix bn docs (#30096)
      
      * add attribute for batch_norm (#29950)
      
      * add attribute for batch_norm
  4. 06 Jan, 2021 1 commit
    • [Cherry-pick]cherry-pick to Release/2.0 (#30076) · 1ad7fcbf
      Committed by huangxu96
      * add fp16 check into max and avg pool (#29479)
      
      * Add ReserveSpace in dygraph batch_norm. (#29221)
      
      * Add ReserveSpace in dygraph batch_norm.
      
      * Add unittest for reservespace
      
      * add float16 into adaptive_avg_pool2d check list. (#29547)
  5. 31 Dec, 2020 2 commits
  6. 30 Dec, 2020 1 commit
    • [cherry-pick] Fix 2.0 bugs (#29992) · faf2bb39
      Committed by Chen Long
      * fix doc bugs test=document_fix
      
      * fix code bugs test=document_fix
      
      * fix code bugs test=document_fix
      
      * fix doc bugs test=document_fix
      
      * fix doc bugs test=document_fix
      
      * fix doc bugs test=document_fix
  7. 29 Dec, 2020 2 commits
  8. 16 Dec, 2020 1 commit
  9. 07 Dec, 2020 3 commits
  10. 05 Dec, 2020 1 commit
  11. 04 Dec, 2020 1 commit
  12. 03 Dec, 2020 3 commits
  13. 01 Dec, 2020 1 commit
  14. 30 Nov, 2020 3 commits
  15. 29 Nov, 2020 1 commit
  16. 27 Nov, 2020 3 commits
    • fix doc of erf,rank,mm,cross_entropy,pixel_shuffle,kron... (#29126) · 3c2a46bd
      Committed by 徐铭远
      * fix doc example, test=develop, test=document_fix
    • Fix interpolate doc (#29104) · 9cc0e726
      Committed by xiaoting
      * fix interpolate example, test=develop;test=document_fix
      
      * fix format, test=develop, test=document_fix
      
      * update upsample doc, test=develop, test=document_fix
    • Fix docs in 2.0 API (#29081) · 9b39af3f
      Committed by whs
      * 1. grid_sample
          1.1 fix has_print
        2. conv1d_transpose
          2.1 fix code_example error
        3. conv1d
        4. affine_grid
          4.1 has_print
          4.2 has_disable_static
        5. Conv1DTranspose
          5.1 fix code_example error
          5.2 has_disable_static
        6. Conv1d
          6.1 code_example
          6.2 has_disable_static
  17. 26 Nov, 2020 5 commits
  18. 25 Nov, 2020 4 commits
  19. 24 Nov, 2020 2 commits