1. 11 10月, 2022 2 次提交
    • S
      [cherry-pick] [PHI] relu6_grad kernel (#46501) (#46862) · 2bcbf8b0
      Sławomir Siwek 提交于
      * [PHI] Migrate gelu kernels (#45596)
      
      * gaussian random
      
      * mkldnn to onednn renaming
      
      * fix merge conflicts
      
      * remove fluid code
      
      * onednn renaming
      
      * gelu fwd
      
      * sort activations
      
      * gelu gradient
      
      * remove unused macros
      
      * merge conflicts
      
      * fix merge conflicts
      
      * remove extra contraint from gelu op
      
      * [PHI] relu6_grad kernel (#46501)
      
      * Relu6
      
      * remove fluid handler
      
      * add individual kernel signature
      
      * coding style
      
      * replace bounded_relu with clip
      
      * whitespace
      
      * code style
      2bcbf8b0
    • Y
      [BugFix]Fix concat bugs when call onednn kernel (#46518) (#46845) · 6a6c7493
      YuanRisheng 提交于
      * fix concat bug
      
      * fix ci bugs
      
      * fix ci bugs
      6a6c7493
  2. 20 9月, 2022 1 次提交
  3. 19 9月, 2022 1 次提交
  4. 17 9月, 2022 2 次提交
  5. 14 9月, 2022 1 次提交
  6. 07 9月, 2022 1 次提交
  7. 06 9月, 2022 1 次提交
  8. 01 9月, 2022 2 次提交
  9. 25 8月, 2022 1 次提交
  10. 24 8月, 2022 1 次提交
  11. 19 8月, 2022 1 次提交
  12. 15 8月, 2022 2 次提交
  13. 10 8月, 2022 1 次提交
  14. 05 8月, 2022 1 次提交
    • Y
      [MKLDNN]Move mkldnn activation kernel to phi (#44365) · 2dfa88d2
      YuanRisheng 提交于
      * move mkldnn activation kernel
      
      * fix compile bugs
      
      * fix compile bugs
      
      * deal with conflict
      
      * fix compile bugs
      
      * fix windows compile bugs
      
      * mkldnn unittest fix
      
      * change mutable to alloc
      
      * fix unittest bugs
      
      * modify code according comment
      2dfa88d2
  15. 04 8月, 2022 2 次提交
  16. 03 8月, 2022 1 次提交
  17. 02 8月, 2022 1 次提交
  18. 01 8月, 2022 4 次提交
  19. 29 7月, 2022 1 次提交
  20. 28 7月, 2022 1 次提交
    • H
      [XPU] add top_k op (#44656) · acf07c74
      houj04 提交于
      * [XPU] add top_k op. test=kunlun
      
      * [XPU] add top_k op. test=kunlun
      
      * use PADDLE_ENFORCE_XDNN_NOT_NULL to check pointer. test=kunlun
      acf07c74
  21. 27 7月, 2022 3 次提交
  22. 26 7月, 2022 1 次提交
    • Z
      add horizontal federation learning ps feature (#44327) · 4bc22b69
      ziyoujiyi 提交于
      * back fl
      
      * delete ssl cert
      
      * .
      
      * make warning
      
      * .
      
      * unittest paral degree
      
      * solve unittest
      
      * heter & multi cloud commm ready
      
      * .
      
      * .
      
      * fl-ps v1.0
      
      * .
      
      * support N + N mode
      
      * .
      
      * .
      
      * .
      
      * .
      
      * delete print
      
      * .
      
      * .
      
      * .
      
      * .
      
      * fix bug
      
      * .
      
      * .
      
      * fl-ps with coordinator ready
      
      * merge dev
      
      * update message parse only
      
      * update fl client scheduler
      
      * fix bug
      
      * update multithreads sync
      
      * fix ci errors
      
      * update role_maker.py
      
      * update role_maker.py
      
      * fix ci error: windows py import error
      
      * fix ci error: windows py import error
      
      * fix windows ci pylib import error
      
      * add dump fields & params
      
      * try to fix windows import fleet error
      
      * fix ps FLAGS error
      4bc22b69
  23. 22 7月, 2022 2 次提交
  24. 20 7月, 2022 1 次提交
  25. 18 7月, 2022 1 次提交
  26. 15 7月, 2022 2 次提交
  27. 14 7月, 2022 2 次提交
    • L
      refine allocation cmake (#44241) · dc5a0420
      Leo Chen 提交于
      * build into one static library
      
      * move memory/detail to memory/allocation
      
      * fix bug
      
      * fix profiler
      
      * fix framework_proto
      
      * fix deps
      
      * fix inference compilation
      
      * fix rocm compile
      
      * follow comments
      
      * fix buddy_allocator_test
      dc5a0420
    • Y
      [Phi]Improve the mechanism for mkldnn kernel in PHI (#43941) · e9b4d0be
      YuanRisheng 提交于
      * adapt mkldnn kernel in PHI
      
      * fix ci compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * delete comment
      
      * fix compile bugs in windows-inference
      
      * delete code for converage
      
      * modify code by review
      
      * modify code by review
      
      * add todo
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix unittest bugsx
      e9b4d0be