1. 21 9月, 2022 1 次提交
  2. 02 7月, 2022 1 次提交
    • L
      unify cpu context, part2 (#44012) · 755438a7
      Leo Chen 提交于
      * fix init()
      
      * delete test_device_context
      
      * replace CPUDeviceContext with CPUContext
      
      * fix test_scalar
      
      * remove dot_op.cc
      
      * fix compile
      755438a7
  3. 26 6月, 2022 1 次提交
  4. 05 6月, 2022 1 次提交
  5. 16 4月, 2022 1 次提交
  6. 02 3月, 2022 1 次提交
    • F
      [Pten] Gru lstm migration (#39729) · e4dba69a
      Feiyu Chan 提交于
      * move sequence2batch
      
      * move lstm and gru
      
      * Add phi/kernels directory into exclusion to stop using hipcc to compile non .cu files in it.
      e4dba69a
  7. 20 2月, 2022 1 次提交
  8. 18 2月, 2022 1 次提交
  9. 12 6月, 2021 1 次提交
  10. 30 3月, 2021 1 次提交
  11. 04 3月, 2021 1 次提交
  12. 27 1月, 2021 1 次提交
    • J
      REUPLOAD Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30719) · f8da5536
      jakpiase 提交于
      * added external reorder to profiler
      
      * resolved conflict
      
      * added enable_static
      
      * initial version of lstm, not working yet
      
      * added lstm to operators.cmake
      
      * added vanilla lstm mkldnn op
      
      * added peephole weights integration
      
      * minor changes
      
      * added formatting
      
      * added fusion_lstm_mkldnn to static_whitelist
      
      * added formatting
      
      * removed comment
      
      * moved use_peepholes attribute inside is_cached block
      
      * reverted wrong changes
      
      * minor formatting change
      
      * minor changes
      
      * changed stream handling
      
      * minor change
      
      * added datatype to GetExpectedKernelType()
      
      * added reading stream from TLS
      f8da5536
  13. 26 1月, 2021 2 次提交
  14. 10 4月, 2020 1 次提交
  15. 28 11月, 2019 1 次提交
  16. 28 10月, 2019 1 次提交
  17. 11 9月, 2019 1 次提交
    • Y
      Implement the GPU kernel of fc operator (#19687) · a65c728e
      Yiqun Liu 提交于
      * Refine the codes related to fc op.
      
      * Add GPU implementation for fc functor.
      
      * Apply fc_fuse_pass in GPU inference.
      test=develop
      
      * Change the cmake for fc op.
      
      * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ.
      
      * Add an attribute to set the activation type in fc_op.
      
      * Enhance the unittest of fc_op.
      test=develop
      
      * Remove the declaration of FCOpGrad back to the header file.
      test=develop
      
      * Set default value for newly added arguments in test_fc_op.
      test=develop
      a65c728e
  18. 14 8月, 2019 1 次提交
  19. 19 6月, 2019 1 次提交
    • fix spelling errors (#17941) · 802ea509
      翟飞跃 提交于
      * fix spelling errors; test=develop
      
      * Update API.spec
      
      update md5
      
      * Update API.spec
      
      * change the order of api;test=develop
      802ea509
  20. 08 3月, 2019 1 次提交
  21. 07 3月, 2019 1 次提交
  22. 20 12月, 2018 1 次提交
  23. 18 12月, 2018 1 次提交
  24. 17 12月, 2018 1 次提交
  25. 12 12月, 2018 1 次提交
  26. 21 11月, 2018 1 次提交
  27. 16 11月, 2018 1 次提交
    • W
      Refine operator cmake (#14413) · a2d9b344
      Wu Yi 提交于
      * wip simplify operator framework
      
      * wip
      
      * wip
      
      * done test=develop
      
      * clean test=develop
      
      * fix test=develop
      
      * fix deps test=develop
      
      * fix cpu build test=develop
      
      * fix tensorrt build test=develop
      
      * fix tests test=develop
      
      * fix test=develop
      
      * fix cpu build test=develop
      a2d9b344
  28. 17 10月, 2018 1 次提交
  29. 12 10月, 2018 1 次提交
  30. 11 10月, 2018 1 次提交
  31. 09 10月, 2018 1 次提交
  32. 30 9月, 2018 1 次提交
  33. 26 9月, 2018 1 次提交
  34. 14 9月, 2018 3 次提交
  35. 12 9月, 2018 1 次提交
  36. 11 9月, 2018 2 次提交