1. 31 Mar, 2021 5 commits
    • delete cuda9 code (#31883) · ea738dda
      tianshuo78520a committed
    • Update eigen version to f612df27 (#31832) · 495e7f9c
      wuhuanzhou committed
      * update eigen version to f612df27, test=develop
      
      * fix compilation error, test=develop
      
      * remove patch command in eigen, test=develop
      
      * fix compilation error caused by call Eigen function with float16 and bfloat16, test=develop
      
      * fix unittest error, test=develop
      
      * fix unittest error caused by precision, test=develop
      
      * remove patch files used by old version eigen, test=develop
    • update compilation with C++14 (#31815) · 587d99ae
      wuhuanzhou committed
      * update compilation with C++14, test=develop
      
      * fix compilation error in eigen, test=develop
    • fix some bug in transformer training in xpu (#31918) · 52b05bac
      taixiurong committed
    • [ROCM] Add ROCm support for warpctc op (#31817) · ef8323d4
      furnace committed
      * bugfix for warpctc
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix WARPCTC_WITH_HIP invalid
      
      * Add logs to find out why can not dlopen libwarpctc.so
      
      * fix warpctc commit id
      
      * fix unit test test_warpctc_op
      
      * Optime failed log for dlopen
      
      * Optime failed log for dlopen
      
      * Delete extra changes
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * Add is_compiled_with_rocm for test_warpctc_op
      
      * fix warpctc commit id
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * fix code style problems
  2. 30 Mar, 2021 1 commit
  3. 23 Mar, 2021 1 commit
  4. 19 Mar, 2021 1 commit
  5. 18 Mar, 2021 1 commit
  6. 17 Mar, 2021 1 commit
  7. 16 Mar, 2021 1 commit
    • Optimize compilation with Ninja (#31449) · 41e9ecfd
      wuhuanzhou committed
      * Optimize compilation with Ninja, notest, test=windows_ci, test=windows_op
      
      * no cache on windows ci, notest, test=windows_ci, test=windows_op
      
      * delete /Zc:inline compiled in NVCC, notest, test=windows_ci, test=windows_op
      
      * fix test_warpctc_op, notest, test=windows_ci
      
      * remove test code, test=develop
  8. 09 Mar, 2021 1 commit
  9. 08 Mar, 2021 1 commit
  10. 04 Mar, 2021 2 commits
  11. 03 Mar, 2021 1 commit
  12. 02 Mar, 2021 2 commits
    • [ROCM] update fluid operators for rocm (part5), test=develop (#31258) · 65bcaeb0
      Qi Li committed
      * [ROCM] update fluid operators for rocm (part5), test=develop
      
      * address review comments, test=develop
      
      * fix typo, test=develop
    • lamb_op_xpu;test=kunlun (#31012) · d79fdc3d
      Gradie committed
      * lamb_op_xpu;test=kunlun
      
      * modify lamb_op_xpu.cc;test=kunlun
      
      * delete atol lamb_op_xpu; test=kunlun
      
      * update xpu.cmake;test=kunlun
      
      * test_error 1e-5,lamb_op_xpu;test=kunlun
      
      * error1e-5,lamb_op_xpu,test=kunlun
      
      * delete atol lamb_xpu;test=kunlun
      
      * modify atol,lamb_op_xpy;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu, XPUOptest;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu,modify xpu_cmake; test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu,modify xpucmake;test=kunlun
  13. 01 Mar, 2021 1 commit
  14. 27 Feb, 2021 1 commit
  15. 26 Feb, 2021 1 commit
  16. 25 Feb, 2021 1 commit
  17. 23 Feb, 2021 2 commits
  18. 20 Feb, 2021 2 commits
  19. 19 Feb, 2021 1 commit
  20. 09 Feb, 2021 1 commit
  21. 04 Feb, 2021 1 commit
  22. 03 Feb, 2021 2 commits
  23. 02 Feb, 2021 1 commit
  24. 27 Jan, 2021 1 commit
    • REUPLOAD Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30719) · f8da5536
      jakpiase committed
      * added external reorder to profiler
      
      * resolved conflict
      
      * added enable_static
      
      * initial version of lstm, not working yet
      
      * added lstm to operators.cmake
      
      * added vanilla lstm mkldnn op
      
      * added peephole weights integration
      
      * minor changes
      
      * added formatting
      
      * added fusion_lstm_mkldnn to static_whitelist
      
      * added formatting
      
      * removed comment
      
      * moved use_peepholes attribute inside is_cached block
      
      * reverted wrong changes
      
      * minor formatting change
      
      * minor changes
      
      * changed stream handling
      
      * minor change
      
      * added datatype to GetExpectedKernelType()
      
      * added reading stream from TLS
  25. 26 Jan, 2021 2 commits
  26. 25 Jan, 2021 1 commit
  27. 22 Jan, 2021 1 commit
  28. 21 Jan, 2021 2 commits
  29. 19 Jan, 2021 1 commit