1. 30 6月, 2021 1 次提交
    • J
      Added matmul_v2 BF16/FP32 FWD kernel (#33750) · 24783c84
      jakpiase 提交于
      * added matmul_v2 bf16/fp32 FWD kernel
      
      added matmul_v2 bf16/fp32 FWD kernel
      
      * added formatting
      
      * removed some tests due to timeout in CI
      
      * refactored tests
      
      * merged tests classes into one file
      
      * minor change
      
      * removed test guard for CUDA
      
      * remove skipIf
      
      * changes after review
      
      * formated one file
      
      * minor change
      
      * added skipping UT in CUDA place
      24783c84
  2. 24 6月, 2021 1 次提交
  3. 23 6月, 2021 1 次提交
    • J
      Added split op bf16/fp32 oneDNN kernel (#33584) · 68106509
      jakpiase 提交于
      * base changes for split op
      
      * 90% of split functionality added
      
      * full fp32 functionality
      
      * added bf16 test
      
      * added submemory caching
      
      * added bf test to static mode whitelist
      
      * minor change
      
      * enabled split op for inference
      
      * minor fix
      
      * minor fix
      68106509
  4. 17 6月, 2021 2 次提交
  5. 07 6月, 2021 1 次提交
  6. 26 5月, 2021 2 次提交
  7. 25 5月, 2021 1 次提交
  8. 29 4月, 2021 1 次提交
  9. 21 4月, 2021 1 次提交
  10. 14 4月, 2021 2 次提交
  11. 30 3月, 2021 1 次提交
  12. 22 3月, 2021 1 次提交
  13. 19 3月, 2021 1 次提交
  14. 04 3月, 2021 1 次提交
  15. 02 3月, 2021 1 次提交
    • G
      lamb_op_xpu;test=kunlun (#31012) · d79fdc3d
      Gradie 提交于
      * lamb_op_xpu;test=kunlun
      
      * modify lamb_op_xpu.cc;test=kunlun
      
      * delete atol lamb_op_xpu; test=kunlun
      
      * update xpu.cmake;test=kunlun
      
      * test_error 1e-5,lamb_op_xpu;test=kunlun
      
      * error1e-5,lamb_op_xpu,test=kunlun
      
      * delete atol lamb_xpu;test=kunlun
      
      * modify atol,lamb_op_xpy;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu, XPUOptest;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu,modify xpu_cmake; test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu,modify xpucmake;test=kunlun
      d79fdc3d
  16. 18 2月, 2021 1 次提交
    • J
      Add Conv Transpose BF16 (#30877) · caf9d398
      joanna.wozna.intel 提交于
      * Add conv transpose BF16
      
      * Share function GetWeightsTz
      
      * Adjust to review and fix op compatibility
      
      * Add bias to unique handler name
      
      * Remove errors related to paddle enforce
      
      * Add conv2d_transpose to bf16 list and kernel refator
      caf9d398
  17. 03 2月, 2021 1 次提交
  18. 27 1月, 2021 1 次提交
    • J
      REUPLOAD Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30719) · f8da5536
      jakpiase 提交于
      * added external reorder to profiler
      
      * resolved conflict
      
      * added enable_static
      
      * initial version of lstm, not working yet
      
      * added lstm to operators.cmake
      
      * added vanilla lstm mkldnn op
      
      * added peephole weights integration
      
      * minor changes
      
      * added formatting
      
      * added fusion_lstm_mkldnn to static_whitelist
      
      * added formatting
      
      * removed comment
      
      * moved use_peepholes attribute inside is_cached block
      
      * reverted wrong changes
      
      * minor formatting change
      
      * minor changes
      
      * changed stream handling
      
      * minor change
      
      * added datatype to GetExpectedKernelType()
      
      * added reading stream from TLS
      f8da5536
  19. 26 1月, 2021 2 次提交
  20. 31 12月, 2020 1 次提交
  21. 21 12月, 2020 1 次提交
  22. 09 12月, 2020 1 次提交
    • W
      remove addcmul (#28937) · dc8bb76c
      Wei Shengyu 提交于
      * remove addcmul
      
      * remove unittest and other related code of addcmul
      
      * fix bug
      
      * fix merge conflict
      dc8bb76c
  23. 26 11月, 2020 1 次提交
  24. 25 11月, 2020 1 次提交
  25. 24 11月, 2020 1 次提交
  26. 20 11月, 2020 1 次提交
  27. 19 11月, 2020 1 次提交
  28. 17 11月, 2020 1 次提交
  29. 16 11月, 2020 1 次提交
  30. 06 11月, 2020 1 次提交
  31. 05 11月, 2020 1 次提交
  32. 02 11月, 2020 1 次提交
  33. 22 10月, 2020 1 次提交
  34. 21 10月, 2020 1 次提交
    • P
      add static_mode_white_list (#28112) · 2d45d9a0
      pangyoki 提交于
      * add static_mode_white_list
      
      * add Mac CI static list
      
      * add Win CI white_list
      
      * add Coverage and Py3 CI white_list, add test_unittest
      2d45d9a0