1. 01 8月, 2023 1 次提交
    • H
      [NewIR]New ir support print op (#55648) · 75c29ac1
      hong 提交于
      * new ir support print op
      
      * fix gpu bug
      
      * fix bug
      
      * update
      
      * remove layout to string
      
      * remove usless header
      
      * polish code
      
      * fix bug
      
      * posolis code
      75c29ac1
  2. 31 7月, 2023 6 次提交
  3. 28 7月, 2023 1 次提交
  4. 27 7月, 2023 1 次提交
  5. 26 7月, 2023 2 次提交
  6. 25 7月, 2023 3 次提交
  7. 20 7月, 2023 2 次提交
  8. 19 7月, 2023 1 次提交
    • Z
      delete relu6_raw (#55383) · 56d46ccc
      zhangyuqin1998 提交于
      * delete relu6_raw
      
      * fix codestyle
      
      * Update test_mkldnn_matmul_activation_fuse_pass.py
      
      * fix
      
      * Update backward.yaml
      
      * Update ops.yaml
      
      * Update backward.yaml
      56d46ccc
  9. 14 7月, 2023 1 次提交
  10. 13 7月, 2023 3 次提交
  11. 12 7月, 2023 1 次提交
  12. 11 7月, 2023 2 次提交
    • MarDino's avatar
      Integrate rmsnorm kernel (#54998) · 97d3d6ee
      MarDino 提交于
      * add rmsnorm kernel
      * add static graph test
      * fix round type
      * use alignas to avoid msvc compile error
      * remove redundant headerfile to avoid rocm compile error
      * fix rocm compile not found cub
      * Add document
      97d3d6ee
    • FormlessUnit's avatar
      Linear compress (#55128) · f4290a92
      FormlessUnit 提交于
      * rename weight_only/llm.int8
      f4290a92
  13. 07 7月, 2023 1 次提交
  14. 06 7月, 2023 1 次提交
  15. 05 7月, 2023 1 次提交
  16. 03 7月, 2023 4 次提交
  17. 29 6月, 2023 1 次提交
    • N
      Add fused_rope forward op (#54351) · a215c46a
      niuliling123 提交于
      * style
      
      * more
      
      * update ctest
      
      * Update legacy_backward.yaml
      
      * Update legacy_ops.yaml
      
      * Update legacy_ops.yaml
      
      * update
      
      * update
      
      * update for move
      a215c46a
  18. 28 6月, 2023 3 次提交
  19. 27 6月, 2023 2 次提交
  20. 26 6月, 2023 1 次提交
    • S
      remove ops from OpsWithFluidKernelNeedMoveToPhi set (#54007) · 733eca85
      Sonder 提交于
      * remove ops from OpsWithFluidKernelNeedMoveToPhi set
      
      * open static build flag
      
      * OpsWithFluidKernelNeedMoveToPhi
      
      * open new_executor_static_build
      
      * add infermate for cudnn_lstm
      
      * fix
      
      * update
      
      * fix
      
      * update
      
      * update
      
      * update
      
      * fix pow2 decay
      
      * fix pow2 decay
      
      * recover analysis_predictor.cc
      
      * fix pow2 decay
      
      * fix cudnn lstm
      
      * add output register info for svd
      
      * fix pow2_decay_with_linear_warmup_kernel
      
      * recover test lstm cudnn
      
      * recover svg register codes
      
      * fix register info
      
      * fix reduce sum register info
      
      * add output info for adadelta
      
      * add output info for adadelta
      
      * add output info for adamax
      
      * fix complex abs register info
      
      * add register info for cudnn_lstm_grad
      
      * recover
      
      * fix lstm cudnn
      
      * fix
      
      * fix xpu output registe info
      
      * remove std::cout
      
      * add backend
      
      * remove output info in pow2_decay_with_linear_warmup_kernel
      
      * add judgment in TensorShouldBeFakeInitialized
      
      * recover power_
      
      * close new_executor_static_build
      
      * fix set_value_xpu
      733eca85
  21. 20 6月, 2023 2 次提交