1. 14 Jul, 2023 5 commits
  2. 13 Jul, 2023 5 commits
  3. 12 Jul, 2023 1 commit
    • Support selected rows new ir (#54987) · fc66b5d7
      hong committed
      * refine program translator
      
      * fix warning: not override
      
      * fix bug
      
      * merge new modifications
      
      * modify by reviews
      
      * resolve conflicts
      
      * resolve conflicts
      
      * fix
      
      * fix
      
      * update
      
      * support selected rows
      
      * update
      
      * add selectrows
      
      * fix bug
      
      * add ut
      
      * refine code
      
      * refine code
      
      * update
      
      * update
      
      * support selected rows
      
      * support selected rows
      
      * support dense tensor
      
      * remove useless code
      
      * polish code
      
      * remote standalone executor test
      
      ---------
      Co-authored-by: kangguangli <kangguangli@hotmail.com>
      Co-authored-by: zhangbo9674 <zhangbo54@baidu.com>
  4. 11 Jul, 2023 4 commits
    • Integrate rmsnorm kernel (#54998) · 97d3d6ee
      MarDino committed
      * add rmsnorm kernel
      * add static graph test
      * fix round type
      * use alignas to avoid msvc compile error
      * remove redundant headerfile to avoid rocm compile error
      * fix rocm compile not found cub
      * Add document
    • [NewIR] Fix new ir unsqueeze op bug (#55212) · 852d7a12
      hong committed
      * support optional input in new_ir
      
      * polish code
      
      * add coverage test
      
      * update
      
      * update
      
      * add unit test
      
      * remove duplicate code
      
      * update
      
      * fix assign error
      
      * revert test arg min max
      
      * update
      
      * fix bug
      
      * polish code
      
      * update
      
      * fix unique and close op bug
      
      * update
      
      * update
      
      * revert test code
      
      * revert unique test
      
      * polish code
      
      * remove useless code
      
      ---------
      Co-authored-by: zhangbo9674 <zhangbo54@baidu.com>
    • Linear compress (#55128) · f4290a92
      FormlessUnit committed
      * rename weight_only/llm.int8
    • fix new ir sigmoid cross entropy op (#55284) · 22c49634
      hong committed
  5. 10 Jul, 2023 2 commits
  6. 07 Jul, 2023 3 commits
  7. 06 Jul, 2023 1 commit
  8. 05 Jul, 2023 2 commits
  9. 04 Jul, 2023 2 commits
  10. 03 Jul, 2023 6 commits
  11. 01 Jul, 2023 1 commit
    • [IR] refine program translator (#54719) · 5d40f2a2
      kangguangli committed
      * refine program translator
      
      * fix warning: not override
      
      * fix bug
      
      * merge new modifications
      
      * modify by reviews
      
      * resolve conflicts
      
      * resolve conflicts
      
      * fix
      
      * fix
      
      * fix conflicts
      
      * add unittest for special op transcriber
      
      * set cpu as default backend
      
      * modify by reviews
  12. 30 Jun, 2023 6 commits
  13. 29 Jun, 2023 2 commits
    • Refactor op info parser (#54859) · f18d538b
      hong committed
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attribute print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish code and add python test
      
      * add test
      
      * fix test error
      
      * add env flag
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * fix build_static bug
      
      * fix type test error
      
      * update cmake
      
      * disable test in windows
      
      * update
      
      * update
      
      * fix bug
      
      * split file
      
      * fix conflict
      
      * polish code and fix conflict
      
      * support place transformer
      
      * finish bug
      
      * add gpu flags
      
      * fix with cuda macro
      
      * add fetch kernel
      
      * support fetch var in new ir
      
      * fix bug
      
      * polish code
      
      * change array equal to np.testing
      
      * support feed in new ir
      
      * update
      
      * fix bug
      
      * try to hack combine op
      
      * add scope guard
      
      * revert atan2 op
      
      * add scope guard
      
      * update
      
      * polish code
      
      * update
      
      * refactor build kernel context
      
      * fix unit test bug
      
      * polish code
      
      * use original order
      
      * remove useless code
      
      * polish code
      
      * fix bug
    • Add fused_rope forward op (#54351) · a215c46a
      niuliling123 committed
      * style
      
      * more
      
      * update ctest
      
      * Update legacy_backward.yaml
      
      * Update legacy_ops.yaml
      
      * Update legacy_ops.yaml
      
      * update
      
      * update
      
      * update for move