1. 14 2月, 2022 1 次提交
    • C
      [pten] add split kernel (#39060) · d0df5632
      chentianyu03 提交于
      * add split kernel
      
      * add split kernel signature
      
      * fix split bug
      
      * modify MakePtenScalarArrayFromVarList
      
      * modify MakePtenScalarArrayFromVarList
      
      * fix split windows register error
      
      * add test case for split kernel
      
      * replace raw split kernel with pten kernel
      
      * fix makeScalar/ScalarArray bug
      
      * remove debug log
      
      * remove int64_t type in buildPtcontext
      
      * update by code review
      
      * fix split dev test failed
      
      * change DenseTensorMeta to MetaTensor
      
      * change split api code from auto gen to manual
      
      * split cuda kernel support bfloat16 type
      
      * fix conflict
      
      * rm raw split kernel
      
      * merge develop branch
      
      * change to pten::errors
      d0df5632
  2. 11 2月, 2022 1 次提交
  3. 10 2月, 2022 2 次提交
    • H
      move Masked select to pten (#39193) · e2ad433b
      hong 提交于
      * move masked select cpu kernel
      
      * add masked selected gpu kernel; test=develop
      
      * fix bugs; test=develop
      
      * bug fix; test=develop
      
      * bug fix; test=develop
      
      * add namespace to set mask array; test=develop
      
      * fix bug; test=develop
      
      * fix bugs; test=develop
      
      * fix ddim bug; test=develop
      
      * fix npu op bug; test=develop
      
      * fix xpu dependecy bug; test=develop
      
      * move kernel args to sig.cc; test=develop
      e2ad433b
    • C
      [PTen] Add standard kernel suffix set (#39404) · c7c1db33
      Chen Weihang 提交于
      * add standard_suffix_set_and_remove_reshape_with_xshape
      
      * revert reshape change
      
      * polish reduce name
      c7c1db33
  4. 09 2月, 2022 3 次提交
    • H
      Move trace op to pten (#39227) · d7dddf94
      hong 提交于
      * add trace op
      
      * bug fix
      
      * bug fix; test=develop
      
      * thrust bug fix; test=develop
      
      * remove useless register; test=develop
      
      * fix bug; test=develop
      
      * update trace kernel; test=develop
      
      * move kernel args to trace_sig; test=develop
      d7dddf94
    • L
      [pten] fix typo, muliply_raw -> multiply_raw (#39391) · f810d755
      Leo Chen 提交于
      f810d755
    • H
      Move norm to pten (#39324) · ece200b3
      hong 提交于
      * add norm cpu
      
      * update code;
      
      * norm bug fix
      
      * move norm op to pten; test=develop
      
      * move norm op to pten; test=develop
      
      * add norm util; test=develop
      
      * fix norm npu bug; test=develop
      
      * fix norm kernel bug; test=develop
      
      * move kernel args to pten; test=develop
      
      * move kernel args to pten sig; test=develop
      ece200b3
  5. 08 2月, 2022 1 次提交
  6. 02 2月, 2022 1 次提交
  7. 28 1月, 2022 1 次提交
  8. 26 1月, 2022 1 次提交
    • C
      [PTen] Unify InferMeta(Shape) Function in pten and fluid op (#38976) · b75507d3
      Chen Weihang 提交于
      * infermeta context init design
      
      * support infermeta called in fluid op
      
      * add hasattr and attr methods
      
      * add dygraah GetVarPtrs support
      
      * rename arg_map_context to arg_map_utils
      
      * add registry for arg map func
      
      * resolve conflit
      
      * refactor op utils design
      
      * polish meta config
      
      * fix details
      
      * remove hasattr method
      
      * resolve conflit
      
      * revert cmake order change
      
      * revert some change
      
      * change init pos
      
      * fix compile faileed
      
      * fix typo
      
      * fix inference failed
      
      * fix windows ccompile failed
      
      * polish format
      Co-authored-by: NWang Huan <wanghuan29@baidu.com>
      b75507d3
  9. 22 1月, 2022 1 次提交
  10. 10 1月, 2022 1 次提交
  11. 09 12月, 2021 1 次提交