1. 18 Feb, 2022 (1 commit)
  2. 17 Feb, 2022 (4 commits)
  3. 16 Feb, 2022 (3 commits)
  4. 15 Feb, 2022 (5 commits)
    • move histogram to pten (#39496) · 556f6eb0
      hong committed
      * move histogram to pten; test=develop
      
      * fix format error; test=develop
      
      * fix histogram kernel format; test=develop
    • Move Abs OP to pten (#39492) · fb473067
      From00 committed
      * Move Abs op to pten
      
      * Fix NPU compilation error
      
      * Fix CI error
      
      * Use LaunchSameDimsElementwiseCudaKernel in pten
    • [Pten]Move expand_v2 to pten (#39471) · 2d16d69b
      Linjie Chen committed
      * move expand to pten
      
      * move expand_v2 to pten
      
      * move expand_v2 to pten
      
      * fix grad register
      
      * fix grad register
      
      * fix tensorcopy
      
      * fix tensorcopy
      
      * fix tensorcopy
      
      * fix tensorcopy
      
      * fix tensorcopy
      
      * fix ci
      
      * fix tensorcopy
    • [PTen] Polish trace moving (#39510) · ab866777
      Chen Weihang committed
      * polish trace moving
      
      * remove useless header
    • [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 committed
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is to remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu successfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: xiongkun <xiongkun03@baidu.com>
  5. 14 Feb, 2022 (1 commit)
    • [pten] add split kernel (#39060) · d0df5632
      chentianyu03 committed
      * add split kernel
      
      * add split kernel signature
      
      * fix split bug
      
      * modify MakePtenScalarArrayFromVarList
      
      * modify MakePtenScalarArrayFromVarList
      
      * fix split windows register error
      
      * add test case for split kernel
      
      * replace raw split kernel with pten kernel
      
      * fix makeScalar/ScalarArray bug
      
      * remove debug log
      
      * remove int64_t type in buildPtcontext
      
      * update by code review
      
      * fix split dev test failed
      
      * change DenseTensorMeta to MetaTensor
      
      * change split api code from auto gen to manual
      
      * split cuda kernel support bfloat16 type
      
      * fix conflict
      
      * rm raw split kernel
      
      * merge develop branch
      
      * change to pten::errors
  6. 11 Feb, 2022 (2 commits)
  7. 10 Feb, 2022 (1 commit)
    • move Masked select to pten (#39193) · e2ad433b
      hong committed
      * move masked select cpu kernel
      
      * add masked selected gpu kernel; test=develop
      
      * fix bugs; test=develop
      
      * bug fix; test=develop
      
      * bug fix; test=develop
      
      * add namespace to set mask array; test=develop
      
      * fix bug; test=develop
      
      * fix bugs; test=develop
      
      * fix ddim bug; test=develop
      
      * fix npu op bug; test=develop
      
      * fix xpu dependency bug; test=develop
      
      * move kernel args to sig.cc; test=develop
  8. 09 Feb, 2022 (6 commits)
  9. 08 Feb, 2022 (3 commits)
  10. 06 Feb, 2022 (1 commit)
  11. 29 Jan, 2022 (1 commit)
    • [PTen] Tidy pten core headers (#39188) · dd990981
      Chen Weihang committed
      * open header for custom kernel
      
      * add core utils
      
      * tidy core code
      
      * tidy header
      
      * tidy include
      
      * tidy namespace
      
      * resolve conflict
      
      * fix unittest and coverage
      
      * remove platform using
      
      * resolve conflict
      
      * resolve conflict
      
      * fix digamma namespace error
      
      * fix xpu full kernel error
      
      * fix xpu full kernel error
      
      * polish details
      
      * add place for lib storage
  12. 28 Jan, 2022 (2 commits)
  13. 27 Jan, 2022 (4 commits)
    • implement AllocateFrom (#39280) · d89f246c
      zhangkaihuo committed
    • refactor elementwise sub grad (#39225) · 7a1e1193
      YuanRisheng committed
    • [PTen]Support AllocateFrom in Tensor and Alloc/HostAlloc in Context (#39022) · 5631da9c
      Aurelius84 committed
      * Support allocate_from in Tensor and allocate_data in Context
      
      * fix #ifdef CUDA
      
      * fix cycle depends
      
      * fix test_xxx_dev_api failed
      
      * fix windows compiling error
      
      * fix unittest
      
      * modify into PImpl
      
      * fix selected rows
      
      * add TODO comment
      
      * refine interface according to reviewer
    • Add SparseCooTensor and SparseCsrTensor (#38906) · a7edb3f3
      zhangkaihuo committed
      * fix bug:
      1. atten: set the default value of attn_dropout_rate to None
      2. ffn: add activation parameter
      
      * for pure fp16
      
      * Add a SparseCsrTensor
      
      * remove unused functional
      
      * remove const
      
      * remove SetMemoberTensor
      
      * remove non_zero_nums_, the number of non zero elements of each batch can be obtained from the crows
      
      * SparseCooTensor
      
      * add SetMember
      
      * merge upstream; add SetMember
      
      * merge upstream
      
      * merge upstream; add newline at end of file
      
      * add newline at end of file
      
      * remove newline at end of file
      
      * remove newline at end of file
      
      * stash
      
      * use pten::framework::make_ddim
      
      * use pten::framework::make_ddim
      
      * merge upstream; use the latest mutable_data
      
      * merge upstream; use the latest mutable_data
      
      * return mutable dense tensor
  14. 26 Jan, 2022 (1 commit)
  15. 25 Jan, 2022 (3 commits)
  16. 24 Jan, 2022 (2 commits)
    • [Pten]Refactor elementwise_add grad / double grad / triple grad Kernel and... · 3bf3a6ee
      YuanRisheng committed
      [Pten]Refactor elementwise_add grad / double grad / triple grad Kernel and move them to pten (#39048)
      
      * refactor elementwise add grad
      
      * fix compile bugs
      
      * fix unit test bugs
      
      * fix file conflicts
      
      * fix bugs when buildPtenContext
    • [Refactoring Tensor PR #5] replace storage with pten allocation (#39085) · a56e16a7
      石晓伟 committed
      * updates callers, test=develop
      
      * updates tensor, test=develop
      
      * fixes errors, test=develop
      
      * remove some dtypes, test=develop
      
      * fix errors in the base storage modification, test=develop
      
      * fixes a bug, test=develop
      
      * fixes the bugs in push the whole, test=develop
      
      * updates, test=develop
      
      * update
      
      * update, test=develop
      
      * fixes the mac-py3 CI, test=develop
      
      * remove the storage impl, test=develop
      
      * updates some codes, test=develop
      
      * update, test=develop
      
      * updates pten allocation, test=develop