1. 14 Feb, 2022 (1 commit)
    • [pten] add split kernel (#39060) · d0df5632
      Committed by chentianyu03
      * add split kernel
      
      * add split kernel signature
      
      * fix split bug
      
      * modify MakePtenScalarArrayFromVarList
      
      * modify MakePtenScalarArrayFromVarList
      
      * fix split windows register error
      
      * add test case for split kernel
      
      * replace raw split kernel with pten kernel
      
      * fix makeScalar/ScalarArray bug
      
      * remove debug log
      
      * remove int64_t type in buildPtcontext
      
      * update by code review
      
      * fix split dev test failed
      
      * change DenseTensorMeta to MetaTensor
      
      * change split api code from auto gen to manual
      
      * split cuda kernel support bfloat16 type
      
      * fix conflict
      
      * rm raw split kernel
      
      * merge develop branch
      
      * change to pten::errors
  2. 30 Jan, 2022 (1 commit)
    • Add a Sparse OP : to_sparse_coo (#39264) · 78132fe1
      Committed by zhangkaihuo
      * dense_to_sparse_coo
      
      * optimize unit testing; support rocm
      
      * 1. delete fluid related header file
      2. update the copyright
      
      * fix hipMemcpy
      
      * update dense_to_sparsecoo
      
      * add namespace sparse
  3. 21 Jan, 2022 (1 commit)
  4. 28 Dec, 2021 (1 commit)
  5. 23 Dec, 2021 (1 commit)
  6. 20 Dec, 2021 (1 commit)
  7. 29 Nov, 2021 (1 commit)
    • [Pten] Add reduce mean kernel, replace with mean API (#37559) · f9e9fd19
      Committed by chentianyu03
      * add pten reduce kernel
      
      * add reduce_sum kernel
      
      * update attribute args and order
      
      * make out dtype undefined
      
      * fix empty input error
      
      * merge develop branch
      
      * rename sum as reduce function
      
      * rename sum as reduce function
      
      * fix reducekernelImpl args error
      
      * add reduce cuda kernel
      
      * modify dims type to const &
      
      * remove unused log
      
      * fix reduce_all out eigen function error
      
      * remove unused codes
      
      * add the missing sum api define and testcase
      
      * merge develop branch
      
      * fix sum test axis value error
      
      * replace pten mean kernel with reduce_mean
      
      * recover mean cuda to original implementation
  8. 22 Nov, 2021 (1 commit)
    • [PTen] Add variable transform to/from ptenTensor and add cast kernel (#36916) · 5caa6fc5
      Committed by chentianyu03
      * add cast kernel
      
      * add cast cuda kernel
      
      * add cast kernel
      
      * make cast kernel output dtype undefined
      
      * get cast dtype from vardesc
      
      * move cast to manipulation and add test case
      
      * add castinfershape
      
      * avoid reinitializing variable
      
      * InitializeVariable support datatype
      
      * merge develop branch
      
      * fix merge bug
      
      * revert modify initializeVariable
      
      * revert modify on InitializeVariable
      
      * revert modify on InitializeVariable
      
      * mutable support reset dtype
      
      * enable make pten tensor from variable when def_arg.type is undefined
      
      * fix build pten ctx start_idx error
      
      * copy pten out tensor to variable
      
      * merge develop branch
      
      * fix non pten kernel cast failed
      
      * add reset allocation place for remake tensor
      
      * fix inplace realloc error
      
      * add mutable on pten kernels and remove unused cast files
      
      * rename function names
      
      * fix output type error
      
      * fix conflict with develop branch
      
      * set data type to variable with pten's dtype
      
      * fix test_cast_api type mismatch
      
      * DenseTensor mutable_data supports 0-byte values
      
      * fix the inplace bug of reshape kernel
      
      * fix pten.backend != variable.place when moving storage, place mismatch bug
      
      * fix conflict with develop branch
      
      * Fix bug of paddle::experimental::MovesStorage
      
      * fix ReMakePtenDenseTensor place mismatch bug
      
      * Revert "fix ReMakePtenDenseTensor place mismatch bug"
      
      This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a.
      
      * fix ReMakePtenDenseTensor place mismatch bug
      
      * reverts the set_lod interface, test=develop
      
      * modify according to review comments
      
      * modify error message
      
      * add & for const input arguments
      
      * add reference in params
      
      * elementwise_sub add mutable_data
      
      * fix ResetHolderWithType check size bug
      
      * add dependence pten_tensor to test_cast_api object
      
      * remove unused code to pass ci coverage
      Co-authored-by: Chen Weihang <chenweihang@baidu.com>
      Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
  9. 16 Nov, 2021 (1 commit)
    • Add API and unit test for reshape (#37232) · 79b49c20
      Committed by YuanRisheng
      * reshape kernel refactor
      
      * fix compile bugs when run ci
      
      * support xpu for reshape
      
      * fix bugs when run unittest in kunlun ci
      
      * fix compile bugs when run kunlun
      
      * polish code according to suggestions
      
      * add api and unit test for reshape
  10. 12 Nov, 2021 (1 commit)
    • [Pten]Refactor the Elementwise_add Kernel (#37043) · c1310343
      Committed by YuanRisheng
      * elementwise_add kernel refactor
      
      * fix compile bugs in elementwise_add refactor
      
      * fix compile bugs when run in npu/xpu
      
      * fix bugs when run unit test
      
      * fix bugs when run ci-windows
      
      * modify code as recommended
      
      * code format adjust
      
      * fix bugs when run ci
      
      * fix compile bug when run in ci-windows
  11. 05 Nov, 2021 (1 commit)