1. 09 2月, 2022 1 次提交
    • Z
      Add a Sparse Op: to_sparse_csr (#39333) · 76d527e1
      zhangkaihuo 提交于
      * implement AllocateFrom
      
      * dense_to_sparse_coo
      
      * optimize unit testing; support rocm
      
      * 1. delete fluid related header file
      2. update the copyright
      
      * fix hipMemcpy
      
      * update dense_to_sparsecoo
      
      * add namespace sparse
      
      * sparse_csr_to_dense
      
      * test to_sparse_coo: csr_to_coo
      
      * fix writing error
      
      * to_sparse_csr: dense_to_sparse_csr and sparse_coo_to_csr
      
      * fix check shape
      
      * fix unit test
      
      * replace CUDADeviceContext by GPUContext
      76d527e1
  2. 08 2月, 2022 2 次提交
  3. 07 2月, 2022 1 次提交
  4. 04 2月, 2022 1 次提交
  5. 30 1月, 2022 2 次提交
    • C
      [PTen] Change all InferMeta functions (#39222) · 7e29cea9
      Chen Weihang 提交于
      * change unary infermeta
      
      * change other infermeta
      
      * change all infermeta format
      
      * resolve conflit
      
      * fix test failed
      
      * resolve reshape conflit
      
      * fix compile failed
      
      * adapt auto api gen
      
      * fix reshape failed
      
      * fix concat failed
      
      * resolve conflict
      7e29cea9
    • Z
      Add a Sparse OP : to_sparse_coo (#39264) · 78132fe1
      zhangkaihuo 提交于
      * dense_to_sparse_coo
      
      * optimize unit testing; support rocm
      
      * 1. delete fluid related header file
      2. update the copyright
      
      * fix hipMemcpy
      
      * update dense_to_sparsecoo
      
      * add namespace sparse
      78132fe1
  6. 29 1月, 2022 2 次提交
    • C
      rename utils to manual (#39320) · 96bcf2df
      Chen Weihang 提交于
      96bcf2df
    • C
      [PTen] Tidy pten core headers (#39188) · dd990981
      Chen Weihang 提交于
      * open header for custom kernel
      
      * add core utils
      
      * tidy core code
      
      * tify header
      
      * tidy include
      
      * tidy namespace
      
      * resolve conflit
      
      * fix unittest and coverage
      
      * remove platform using
      
      * resolve conflict
      
      * resolve conflict
      
      * fix digamma namespace error
      
      * fix xpu full kernel error
      
      * fix xpu full kernel error
      
      * polish details
      
      * add place for lib storage
      dd990981
  7. 28 1月, 2022 2 次提交
  8. 27 1月, 2022 2 次提交
    • Z
      【PTen】Remove ReMakePtenDenseTensor (#39094) · 98c1829b
      zyfncg 提交于
      * remove remake densetensor
      
      * fix eager test error
      
      * fix bug in eager
      98c1829b
    • A
      [PluggableDevice] Add custom kernel support based on pten kernel management (#38848) · a8879215
      Aganlengzi 提交于
      * [Demo] custom kernel based on pten kernel
      
      * merge and npu custom work well
      
      * del comments
      
      * delete other code
      
      * fix CUDAContext
      
      * fix not found small_vector.h
      
      * support NPU
      
      * fix NPUContext
      
      * fix DeviceContext support
      
      * add UT
      
      * fix call
      
      * add UT
      
      * fix
      
      * fix for comments and ut
      
      * add MACRO control
      
      * fix multi input output
      
      * support env CUSTOM_DEVICE_ROOT
      
      * deal with special cases
      
      * fix for Windows
      
      * try coverage with test_custom_kernel_dot.py
      
      * fix test_custom_kernel_dot
      
      * fix test_custom_kernel_dot
      
      * fix merge
      
      * fix merge
      
      * fix CI
      
      * update
      
      * merge and fix
      
      * remove WITH_CUSTOM_KERNEL
      
      * fix merge
      
      * merge and fix
      
      * fix ut
      
      * fix ut for mac
      
      * add more UT
      
      * add more UT
      
      * fix
      a8879215
  9. 25 1月, 2022 5 次提交
  10. 24 1月, 2022 3 次提交
    • [Refactoring Tensor PR #5] replace storage with pten allocation (#39085) · a56e16a7
      石晓伟 提交于
      * updates callers, test=develop
      
      * updates tensor, test=develop
      
      * fixes errors, test=develop
      
      * remove some dtypes, test=develop
      
      * fix errors in the base storage modification, test=develop
      
      * fixes a bug, test=develop
      
      * fixes the bugs in push the whole, test=develop
      
      * updates, test=develop
      
      * update
      
      * update, test=develop
      
      * fixes the mac-py3 CI, test=develop
      
      * remove the storage impl, test=develop
      
      * updates some codes, test=develop
      
      * update, test=develop
      
      * updates pten allocation, test=develop
      a56e16a7
    • Z
      Fixed ResizeAndAllocate issues (#39101) · 9cfa811e
      Zhanlue Yang 提交于
      9cfa811e
    • Z
      Backward C++ API Code-Generation (#39057) · f83d1c0b
      zyfncg 提交于
      * add config of backward-api auto-gene
      
      * fix compile bug
      
      * remove wrong header
      
      * rename grad_api to backward_api
      
      * modify .gitignore
      f83d1c0b
  11. 21 1月, 2022 4 次提交
  12. 20 1月, 2022 2 次提交
  13. 18 1月, 2022 1 次提交
  14. 17 1月, 2022 1 次提交
    • W
      [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber 提交于
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
      c48a9ad5
  15. 15 1月, 2022 3 次提交
  16. 14 1月, 2022 1 次提交
  17. 13 1月, 2022 2 次提交
    • C
      [pten]Remove pten/include dir files (#38878) · 7e0292ea
      chentianyu03 提交于
      * move dot_dev api into dot_kernel.h
      
      * add infermate header
      
      * modify to dotkerel in dot_op.h
      
      * mvoe conj dev api into complex_kernel.h
      
      * move sign dev api into  sign_kernel.h
      
      * move scale dev api into kernel.h and remove infermete.h
      
      * rm paddle/pten/include/math.h
      
      * rm paddle/pten/include/math.h
      
      * rm include dir
      
      * rm paddle/pten/include/math.h
      
      * fix conflict with develop branch
      
      * rm devContext in conj_op.h
      
      * add the missing complex_kernel header
      7e0292ea
    • splits allocation for pten, test=develop (#38853) · 277cf900
      石晓伟 提交于
      277cf900
  18. 11 1月, 2022 1 次提交
  19. 10 1月, 2022 2 次提交
    • Z
      [Unify Tensors PR #5] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea
      Zhanlue Yang 提交于
      * Added shared_ptr<Allocation> member & corresponding interfaces to Storage
      
      * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly
      
      * Fixed issues with storage offset
      
      * Used place to malloc allocation for TensorStorage
      
      * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor
      
      * Fixed issues with place
      
      * Added comments
      
      * Moved mutable_data with stream argument to DenseTensor
      
      * Added set_offset interface
      
      * Fixed CI issues,test=allcases
      
      * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor
      
      * Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor
      
      * Modified framework::Tensor to inherit from DenseTensor
      
      * Reverted changes too pten_layout() interface
      
      * Removed friend classes
      
      * Rearranged cfunction calls from tensor.data<void>() to tensor.data()
      
      * Fixed CI issues
      
      * Fixed lite issues
      
      * Fixed data() interface issues,test=allcases
      
      * Resolved IsInitialized() issues
      
      * Fixed ResetHolder() issues
      
      * Fixed MKLDNN & Storage issues
      
      * Resolved ShareBufferWith() issues
      
      * Fixed LoD issues
      5c73a6ea
    • C
      Support setting infershape function for custom grad op (#38776) · 046553c7
      Chen Weihang 提交于
      * unify infer_shape func calling
      
      * support set grad infer shape fn for custom op
      
      * unify infershape in new executor and eager
      
      * remove todo comment
      
      * revert infershape in operator
      046553c7
  20. 04 1月, 2022 1 次提交
  21. 31 12月, 2021 1 次提交