1. 23 5月, 2023 1 次提交
    • zhenhailiu's avatar
      [dist attr 迁移到 phi]Dist attr (#53848) · be1152a4
      zhenhailiu 提交于
      * merge code from forsish
      
      * polish
      
      * paddle/fluid/pybind/auto_parallel_py.cc
      
      * polish
      
      * polish
      
      * polish
      
      * polish
      
      * polish
      
      * polish
      
      * polish
      
      * polish
      
      * polish
      
      * polish
      
      * polish
      be1152a4
  2. 29 3月, 2023 1 次提交
  3. 20 3月, 2023 1 次提交
  4. 07 3月, 2023 1 次提交
  5. 30 12月, 2022 1 次提交
  6. 12 12月, 2022 1 次提交
  7. 09 11月, 2022 1 次提交
  8. 24 8月, 2022 1 次提交
  9. 01 8月, 2022 1 次提交
  10. 19 7月, 2022 1 次提交
  11. 16 6月, 2022 1 次提交
  12. 10 6月, 2022 1 次提交
  13. 04 6月, 2022 1 次提交
  14. 16 5月, 2022 1 次提交
    • Z
      [PHI] Support construct IntArray by using Non-CPU Tensosr (#41764) · 8eecd852
      zyfncg 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * int_array supports constructed by gpu tensor
      
      * add some test
      
      * polish code
      
      * adjust full api
      
      * add unittest
      
      * add unittest
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      8eecd852
  15. 25 4月, 2022 1 次提交
  16. 14 4月, 2022 1 次提交
    • Y
      [Phi] Support construct Scalar by using Non-CPU Tensor (#41765) · 54ccc308
      YuanRisheng 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * add target dependency
      
      * deal with conflict
      
      * fix bugs when run unit test
      
      * fix unit test bugs
      54ccc308
  17. 13 4月, 2022 2 次提交
  18. 27 3月, 2022 1 次提交
    • J
      Add StringTensor (#39830) · 0695e1ac
      Jack Zhou 提交于
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update accordding to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize derialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unitest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
      0695e1ac
  19. 15 3月, 2022 1 次提交
  20. 24 2月, 2022 1 次提交
  21. 20 2月, 2022 1 次提交
  22. 11 2月, 2022 1 次提交
    • S
      add print pten kernel tool (#39371) · 8803f6bb
      Shang Zhizhou 提交于
      * test=document_fix;add print pten kernel tool
      
      * test=document_fix
      
      * test=document_fix
      
      * test=document_fix
      
      * test=document_fix
      
      * add print_pten_kernels tool
      
      * add print_pten_kernels tool
      
      * fix windows complie
      
      * notest,test=rocm_ci
      
      * add merge tool
      
      * add comments
      8803f6bb
  23. 27 1月, 2022 1 次提交
    • Z
      Add SparseCooTensor and SparseCsrTensor (#38906) · a7edb3f3
      zhangkaihuo 提交于
      * fix bug:
      1. atten: set the default value of attn_dropout_rate to None
      2. ffn: add activation parameter
      
      * for pure fp16
      
      * Add a SparseCsrTensor
      
      * remove unused functional
      
      * remove const
      
      * remove SetMemoberTensor
      
      * remove non_zero_nums_, the number of non zero elements of each batch can be obtained from the crows
      
      * SparseCooTensor
      
      * add SetMember
      
      * merge upstream; add SetMember
      
      * merge upstream
      
      * merge upstream; add newline at end of file
      
      * add newline at end of file
      
      * remove newline at end of file
      
      * remove newline at end of file
      
      * stash
      
      * user pten::framework::make_ddim
      
      * user pten::framework::make_ddim
      
      * merge upstream; use the latest mutable_data
      
      * merge upstream; use the latest mutable_data
      
      * return mutable dense tensor
      a7edb3f3
  24. 26 1月, 2022 1 次提交
    • C
      [PTen] Unify InferMeta(Shape) Function in pten and fluid op (#38976) · b75507d3
      Chen Weihang 提交于
      * infermeta context init design
      
      * support infermeta called in fluid op
      
      * add hasattr and attr methods
      
      * add dygraah GetVarPtrs support
      
      * rename arg_map_context to arg_map_utils
      
      * add registry for arg map func
      
      * resolve conflit
      
      * refactor op utils design
      
      * polish meta config
      
      * fix details
      
      * remove hasattr method
      
      * resolve conflit
      
      * revert cmake order change
      
      * revert some change
      
      * change init pos
      
      * fix compile faileed
      
      * fix typo
      
      * fix inference failed
      
      * fix windows ccompile failed
      
      * polish format
      Co-authored-by: NWang Huan <wanghuan29@baidu.com>
      b75507d3
  25. 22 1月, 2022 1 次提交
  26. 21 1月, 2022 1 次提交
  27. 20 1月, 2022 1 次提交
  28. 13 1月, 2022 1 次提交
    • C
      [pten]Remove pten/include dir files (#38878) · 7e0292ea
      chentianyu03 提交于
      * move dot_dev api into dot_kernel.h
      
      * add infermate header
      
      * modify to dotkerel in dot_op.h
      
      * mvoe conj dev api into complex_kernel.h
      
      * move sign dev api into  sign_kernel.h
      
      * move scale dev api into kernel.h and remove infermete.h
      
      * rm paddle/pten/include/math.h
      
      * rm paddle/pten/include/math.h
      
      * rm include dir
      
      * rm paddle/pten/include/math.h
      
      * fix conflict with develop branch
      
      * rm devContext in conj_op.h
      
      * add the missing complex_kernel header
      7e0292ea
  29. 10 1月, 2022 1 次提交
  30. 31 12月, 2021 1 次提交
  31. 29 12月, 2021 1 次提交
  32. 28 12月, 2021 1 次提交
  33. 27 12月, 2021 1 次提交
  34. 26 12月, 2021 1 次提交
    • C
      [PTen] Move copy kernel impl (#38421) · 73819658
      Chen Weihang 提交于
      * add register general kernel marco
      
      * move copy kernel impl
      
      * revert needless change
      
      * polish details
      
      * fix xpu compil faild
      
      * fix xpu compile failed
      
      * polish format
      73819658
  35. 23 12月, 2021 1 次提交
  36. 22 12月, 2021 2 次提交
  37. 21 12月, 2021 2 次提交