1. 28 Apr, 2022 1 commit
    • [cherry-pick] Optimize performance of dygraph (#42231, #42253) (#42309) · 69a92b7b
      zyfncg committed
      * Optimize the performance of sum api (#42231)
      
      * optimize the performance of sum api
      
      * optimize IsDenseTensorInput
      
      * remove debug log
      
      * Add move construct for KernelSignature (#42253)
      
      * add move construct for KernelSignature
      
      * add noexcept
      
      * fix cherry-pick problem
  2. 27 Apr, 2022 1 commit
    • [Cherry-pick] Optimize dygraph performance part4 (#42306) · 9bc423b1
      Chen Weihang committed
      * Remove std::type_index in AttributeArdDef (#42122)
      
      * polish some impl
      
      * add lost attr type
      
      * polish details
      
      * fix error type
      
      * polish in name lists
      
      * add double attr
      
      * adapt infrt attr parse
      
      * add attr type test (#42263)
      
      * opt attr equal perf (#42272)
  3. 26 Apr, 2022 1 commit
    • [Cherry-pick] Optimize dygraph performance part2 (#42224) · ab24b9c0
      Chen Weihang committed
      * Add paddle::variant and replace paddle::any (#42139)
      
      * add variant and replace any
      
      * split attribute
      
      * Optimize dygraph GetExpectedKernelType perf (#42154)
      
      * opt dygraph scheduling
      
      * revert part impl
      
      * fix variant compile error (#42203)
      
      * replace any by variant in infermeta (#42181)
  4. 25 Apr, 2022 1 commit
  5. 21 Apr, 2022 1 commit
  6. 20 Apr, 2022 1 commit
    • [Phi] Support construct Scalar by using Non-CPU Tensor (#41765) (#41963) · 3b25afb2
      YuanRisheng committed
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * add target dependency
      
      * deal with conflict
      
      * fix bugs when run unit test
      
      * fix unit test bugs
  7. 18 Apr, 2022 2 commits
    • [Phi] Reduce kernels into multiple files (#41747) (#41854) · 688f4ec0
      chentianyu03 committed
      * split reduce_kernel
      
      * rm reduce_kernel in cmake
      
      * split reduce_grad kernels
      
      * fix cmake build error
      
      * format code
      
      * fix standalone_executor_test error
    • [Cherry-pick] Organize the API of custom operators (#41882) · 897911fc
      Chen Weihang committed
      * [Phi&CustomOp] Remove deprecated enum PlaceType for custom op & add warning (#41647)
      
      * remove old custom op placetype
      
      * replace dist placetype usage
      
      * add with gpu macro
      
      * fix mutable_data error
      
      * fix set value error
      
      * add comment
      
      * remove all is initialized usage (#41766)
      
      * remove inner_place usage (#41768)
      
      * polish tensor deprecated method warning (#41807)
      
      * [CustomOp] Fix PlaceType related compat error (#41826)
      
      * fix place type related compat error
      
      * fix test failed
      
      * remove dll decl
      
      * revert place type change
      
      * add dll decl
      
      * resolve conflict
  8. 04 Apr, 2022 1 commit
  9. 02 Apr, 2022 2 commits
  10. 01 Apr, 2022 2 commits
  11. 31 Mar, 2022 2 commits
  12. 29 Mar, 2022 1 commit
  13. 28 Mar, 2022 1 commit
  14. 27 Mar, 2022 1 commit
    • Add StringTensor (#39830) · 0695e1ac
      Jack Zhou committed
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update according to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize deserialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unittest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
  15. 24 Mar, 2022 1 commit
  16. 22 Mar, 2022 1 commit
  17. 19 Mar, 2022 1 commit
  18. 18 Mar, 2022 1 commit
  19. 17 Mar, 2022 2 commits
    • [Phi] Move assign kernel into phi (#40022) · 1904572a
      Chen Weihang committed
      * move assign kernel init commit
      
      * change vec<tensor> to vec<tensor*>
      
      * support tensor array
      
      * support api declare
      
      * fix test_list failed
      
      * fix npu and xpu failed
      
      * fix infrt failed
      
      * remove assign array size in operator
      
      * move assign sr header into sr dir
      
      * add infermeta for assign
      
      * test op success
      
      * fix test_list failed
      
      * fix kunlun failed
      
      * add set host allocator in tests
      
      * support tensor array in arg ctx
      
      * open set layout in share_meta
      
      * fix meta tensor layout error
      
      * fix test failed
    • rename math (#40641) · 883a8eea
      YuanRisheng committed
  20. 16 Mar, 2022 2 commits
  21. 14 Mar, 2022 2 commits
  22. 12 Mar, 2022 1 commit
  23. 11 Mar, 2022 2 commits
  24. 09 Mar, 2022 2 commits
  25. 08 Mar, 2022 1 commit
  26. 07 Mar, 2022 1 commit
    • [Phi] Remove storage deps of empty (#40136) · b46e49de
      Chen Weihang committed
      * remove storage deps of empty
      
      * remove invalid empty method
      
      * remove erroneous empty usage
      
      * fix test_sparse_utils_dev_api
      
      * revert some sparse change
      
      * add memset for conv grad
      
      * resolve conflict
      
      * resolve conflict
      
      * resolve conflict
  27. 04 Mar, 2022 2 commits
  28. 03 Mar, 2022 2 commits
  29. 02 Mar, 2022 1 commit