1. 09 2月, 2023 2 次提交
    • H
      [PHI decoupling] move strided_memcpy.h to phi (#50346) · 17318c1a
      Huang Jiyi 提交于
      * decouple strided_memcpy
      
      * move strided_memcpy
      
      * move strided_memcpy to phi
      
      * fix namespace
      
      * update
      
      * fix gpu compile bugs
      17318c1a
    • Y
      Add MultiTenosrAdam OP (#49220) · 10654c77
      yuehuayingxueluo 提交于
      * add multi_tenosr_adam
      
      * update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py
      
      * fix adam.py optimizer.py
      
      * fix adamw.py
      
      * fix test_multi_tensor_adam.py
      
      * fix CI bug
      
      * fix CI coverage
      
      * fix ci bug
      
      * fix betapow
      
      * fix some bugs
      
      * fix test_adamw_op.py
      
      * fix CI coverage
      
      * fix multi_tensor_adam_kernel.cc
      
      * fix CI bug
      
      * fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py
      
      * fix code style
      
      * update C++ parts
      
      * remove python parts modification temporarily
      
      * add C++ ut
      
      * update betapow copy code logic
      
      * fix ci ut
      
      * fix windows ci
      
      * fix coverage ci
      
      * improve coverage rate
      
      ---------
      Co-authored-by: Nsneaxiy <sneaxiy@126.com>
      10654c77
  2. 15 12月, 2022 1 次提交
  3. 28 11月, 2022 1 次提交
  4. 10 11月, 2022 1 次提交
  5. 08 11月, 2022 1 次提交
  6. 26 10月, 2022 1 次提交
  7. 12 10月, 2022 1 次提交
  8. 30 9月, 2022 1 次提交
  9. 28 9月, 2022 1 次提交
    • C
      Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang 提交于
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
      e12a905e
  10. 19 9月, 2022 1 次提交
    • L
      Performance fix for broadcast kernel [Part3] (#46071) · 46e4fb2a
      limingshu 提交于
      * first commit
      
      * refine code with template argument
      
      * refine code with template argument
      
      * add ternary broadcast test file
      
      * add ternary broadcast test file
      
      * fix accoriding to ci
      
      * fix op-benchmark ci error
      46e4fb2a
  11. 07 9月, 2022 1 次提交
  12. 05 9月, 2022 1 次提交
  13. 31 8月, 2022 1 次提交
    • C
      Fix split api bug (#45396) · 4a25b60d
      Charles-hit 提交于
      * fix split bug
      
      * solve function redefine
      
      * fix fluid.layers.split and add unit test
      
      * delete splitInferMeta register in unary.cc
      
      * modify test_split_op GPU unit test
      
      * modify test_split_op GPU unit test place param
      
      * refactor split op and fix infershape bugs
      
      * add () in && and ||
      
      * fix split C++ unit test
      
      * fix split infershape
      4a25b60d
  14. 30 8月, 2022 1 次提交
  15. 26 8月, 2022 1 次提交
    • K
      Transfer transfer_layout from fluid to phi (#45261) · 985f2a4a
      kangguangli 提交于
      * remove fluid kernel and activate phi kernel
      
      * fix parameter error
      
      * transfer mkldnn part
      
      * modify header file path
      
      * fix compile error
      
      * transfer special case
      
      * fix lod setting and special case for layout setting
      
      * add testcase and refine code
      985f2a4a
  16. 25 8月, 2022 1 次提交
    • K
      Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a
      kangguangli 提交于
      * transfer memcpy_d2h from fluid to phi
      
      * refine arg check and add comment
      
      * fix cannot fallback to phi kernel
      
      * fix gpu_context host alloc when tensor size = 0
      
      * add kernel for std::vector<DenseTensor> args
      
      * fix bugs in MemcpyD2HMultiIOKernel
      
      * remove useless header file
      
      * polish format
      
      * fix typo
      
      * add testcase for cudapinned place
      
      * refine check condition in test
      
      * polish error message
      
      * polish error message
      
      * remove header in fluid  directory
      
      * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation
      
      * fix code style check
      0d14e74a
  17. 01 8月, 2022 2 次提交
  18. 26 7月, 2022 1 次提交
  19. 19 7月, 2022 1 次提交
  20. 15 7月, 2022 1 次提交
  21. 13 7月, 2022 1 次提交
  22. 12 7月, 2022 1 次提交
  23. 02 7月, 2022 2 次提交
    • L
      unify cpu context, part2 (#44012) · 755438a7
      Leo Chen 提交于
      * fix init()
      
      * delete test_device_context
      
      * replace CPUDeviceContext with CPUContext
      
      * fix test_scalar
      
      * remove dot_op.cc
      
      * fix compile
      755438a7
    • L
      unify cpu context (#43989) · 09096aeb
      Leo Chen 提交于
      * unify cpu context
      
      * fix init()
      
      * delete test_device_context
      
      * fix test_scalar
      09096aeb
  24. 24 6月, 2022 1 次提交
    • Y
      [Phi]Change Copy from Kernel to basic component utils (#43622) · 2739bd73
      YuanRisheng 提交于
      * perfect copy
      
      * deal with conflict
      
      * deal with conflict
      
      * fix compile bugs
      
      * fix unittest bugs
      
      * change code format
      
      * deal with conflict
      
      * modify code by review
      
      * fix ce bugs
      
      * fix ce bugs
      
      * add lo
      
      * perfect code format
      
      * deal with conflicts
      2739bd73
  25. 23 6月, 2022 1 次提交
  26. 05 6月, 2022 1 次提交
  27. 04 6月, 2022 1 次提交
  28. 12 5月, 2022 1 次提交
  29. 19 4月, 2022 1 次提交
  30. 15 4月, 2022 1 次提交
  31. 02 4月, 2022 2 次提交
  32. 01 4月, 2022 2 次提交
  33. 31 3月, 2022 1 次提交
  34. 29 3月, 2022 1 次提交
  35. 27 3月, 2022 1 次提交
    • J
      Add StringTensor (#39830) · 0695e1ac
      Jack Zhou 提交于
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update accordding to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize derialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unitest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
      0695e1ac