1. 16 5月, 2022 1 次提交
    • Z
      [PHI] Support construct IntArray by using Non-CPU Tensosr (#41764) · 8eecd852
      zyfncg 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * int_array supports constructed by gpu tensor
      
      * add some test
      
      * polish code
      
      * adjust full api
      
      * add unittest
      
      * add unittest
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      8eecd852
  2. 19 4月, 2022 1 次提交
  3. 18 4月, 2022 1 次提交
  4. 17 4月, 2022 1 次提交
  5. 14 4月, 2022 2 次提交
    • Y
      [Phi] Support construct Scalar by using Non-CPU Tensor (#41765) · 54ccc308
      YuanRisheng 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * add target dependency
      
      * deal with conflict
      
      * fix bugs when run unit test
      
      * fix unit test bugs
      54ccc308
    • J
      Fix to #38693 (minimal UT) (#41026) · d0f3296b
      Jacek Czaja 提交于
      * Add UT
      
      - Added missed data_layout
      
      - Added missing conversions
      
      - NDHWC added
      
      - NDHWC support in data_transform
      
      - another fix
      
      - condddate change
      
      - fix
      
      u- fix
      
      - fix
      
      - fix
      
      - fix
      
      - fix
      
      - fix to hack
      
      - compilation fix
      
      - fix to automatic merge
      
      * - reduced UT
      
      * - fix
      
      * - lint
      
      * - fix to lint
      d0f3296b
  6. 13 4月, 2022 3 次提交
  7. 12 4月, 2022 3 次提交
  8. 08 4月, 2022 1 次提交
  9. 31 3月, 2022 1 次提交
  10. 27 3月, 2022 1 次提交
    • J
      Add StringTensor (#39830) · 0695e1ac
      Jack Zhou 提交于
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update accordding to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize derialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unitest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
      0695e1ac
  11. 23 3月, 2022 1 次提交
    • F
      [NPU] add npu support for conv3d and conv3d_grad (#38480) · ff568afa
      furnace 提交于
      * [NPU] add npu support for conv3d and conv3d_grad
      
      * [NPU] delete failed unittests due to Ascend not support
      
      * [NPU] delete debug codes
      
      * [NPU] optimize codes, notest
      
      * [NPU] remove const_cast
      
      * [NPU] optimize for remove const_cast
      
      * [NPU] fix written errors
      ff568afa
  12. 22 3月, 2022 1 次提交
  13. 21 3月, 2022 1 次提交
  14. 17 3月, 2022 1 次提交
  15. 15 3月, 2022 1 次提交
    • X
      [phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass... · 64223620
      xiongkun 提交于
      [phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass the tests of these four kernels (#39770)
      
      * tranfer and pass the lgamma unittest
      
      * merge and pass the test
      
      * transfer kldiv_loss and kldiv_loss_grad; pass the unitest
      
      * trafer the isclose and cumprod kernel
      
      * change PT_REGISTER -> PD_REGISTER
      
      * fix by code review
      
      * fix by code review
      
      * fix
      
      * remove enforce include dependence from scalar
      
      * fix
      
      * fix by code review
      
      * fix by code review
      64223620
  16. 07 3月, 2022 1 次提交
  17. 06 3月, 2022 1 次提交
  18. 02 3月, 2022 2 次提交
  19. 01 3月, 2022 1 次提交
  20. 28 2月, 2022 1 次提交
  21. 25 2月, 2022 1 次提交
    • C
      [Phi] Support cudnn kernel moving & move softmax kernels (#39547) · 8895379a
      Chen Weihang 提交于
      * support cudnn kernel moving
      
      * polish cmake rules
      
      * add unittest for coverage
      
      * remove orig kernel
      
      * remove softmax cudnn kernel
      
      * fix softmax test failed
      
      * fix npu func error
      
      * resolve conflict
      
      * rename gpu dnn kernels
      
      * fix name rule error
      
      * fix compile error
      
      * update fp16 namespace
      8895379a
  22. 24 2月, 2022 1 次提交
  23. 23 2月, 2022 1 次提交
  24. 22 2月, 2022 3 次提交
  25. 21 2月, 2022 2 次提交
  26. 20 2月, 2022 1 次提交