1. 28 6月, 2022 1 次提交
  2. 27 6月, 2022 1 次提交
  3. 24 6月, 2022 1 次提交
  4. 23 6月, 2022 1 次提交
  5. 15 6月, 2022 1 次提交
  6. 10 6月, 2022 1 次提交
  7. 04 6月, 2022 1 次提交
  8. 24 5月, 2022 1 次提交
  9. 18 5月, 2022 1 次提交
  10. 16 5月, 2022 1 次提交
    • Z
      [PHI] Support construct IntArray by using Non-CPU Tensosr (#41764) · 8eecd852
      zyfncg 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * int_array supports constructed by gpu tensor
      
      * add some test
      
      * polish code
      
      * adjust full api
      
      * add unittest
      
      * add unittest
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      8eecd852
  11. 11 5月, 2022 1 次提交
    • Z
      [Phi] Change the output format of C++ backward api (Part1) (#42677) · ba71fbea
      zyfncg 提交于
      * change the output format of C++ backward api
      
      * fix merge conflict
      
      * fix sparse api code auto-gen
      
      * fix eager_gen bug
      
      * fix bug of output is null
      
      * fix bug of conv2d_grad_impl
      
      * fix optional grad
      
      * fix bug of eager-gen double_grad
      
      * fix bug
      
      * fix multiply_double_grad bug
      
      * remove node pruning
      ba71fbea
  12. 07 5月, 2022 1 次提交
  13. 14 4月, 2022 1 次提交
    • Y
      [Phi] Support construct Scalar by using Non-CPU Tensor (#41765) · 54ccc308
      YuanRisheng 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * add target dependency
      
      * deal with conflict
      
      * fix bugs when run unit test
      
      * fix unit test bugs
      54ccc308
  14. 13 4月, 2022 3 次提交
  15. 06 4月, 2022 1 次提交
    • Z
      Add some op yaml (#41173) · 176df91c
      zyfncg 提交于
      * add real and imag yaml
      
      * add roi_align and roi_pool yaml
      
      * add qr yaml
      
      * add psroi_pool yaml
      
      * fix bug
      
      * fix param bug of psroi_pool
      
      * fix infrt problem
      
      * fix merge bug
      176df91c
  16. 04 4月, 2022 1 次提交
    • C
      [Yaml]Add concat grad yaml (#41365) · 119816f9
      chentianyu03 提交于
      * add concat_grad kernel
      
      * fix error
      
      * remove comment code
      
      * fix outs nullptr error
      
      * change to phi header
      
      * add concat_grad declare for standalone_executor_test
      
      * add concat_grad yaml
      
      * add concat api
      
      * fix test concat op error
      
      * fix test concat op error
      119816f9
  17. 30 3月, 2022 1 次提交
  18. 27 3月, 2022 1 次提交
    • J
      Add StringTensor (#39830) · 0695e1ac
      Jack Zhou 提交于
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update accordding to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize derialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unitest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
      0695e1ac
  19. 24 3月, 2022 2 次提交
  20. 21 3月, 2022 1 次提交
  21. 17 3月, 2022 1 次提交
  22. 09 3月, 2022 1 次提交
  23. 08 3月, 2022 1 次提交
  24. 03 3月, 2022 1 次提交
  25. 26 2月, 2022 1 次提交
    • Z
      Support custom implement for C++ API (#39521) · caea126c
      zyfncg 提交于
      * Support custom implement for C++ API
      
      * rename api_invoke_impl to api_custom_impl
      
      * remove manual_api
      
      * delete mutable_data in copy_to api
      
      * fix problem of copy_to
      
      * add unittest for infer_meta_fn_factory
      
      * fix split cofig in yaml
      
      * fix split cofig in yaml
      
      * modify sum api yaml
      
      * add copy_to wrapped infermeta
      
      * rollback copy impl
      caea126c
  26. 24 2月, 2022 1 次提交
  27. 23 2月, 2022 1 次提交
  28. 21 2月, 2022 1 次提交
  29. 20 2月, 2022 1 次提交
  30. 18 2月, 2022 1 次提交
  31. 17 2月, 2022 1 次提交
    • L
      avoid custom kernel deps on pten_function_api (#39661) · cbce0e60
      Leo Chen 提交于
      * pten matmul cuda kernel support bf16
      
      * avoid custom kernel deps on pten_function_api
      
      * Revert "pten matmul cuda kernel support bf16"
      
      This reverts commit 5d520845b9a189375677276efb673235ed8e5ee0.
      
      * refine code
      
      * fix compile
      
      * fix test_split_api
      cbce0e60
  32. 13 2月, 2022 1 次提交
  33. 10 2月, 2022 1 次提交
  34. 04 2月, 2022 1 次提交
  35. 30 1月, 2022 1 次提交
    • Z
      Add a Sparse OP : to_sparse_coo (#39264) · 78132fe1
      zhangkaihuo 提交于
      * dense_to_sparse_coo
      
      * optimize unit testing; support rocm
      
      * 1. delete fluid related header file
      2. update the copyright
      
      * fix hipMemcpy
      
      * update dense_to_sparsecoo
      
      * add namespace sparse
      78132fe1
  36. 29 1月, 2022 1 次提交
  37. 27 1月, 2022 1 次提交
    • A
      [PluggableDevice] Add custom kernel support based on pten kernel management (#38848) · a8879215
      Aganlengzi 提交于
      * [Demo] custom kernel based on pten kernel
      
      * merge and npu custom work well
      
      * del comments
      
      * delete other code
      
      * fix CUDAContext
      
      * fix not found small_vector.h
      
      * support NPU
      
      * fix NPUContext
      
      * fix DeviceContext support
      
      * add UT
      
      * fix call
      
      * add UT
      
      * fix
      
      * fix for comments and ut
      
      * add MACRO control
      
      * fix multi input output
      
      * support env CUSTOM_DEVICE_ROOT
      
      * deal with special cases
      
      * fix for Windows
      
      * try coverage with test_custom_kernel_dot.py
      
      * fix test_custom_kernel_dot
      
      * fix test_custom_kernel_dot
      
      * fix merge
      
      * fix merge
      
      * fix CI
      
      * update
      
      * merge and fix
      
      * remove WITH_CUSTOM_KERNEL
      
      * fix merge
      
      * merge and fix
      
      * fix ut
      
      * fix ut for mac
      
      * add more UT
      
      * add more UT
      
      * fix
      a8879215