1. 07 5月, 2022 1 次提交
  2. 14 4月, 2022 1 次提交
    • Y
      [Phi] Support construct Scalar by using Non-CPU Tensor (#41765) · 54ccc308
      YuanRisheng 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * add target dependency
      
      * deal with conflict
      
      * fix bugs when run unit test
      
      * fix unit test bugs
      54ccc308
  3. 13 4月, 2022 3 次提交
  4. 06 4月, 2022 1 次提交
    • Z
      Add some op yaml (#41173) · 176df91c
      zyfncg 提交于
      * add real and imag yaml
      
      * add roi_align and roi_pool yaml
      
      * add qr yaml
      
      * add psroi_pool yaml
      
      * fix bug
      
      * fix param bug of psroi_pool
      
      * fix infrt problem
      
      * fix merge bug
      176df91c
  5. 04 4月, 2022 1 次提交
    • C
      [Yaml]Add concat grad yaml (#41365) · 119816f9
      chentianyu03 提交于
      * add concat_grad kernel
      
      * fix error
      
      * remove comment code
      
      * fix outs nullptr error
      
      * change to phi header
      
      * add concat_grad declare for standalone_executor_test
      
      * add concat_grad yaml
      
      * add concat api
      
      * fix test concat op error
      
      * fix test concat op error
      119816f9
  6. 30 3月, 2022 1 次提交
  7. 27 3月, 2022 1 次提交
    • J
      Add StringTensor (#39830) · 0695e1ac
      Jack Zhou 提交于
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update accordding to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize derialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unitest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
      0695e1ac
  8. 24 3月, 2022 2 次提交
  9. 21 3月, 2022 1 次提交
  10. 17 3月, 2022 1 次提交
  11. 09 3月, 2022 1 次提交
  12. 08 3月, 2022 1 次提交
  13. 03 3月, 2022 1 次提交
  14. 26 2月, 2022 1 次提交
    • Z
      Support custom implement for C++ API (#39521) · caea126c
      zyfncg 提交于
      * Support custom implement for C++ API
      
      * rename api_invoke_impl to api_custom_impl
      
      * remove manual_api
      
      * delete mutable_data in copy_to api
      
      * fix problem of copy_to
      
      * add unittest for infer_meta_fn_factory
      
      * fix split cofig in yaml
      
      * fix split cofig in yaml
      
      * modify sum api yaml
      
      * add copy_to wrapped infermeta
      
      * rollback copy impl
      caea126c
  15. 24 2月, 2022 1 次提交
  16. 23 2月, 2022 1 次提交
  17. 21 2月, 2022 1 次提交
  18. 20 2月, 2022 1 次提交
  19. 18 2月, 2022 1 次提交
  20. 17 2月, 2022 1 次提交
    • L
      avoid custom kernel deps on pten_function_api (#39661) · cbce0e60
      Leo Chen 提交于
      * pten matmul cuda kernel support bf16
      
      * avoid custom kernel deps on pten_function_api
      
      * Revert "pten matmul cuda kernel support bf16"
      
      This reverts commit 5d520845b9a189375677276efb673235ed8e5ee0.
      
      * refine code
      
      * fix compile
      
      * fix test_split_api
      cbce0e60
  21. 13 2月, 2022 1 次提交
  22. 10 2月, 2022 1 次提交
  23. 04 2月, 2022 1 次提交
  24. 30 1月, 2022 1 次提交
    • Z
      Add a Sparse OP : to_sparse_coo (#39264) · 78132fe1
      zhangkaihuo 提交于
      * dense_to_sparse_coo
      
      * optimize unit testing; support rocm
      
      * 1. delete fluid related header file
      2. update the copyright
      
      * fix hipMemcpy
      
      * update dense_to_sparsecoo
      
      * add namespace sparse
      78132fe1
  25. 29 1月, 2022 1 次提交
  26. 27 1月, 2022 1 次提交
    • A
      [PluggableDevice] Add custom kernel support based on pten kernel management (#38848) · a8879215
      Aganlengzi 提交于
      * [Demo] custom kernel based on pten kernel
      
      * merge and npu custom work well
      
      * del comments
      
      * delete other code
      
      * fix CUDAContext
      
      * fix not found small_vector.h
      
      * support NPU
      
      * fix NPUContext
      
      * fix DeviceContext support
      
      * add UT
      
      * fix call
      
      * add UT
      
      * fix
      
      * fix for comments and ut
      
      * add MACRO control
      
      * fix multi input output
      
      * support env CUSTOM_DEVICE_ROOT
      
      * deal with special cases
      
      * fix for Windows
      
      * try coverage with test_custom_kernel_dot.py
      
      * fix test_custom_kernel_dot
      
      * fix test_custom_kernel_dot
      
      * fix merge
      
      * fix merge
      
      * fix CI
      
      * update
      
      * merge and fix
      
      * remove WITH_CUSTOM_KERNEL
      
      * fix merge
      
      * merge and fix
      
      * fix ut
      
      * fix ut for mac
      
      * add more UT
      
      * add more UT
      
      * fix
      a8879215
  27. 25 1月, 2022 2 次提交
  28. 24 1月, 2022 1 次提交
  29. 16 12月, 2021 1 次提交
  30. 10 12月, 2021 1 次提交
  31. 07 12月, 2021 2 次提交
  32. 03 12月, 2021 2 次提交
    • Z
      fix python command in cmake of api-gen (#37818) · 7e9b20b5
      zyfncg 提交于
      7e9b20b5
    • Z
      【PTen】C++ API Code-Generation (#37668) · c58c4ede
      zyfncg 提交于
      * add code-gen file
      
      * add api-gen in cmake
      
      * adjust the code format
      
      * temp save the code
      
      * add genen-api module into pten
      
      * merge the develop code
      
      * fix merge conflict
      
      * fix code conflict with develop
      
      * support reduce_mean/sum
      
      * fix the CI requirement
      
      * fix requirement problem of CI
      
      * remove original api code
      
      * fix bug caused by removing original api
      c58c4ede
  33. 19 11月, 2021 1 次提交
  34. 15 11月, 2021 1 次提交
    • C
      [Pten] Refactor the implementation of custom operator (#37122) · 1e598f1a
      Chen Weihang 提交于
      * move extension into pten [no-verify]
      
      * append tensor methods by ext_tensor [no-verify]
      
      * append other tensor methods [no-verify]
      
      * ext related files tidy [no-verify]
      
      * include relation tidy [no-verify]
      
      * add pten tensor test [no-verify]
      
      * replace tensor in custom op & compile success
      
      * refine tensor constructor for unittest
      
      * custom relu jit run success
      
      * fix all custom op unittests
      
      * add inference cmake adapt [no-verify]
      
      * fix failed unittests
      
      * fix windows failed unittests
      
      * try to fix kunlun and inference failed
      
      * fix test_elementwise_api error
      
      * try to fix win compile failed
      
      * fix kunlun fp16 type error
      
      * remove useless haddle error macro
      
      * add custom linear op test
      
      * fix compile failed & add win symbols
      
      * fix non pten kernel cast failed
      
      * add dll decl for api
      
      * polish several deetails
      
      * polish details by review comment
      
      * add dll_decl for register
      1e598f1a