1. 05 9月, 2022 1 次提交
    • H
      [phi] Migrate memcpy kernel to PHI, hold NPU op (#45622) · 2f19a364
      HongyuJia 提交于
      * migrate memcpy to phi
      
      * fix typo error
      
      * fix typo error
      
      * fix  bug and testcase
      
      * fix typo, uniform_random_kernel.cc header
      
      * fix Alloc pinned bug
      
      * change GPUContext::GetPinnedPlace
      
      * add GetPinnedPlace function
      
      * add GetPinnedPlace function
      
      * restore default throw error
      
      * fix Unimplemented error
      
      * skip StandaloneExecutor testcase
      
      * delete memcpy_sig
      2f19a364
  2. 31 8月, 2022 1 次提交
    • C
      Fix split api bug (#45396) · 4a25b60d
      Charles-hit 提交于
      * fix split bug
      
      * solve function redefine
      
      * fix fluid.layers.split and add unit test
      
      * delete splitInferMeta register in unary.cc
      
      * modify test_split_op GPU unit test
      
      * modify test_split_op GPU unit test place param
      
      * refactor split op and fix infershape bugs
      
      * add () in && and ||
      
      * fix split C++ unit test
      
      * fix split infershape
      4a25b60d
  3. 24 8月, 2022 1 次提交
    • L
      make tensor_util contains no cuda code (#45256) · 78916a7a
      Leo Chen 提交于
      * make tensor_util contains no cuda code
      
      * refine isfinite
      
      * revert ut
      
      * move isfinite function to its op
      
      * fix test
      
      * fix compile
      
      * std::isnan is not defined for int type on windows
      
      * fix windows compile
      
      * fix fp16
      
      * fix rocm compile
      
      * revert gradient node
      78916a7a
  4. 10 8月, 2022 1 次提交
  5. 26 7月, 2022 1 次提交
  6. 19 7月, 2022 1 次提交
  7. 14 7月, 2022 1 次提交
    • Y
      [Phi]Improve the mechanism for mkldnn kernel in PHI (#43941) · e9b4d0be
      YuanRisheng 提交于
      * adapt mkldnn kernel in PHI
      
      * fix ci compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * delete comment
      
      * fix compile bugs in windows-inference
      
      * delete code for converage
      
      * modify code by review
      
      * modify code by review
      
      * add todo
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix unittest bugsx
      e9b4d0be
  8. 01 7月, 2022 1 次提交
  9. 28 6月, 2022 1 次提交
  10. 27 6月, 2022 1 次提交
    • S
      make inference_api_test compile with dynamic linking library (#41944) · f8b9073f
      Sing_chan 提交于
      * make inference_api_test use import_lib + dll to smaller its size;test=document_fix;test=windows_ci_inference
      
      * make place.h not export symbols; change the order of inference test dependencies
      
      * cant delete analysis pass ..in inference_analysis_test or build test_analyzer.exe will failed; temporarily comment test need paddle_inference_c
      
      * add onnx path; cp onnxruntime.dll to tests/api
      
      * make c_ut depend on paddle_inference_c_shared rather than paddle_inference_c
      
      * make c_ut depend .dll in windows and .lib in linux;path add capi_exp
      
      * init context pool by analysis predictor
      
      * make windows and linux both use shared; compile shared even on_infer off
      
      * move case:profile after case:fuse_statis to avoid context pool null problem
      
      * make c_ut depend .dll in windows and .lib in linux;path add capi_exp
      f8b9073f
  11. 24 6月, 2022 1 次提交
  12. 23 6月, 2022 1 次提交
  13. 05 6月, 2022 1 次提交
  14. 04 6月, 2022 1 次提交
  15. 18 5月, 2022 1 次提交
  16. 16 5月, 2022 1 次提交
    • Z
      [PHI] Support construct IntArray by using Non-CPU Tensosr (#41764) · 8eecd852
      zyfncg 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * int_array supports constructed by gpu tensor
      
      * add some test
      
      * polish code
      
      * adjust full api
      
      * add unittest
      
      * add unittest
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      8eecd852
  17. 19 4月, 2022 1 次提交
  18. 18 4月, 2022 1 次提交
  19. 17 4月, 2022 1 次提交
  20. 14 4月, 2022 2 次提交
    • Y
      [Phi] Support construct Scalar by using Non-CPU Tensor (#41765) · 54ccc308
      YuanRisheng 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * add target dependency
      
      * deal with conflict
      
      * fix bugs when run unit test
      
      * fix unit test bugs
      54ccc308
    • J
      Fix to #38693 (minimal UT) (#41026) · d0f3296b
      Jacek Czaja 提交于
      * Add UT
      
      - Added missed data_layout
      
      - Added missing conversions
      
      - NDHWC added
      
      - NDHWC support in data_transform
      
      - another fix
      
      - condddate change
      
      - fix
      
      u- fix
      
      - fix
      
      - fix
      
      - fix
      
      - fix
      
      - fix to hack
      
      - compilation fix
      
      - fix to automatic merge
      
      * - reduced UT
      
      * - fix
      
      * - lint
      
      * - fix to lint
      d0f3296b
  21. 13 4月, 2022 3 次提交
  22. 12 4月, 2022 3 次提交
  23. 08 4月, 2022 1 次提交
  24. 31 3月, 2022 1 次提交
  25. 27 3月, 2022 1 次提交
    • J
      Add StringTensor (#39830) · 0695e1ac
      Jack Zhou 提交于
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update accordding to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize derialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unitest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
      0695e1ac
  26. 23 3月, 2022 1 次提交
    • F
      [NPU] add npu support for conv3d and conv3d_grad (#38480) · ff568afa
      furnace 提交于
      * [NPU] add npu support for conv3d and conv3d_grad
      
      * [NPU] delete failed unittests due to Ascend not support
      
      * [NPU] delete debug codes
      
      * [NPU] optimize codes, notest
      
      * [NPU] remove const_cast
      
      * [NPU] optimize for remove const_cast
      
      * [NPU] fix written errors
      ff568afa
  27. 22 3月, 2022 1 次提交
  28. 21 3月, 2022 1 次提交
  29. 17 3月, 2022 1 次提交
  30. 15 3月, 2022 1 次提交
    • X
      [phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass... · 64223620
      xiongkun 提交于
      [phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass the tests of these four kernels (#39770)
      
      * tranfer and pass the lgamma unittest
      
      * merge and pass the test
      
      * transfer kldiv_loss and kldiv_loss_grad; pass the unitest
      
      * trafer the isclose and cumprod kernel
      
      * change PT_REGISTER -> PD_REGISTER
      
      * fix by code review
      
      * fix by code review
      
      * fix
      
      * remove enforce include dependence from scalar
      
      * fix
      
      * fix by code review
      
      * fix by code review
      64223620
  31. 07 3月, 2022 1 次提交
  32. 06 3月, 2022 1 次提交
  33. 02 3月, 2022 2 次提交
  34. 01 3月, 2022 1 次提交