1. 24 6月, 2022 1 次提交
    • Y
      [Phi]Change Copy from Kernel to basic component utils (#43622) · 2739bd73
      YuanRisheng 提交于
      * perfect copy
      
      * deal with conflict
      
      * deal with conflict
      
      * fix compile bugs
      
      * fix unittest bugs
      
      * change code format
      
      * deal with conflict
      
      * modify code by review
      
      * fix ce bugs
      
      * fix ce bugs
      
      * add lo
      
      * perfect code format
      
      * deal with conflicts
      2739bd73
  2. 23 6月, 2022 1 次提交
  3. 16 6月, 2022 1 次提交
  4. 10 6月, 2022 1 次提交
  5. 04 6月, 2022 1 次提交
  6. 25 4月, 2022 1 次提交
  7. 15 4月, 2022 2 次提交
    • C
      [Phi]Reduce kernels into multiply files (#41747) · 1927aff9
      chentianyu03 提交于
      * split reduce_kernel
      
      * rm reduce_kernel in cmake
      
      * split reduce_grad kernels
      
      * fix cmake build error
      
      * format code
      
      * fix standalone_executor_test error
      1927aff9
    • L
      Change cuDNN Conv kernel for auto tune feature (#41313) · 35acfeda
      limingshu 提交于
      * change cudnn helper for auto-tune
      
      * Add FLAGS_use_autotune to set the global status of autotune and change the order of choosing algorithm.
      
      * Fix the bug in calculating and printing current step cache hit rate.
      
      * Improve the autotune cache and fix unittest.
      
      * Change the key from AlgorithmType to int64_t.
      
      * Fix unittest for cpu-only env.
      
      * change ChooseAlgoByWorkspace for heuristic mode
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      35acfeda
  8. 10 4月, 2022 1 次提交
  9. 01 4月, 2022 1 次提交
    • C
      [Phi] Move softmax with cross entropy kernel into phi (#40832) · e6ec98fe
      Chen Weihang 提交于
      * add cross_entropy_with_softmax phi kernel
      
      * remove softmax_with_cross_entropy kernel
      
      * add softmax_with_cross_entropy grad kernel
      
      * remove original op kernel
      
      * refine cross entropy impl
      
      * fix pointer error
      
      * revert kernel cu change
      
      * fix xpu failed
      
      * fix cinn failed
      
      * fix npu failed
      
      * add forward sig
      
      * add check_nan_inf for pt kernel
      
      * remove repeat cmake item
      
      * fix unittest error
      e6ec98fe
  10. 30 3月, 2022 1 次提交
  11. 28 3月, 2022 2 次提交
    • H
      Move meshgrid to phi (#40994) · ca871957
      hong 提交于
      * move momentum, rmsprop to phi; test=develop
      
      * update
      
      * update
      
      * update
      
      * update
      
      * udpate; test=develop
      
      * fix xpu npu bugs; test=develop
      
      * fix npu bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows compile error; test=develop
      
      * polish code; test=develop
      
      * fix conflict; test=develop
      
      * add meshgrid;
      
      * update
      
      * polish code
      
      * polish code;
      
      * fix bug
      
      * format; remove useless code
      
      * fix npu bug
      
      * fix bug
      ca871957
    • 0
      [Phi] Move warpctc OP to phi (#40023) · cb183762
      0x45f 提交于
      * moving OP
      
      * move forward
      
      * move grad and infershape
      
      * code format
      
      * format code
      
      * fix code
      
      * fix code
      
      * fix CMakerLists.txt
      
      * fix comments
      
      * Refine CMakeLists for rocm ci
      cb183762
  12. 27 3月, 2022 1 次提交
    • J
      Add StringTensor (#39830) · 0695e1ac
      Jack Zhou 提交于
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update accordding to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize derialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unitest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
      0695e1ac
  13. 25 3月, 2022 1 次提交
    • A
      [Phi] Migrate Adam and AdamW into Phi (#40351) · 56cd3407
      Aurelius84 提交于
      * [Phi] Migrate Adam and Adamw into Phi
      
      * fix compile error and unittest ok
      
      * fix compile error and unittest ok
      
      * fix undefined reference to fLI::FLAGS
      
      * test depend on operator
      
      * fix cmake
      
      * fix xpu compile
      
      * fix infrt
      
      * fix amp_type_traits
      
      * fix amp_type_traits
      
      * modify according reviewer
      
      * modify according reviewer
      
      * fix dtype float16
      
      * fix typo
      
      * fix Cmake
      
      * fix code style
      56cd3407
  14. 24 3月, 2022 1 次提交
  15. 23 3月, 2022 2 次提交
  16. 18 3月, 2022 2 次提交
    • Z
      [Phi]Move hierarchical_sigmoid kernel to phi (#40553) · 64a7cbd3
      Zhang Zheng 提交于
      * first commit
      
      * fix compile error
      
      * support std::vector<std::srting>
      
      * fix
      
      * fix op support on GPU by chenweihang
      
      * pass test
      
      * infershape
      
      * add set_dtype
      
      * fix order
      
      * fix
      
      * unify the impl of dt and sr
      
      * fix
      64a7cbd3
    • C
      [Phi] move reduce_grad kernel into phi (#40522) · 70726696
      chentianyu03 提交于
      * move reduce_mean_grad kernel into phi
      
      * move reduce_max/min_grad into phi
      
      * remove raw max/min grad kernel
      
      * fix bug
      
      * fix max/min grad error
      
      * move all reduce_grad kernel into one file
      
      * add prod grad kernel
      
      * add infermeta for prod kernel
      70726696
  17. 17 3月, 2022 1 次提交
  18. 15 3月, 2022 1 次提交
  19. 14 3月, 2022 1 次提交
  20. 10 3月, 2022 2 次提交
    • L
      [phi] move matrix_power op (#40231) · befa78ea
      Liu-xiandong 提交于
      * [phi] move matrix_power op
      
      * MatrixInverse fluid -> phi
      
      * modify the CMake to fix compile bug
      
      * delete useless comment
      
      * mutable memory -> phi Alloc
      
      * modify the include file
      
      * modify the include file
      
      * fix bug in CI compiler
      befa78ea
    • Z
      [PHI] Move segment_pool to phi. (#40099) · a07f19ee
      Zhong Hui 提交于
      * move segment_pool to phi.
      
      * mark summed ids as optional tensor.
      
      * fix as reviews.
      a07f19ee
  21. 09 3月, 2022 1 次提交
  22. 08 3月, 2022 2 次提交
  23. 07 3月, 2022 1 次提交
    • C
      [Phi] Remove storage deps of empty (#40136) · b46e49de
      Chen Weihang 提交于
      * remove storage deps of empty
      
      * remove invalid empty method
      
      * remove error empty using
      
      * fix test_sparse_utils_dev_api
      
      * revert some sparse change
      
      * add memset for conv grad
      
      * resolve conflict
      
      * resolve conflict
      
      * resolve conflict
      b46e49de
  24. 06 3月, 2022 1 次提交
  25. 04 3月, 2022 1 次提交
    • H
      Move conv to pten (#39354) · d50fb43e
      hong 提交于
      * move conv to pten
      
      * move conv to pten; test=develop
      
      * fix bug;
      
      * add conv cudnn impl; test=develop
      
      * update
      
      * update operator; test=develop
      
      * fix bug; test=develop
      
      * move operator and prepared_operator to develop; test=develop
      
      * resolve conflict; test=develop
      
      * remove useless code;test=develop
      
      * add depency ; test=develop
      
      * fix bug;
      
      * add sig.cc ; test=develop
      
      * fix use_op error; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * add conv3d register; test=develop
      
      * fix star gan and conv_nn_grad test failed; test=develop
      
      * add header; test=develop
      
      * manul to recover to develop;
      
      * resolve confilct; test=develop
      
      * remove useless code
      
      * fix bug;
      
      * remove conv2d_cudnn; test=develop
      
      * fix bugs; test=develop
      
      * fix cpu rocm compile bugs; test=develop
      
      * fix blas error; test=develop
      
      * fix compile bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows error; test=develop
      
      * resolve confilct; test=develop
      d50fb43e
  26. 26 2月, 2022 2 次提交
    • F
      Move GumbelSoftmax OP to phi (#39873) · 581b2c64
      From00 提交于
      * Move GumbelSoftmax OP to phi
      
      * platform::errors -> phi::errors; GumbelSoftmaxGradInferMeta -> backend.h/cc
      
      * Use axis util in kernel impl
      
      * Remove namespace platform::errors
      
      * Use GetCPUEngine in Device Context
      581b2c64
    • Z
      Support custom implement for C++ API (#39521) · caea126c
      zyfncg 提交于
      * Support custom implement for C++ API
      
      * rename api_invoke_impl to api_custom_impl
      
      * remove manual_api
      
      * delete mutable_data in copy_to api
      
      * fix problem of copy_to
      
      * add unittest for infer_meta_fn_factory
      
      * fix split cofig in yaml
      
      * fix split cofig in yaml
      
      * modify sum api yaml
      
      * add copy_to wrapped infermeta
      
      * rollback copy impl
      caea126c
  27. 25 2月, 2022 1 次提交
    • C
      [Phi] Support cudnn kernel moving & move softmax kernels (#39547) · 8895379a
      Chen Weihang 提交于
      * support cudnn kernel moving
      
      * polish cmake rules
      
      * add unittest for coverage
      
      * remove orig kernel
      
      * remove softmax cudnn kernel
      
      * fix softmax test failed
      
      * fix npu func error
      
      * resolve conflict
      
      * rename gpu dnn kernels
      
      * fix name rule error
      
      * fix compile error
      
      * update fp16 namespace
      8895379a
  28. 24 2月, 2022 1 次提交
  29. 23 2月, 2022 1 次提交
  30. 22 2月, 2022 1 次提交
  31. 20 2月, 2022 1 次提交
  32. 17 2月, 2022 1 次提交
  33. 30 1月, 2022 1 次提交
    • Z
      Add a Sparse OP : to_sparse_coo (#39264) · 78132fe1
      zhangkaihuo 提交于
      * dense_to_sparse_coo
      
      * optimize unit testing; support rocm
      
      * 1. delete fluid related header file
      2. update the copyright
      
      * fix hipMemcpy
      
      * update dense_to_sparsecoo
      
      * add namespace sparse
      78132fe1