1. 18 Mar, 2022 1 commit
    • C
      [Phi] move reduce_grad kernel into phi (#40522) · 70726696
      chentianyu03 committed
      * move reduce_mean_grad kernel into phi
      
      * move reduce_max/min_grad into phi
      
      * remove raw max/min grad kernel
      
      * fix bug
      
      * fix max/min grad error
      
      * move all reduce_grad kernel into one file
      
      * add prod grad kernel
      
      * add infermeta for prod kernel
  2. 17 Mar, 2022 1 commit
  3. 15 Mar, 2022 1 commit
  4. 14 Mar, 2022 1 commit
  5. 10 Mar, 2022 2 commits
    • L
      [phi] move matrix_power op (#40231) · befa78ea
      Liu-xiandong committed
      * [phi] move matrix_power op
      
      * MatrixInverse fluid -> phi
      
      * modify the CMake to fix compile bug
      
      * delete useless comment
      
      * mutable memory -> phi Alloc
      
      * modify the include file
      
      * modify the include file
      
      * fix bug in CI compiler
    • Z
      [PHI] Move segment_pool to phi. (#40099) · a07f19ee
      Zhong Hui committed
      * move segment_pool to phi.
      
      * mark summed ids as optional tensor.
      
      * fix per review comments.
  6. 09 Mar, 2022 1 commit
  7. 08 Mar, 2022 2 commits
  8. 07 Mar, 2022 1 commit
    • C
      [Phi] Remove storage deps of empty (#40136) · b46e49de
      Chen Weihang committed
      * remove storage deps of empty
      
      * remove invalid empty method
      
      * remove error empty using
      
      * fix test_sparse_utils_dev_api
      
      * revert some sparse change
      
      * add memset for conv grad
      
      * resolve conflict
      
      * resolve conflict
      
      * resolve conflict
  9. 06 Mar, 2022 1 commit
  10. 04 Mar, 2022 1 commit
    • H
      Move conv to pten (#39354) · d50fb43e
      hong committed
      * move conv to pten
      
      * move conv to pten; test=develop
      
      * fix bug;
      
      * add conv cudnn impl; test=develop
      
      * update
      
      * update operator; test=develop
      
      * fix bug; test=develop
      
      * move operator and prepared_operator to develop; test=develop
      
      * resolve conflict; test=develop
      
      * remove useless code; test=develop
      
      * add dependency; test=develop
      
      * fix bug;
      
      * add sig.cc ; test=develop
      
      * fix use_op error; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * add conv3d register; test=develop
      
      * fix star gan and conv_nn_grad test failed; test=develop
      
      * add header; test=develop
      
      * manually recover to develop;
      
      * resolve conflict; test=develop
      
      * remove useless code
      
      * fix bug;
      
      * remove conv2d_cudnn; test=develop
      
      * fix bugs; test=develop
      
      * fix cpu rocm compile bugs; test=develop
      
      * fix blas error; test=develop
      
      * fix compile bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows error; test=develop
      
      * resolve conflict; test=develop
  11. 26 Feb, 2022 2 commits
    • F
      Move GumbelSoftmax OP to phi (#39873) · 581b2c64
      From00 committed
      * Move GumbelSoftmax OP to phi
      
      * platform::errors -> phi::errors; GumbelSoftmaxGradInferMeta -> backend.h/cc
      
      * Use axis util in kernel impl
      
      * Remove namespace platform::errors
      
      * Use GetCPUEngine in Device Context
    • Z
      Support custom implement for C++ API (#39521) · caea126c
      zyfncg committed
      * Support custom implement for C++ API
      
      * rename api_invoke_impl to api_custom_impl
      
      * remove manual_api
      
      * delete mutable_data in copy_to api
      
      * fix problem of copy_to
      
      * add unittest for infer_meta_fn_factory
      
      * fix split config in yaml
      
      * fix split config in yaml
      
      * modify sum api yaml
      
      * add copy_to wrapped infermeta
      
      * rollback copy impl
  12. 25 Feb, 2022 1 commit
    • C
      [Phi] Support cudnn kernel moving & move softmax kernels (#39547) · 8895379a
      Chen Weihang committed
      * support cudnn kernel moving
      
      * polish cmake rules
      
      * add unittest for coverage
      
      * remove orig kernel
      
      * remove softmax cudnn kernel
      
      * fix softmax test failed
      
      * fix npu func error
      
      * resolve conflict
      
      * rename gpu dnn kernels
      
      * fix name rule error
      
      * fix compile error
      
      * update fp16 namespace
  13. 24 Feb, 2022 1 commit
  14. 23 Feb, 2022 1 commit
  15. 22 Feb, 2022 1 commit
  16. 20 Feb, 2022 1 commit
  17. 17 Feb, 2022 1 commit
  18. 30 Jan, 2022 1 commit
    • Z
      Add a Sparse OP : to_sparse_coo (#39264) · 78132fe1
      zhangkaihuo committed
      * dense_to_sparse_coo
      
      * optimize unit testing; support rocm
      
      * 1. delete fluid related header file
      2. update the copyright
      
      * fix hipMemcpy
      
      * update dense_to_sparsecoo
      
      * add namespace sparse
  19. 26 Jan, 2022 1 commit
    • L
      [pten] remove deprecated fluid op kernel for pten (#38842) · 3ab9aef1
      Leo Chen committed
      * update cmake file to remove fluid kernel
      
      * add pten declaration.h to where pybind.h used
      
      * fix sync_bn and tensorrt_engine
      
      * refine detection_library
      
      * fix interpreter_core
      
      * support eager legacy
      
      * fit eager legacy for pten
      
      * fall back to cpu if not found kernel
      
      * fix compile problem
      
      * fix compile problem
      
      * refine fallback logic
      
      * fit operator.run()
      
      * fix xpu compile
      
      * fit for new_exec
      
      * add REGISTER_OP_WITHOUT_GRADIENT
      
      * un-cache pt_kernel_context
      
      * fix compile
      
      * fix cudnn
      
      * fix compiling with on_infer
      
      * fix mkldnn
      
      * fix isfinite_v2
      
      * fix xpu problem
      
      * fix op_device
      
      * refine fallback for xpu
      
      * fix xpu compile
      
      * merge develop
      
      * refine code format
      
      * fix compile
      
      * fix compile
      
      * add data_transfer
      
      * fix PreparePtenData
      
      * fix cpu context
      
      * merge develop
      
      * fix compile
      
      * fix error device context
      
      * fix xpu
      
      * fix dev_ctx
  20. 24 Jan, 2022 1 commit
  21. 22 Jan, 2022 2 commits
  22. 21 Jan, 2022 1 commit
  23. 12 Jan, 2022 1 commit
  24. 10 Jan, 2022 1 commit
  25. 04 Jan, 2022 1 commit
  26. 31 Dec, 2021 1 commit
  27. 29 Dec, 2021 1 commit
  28. 27 Dec, 2021 2 commits
  29. 26 Dec, 2021 1 commit
    • C
      [PTen] Move copy kernel impl (#38421) · 73819658
      Chen Weihang committed
      * add register general kernel macro
      
      * move copy kernel impl
      
      * revert needless change
      
      * polish details
      
      * fix xpu compile failed
      
      * fix xpu compile failed
      
      * polish format
  30. 22 Dec, 2021 2 commits
  31. 21 Dec, 2021 2 commits
  32. 09 Dec, 2021 1 commit
  33. 29 Nov, 2021 1 commit
    • C
      [Pten] Add reduce mean kernel, replace with mean API (#37559) · f9e9fd19
      chentianyu03 committed
      * add pten reduce kernel
      
      * add reduce_sum kernel
      
      * update attribute args and order
      
      * make out dtype undefined
      
      * fix empty input error
      
      * merge develop branch
      
      * rename sum as reduce function
      
      * rename sum as reduce function
      
      * fix reducekernelImpl args error
      
      * add reduce cuda kernel
      
      * modify dims type to const &
      
      * remove unused log
      
      * fix reduce_all out eigen function error
      
      * remove unused codes
      
      * add the missing sum api define and testcase
      
      * merge develop branch
      
      * fix sum test axis value error
      
      * replace pten mean kernel with reduce_mean
      
      * recover mean cuda to original implementation