1. 16 5月, 2023 1 次提交
  2. 13 4月, 2023 1 次提交
    • H
      [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h (#52651) · 5664ea26
      HongyuJia 提交于
      * [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h
      
      * Add logging.h for profiler.cc
      
      * Add logging.h for gloo_utils.h
      
      * Add logging.h for addmm_kernel_impl.h
      
      * Add logging.h for addmm_grad_kernel_impl.h
      
      * Add logging.h for p_send_kernel.cu
      
      * Add logging.h for determinant_grad_kernel_impl.h
      
      * Add logging.h for p_recv_kernel.cu
      
      * Add logging.h for elementwise_grad_base.h
      
      * Add logging.h for transfer_layout_kernel.cc
      
      * Add logging.h for eigvals_kernel.cc and index_select_impl.h
      
      * Add logging.h for all files in kernel directory
      
      * Add logging.h for xpu_info.cc
      
      * Add logging.h for xpu
      5664ea26
  3. 21 3月, 2023 1 次提交
    • iSerendipity's avatar
      [PHI decoupling] Move DataType* from paddle:experimental to phi namespace (#51716) · 4638a62e
      iSerendipity 提交于
      * move DataType from paddle::experimental to phi
      
      * convert namespace
      
      * convert namespace
      
      * convert namespace
      
      * clarify namespace
      
      * convert more datatype
      
      * Revert "convert more datatype"
      
      This reverts commit 083b462959e6a22d4d8767707b628b95b396642e.
      
      * convert more in auto_code_generator
      
      * fix conflicts for XPU
      
      * fix namespace conflicts
      
      * fix errors
      
      * Revert "fix errors"
      
      This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876.
      
      * fix errors
      
      * fix formatting
      4638a62e
  4. 21 2月, 2023 1 次提交
  5. 28 12月, 2022 1 次提交
    • L
      [new-exec] Ahead-Of-Time choosing kernel (#48789) · 63d2d722
      Leo Chen 提交于
      * add skip run
      
      * alloc minimum memory
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * fix cases when tensor is initialized or empty
      
      * alloc empty output for place info
      
      * add test
      
      * increase timeout
      
      * format code
      
      * skip cpu
      
      * add cudnn_deterministic
      
      * fit for hostAlloc
      
      * follow comments
      
      * change check_size to fake_alloc
      63d2d722
  6. 02 12月, 2022 1 次提交
  7. 29 11月, 2022 1 次提交
  8. 24 11月, 2022 1 次提交
    • H
      [Phi Support CuDNN] Support ALL CuDNN (#47865) · 1623f1b4
      HongyuJia 提交于
      * support default use_gpudnn=True
      
      * fully support cudnn in phi
      
      * add header file
      
      * add white_list, verify accuracy
      
      * phi support all cudnn
      
      * opt affine_grad
      
      * try different arches of pretrained_model
      
      * try different arches of pretrained_model
      
      * add debug string
      
      * debug eager_method
      
      * add debug string, pass all local ctest
      
      * polish all debug code
      
      * delete use_cudnn relevant code autogen
      
      * fix depthwise_conv2d
      
      * Share all other members of Tensor except use_cudnn
      
      * polish codes according to review opinion
      
      * polish codes according to review opinion, fix bug
      
      * polish codes according to review opinion, opt performance
      
      * polish codes according to review opinion, fix pooling.py
      1623f1b4
  9. 16 11月, 2022 1 次提交
  10. 09 11月, 2022 2 次提交
  11. 04 11月, 2022 1 次提交
  12. 02 11月, 2022 1 次提交
  13. 26 10月, 2022 1 次提交
    • H
      [MKLDNN] Delete mkldnn hard code of prior_box (#47068) · d78dd7ea
      HongyuJia 提交于
      * remove prior_box mkldnn hard code
      
      * add header file
      
      * simplify PD_VISIT_TYPE
      
      * decouple dependency between prior_box and density_prior_box
      
      * fix pragma omp parallel error
      
      * bypass #pragma omp_parallel_for error
      
      * polish code
      
      * remove visit_type headerfile
      
      * polish codestyle
      
      * polish codestyle
      
      * try fix CI error
      
      * add testcase, datatype=float64
      
      * reset test_prior_box testcase
      
      * add datacheck to DenseTensor
      
      * update template name
      
      * call prior_box with macro expand
      d78dd7ea
  14. 27 4月, 2022 1 次提交
    • Z
      Optimize performance of dygraph (v4) (#42196) · 37e2f027
      zyfncg 提交于
      * optimize performance of dygraph
      
      * optimize performance of dygraph and elementwise_add
      
      * optimize the trace op
      
      * fix bug
      
      * fix bug
      
      * fix unittest bug
      
      * fix code format
      37e2f027
  15. 19 4月, 2022 1 次提交
  16. 13 4月, 2022 1 次提交
  17. 30 3月, 2022 1 次提交
    • P
      support view strategy in dygraph eager_final state (#40891) · 495ca4aa
      pangyoki 提交于
      * support view strategy in eager_final state
      
      * perfect reshape kernel
      
      * fix bugs of sig
      
      * add unittest for reshape_sig
      
      * fix bugs when run converage
      
      * fix inplace bug in final_state eager_gen
      
      * fix python_c_gen
      
      * support view strategy for final state
      
      * fix order of out and xshape in reshape
      
      * fix Coverage_CI unittest timeout error
      
      * support reshape view
      
      * fix reshape_sig
      
      * fix yml and api_base
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      495ca4aa
  18. 21 3月, 2022 1 次提交
    • H
      Add yaml config part0 (#40020) · cc853e95
      hong 提交于
      * add add yaml
      
      * add elementwise add yaml; test=develop
      
      * add norm
      
      * update
      
      * add some yaml config; test=develop
      
      * fix bug; test=develop
      
      * fix compare error; test=develop
      
      * revert erger_gen.py
      
      * update; test=deveop
      
      * remove usless code; test=deveop
      
      * fix bug; test=develop
      
      * fix test error; test=develop
      
      * remove int_type; test=develop
      
      * fix type error; test=develop
      
      * format; test=develop
      
      * remove type register; test=develop
      
      * polish code; test=develop
      
      * fix ci error; test=develop
      cc853e95
  19. 28 2月, 2022 1 次提交
  20. 22 2月, 2022 1 次提交
    • X
      change Vector to std::vector and provide MixVector class as a helper … (#39559) · 728c0624
      xiongkun 提交于
      * change Vector to std::vector and provide MixVector class as a helper wrapper class
      
      * solve the multi-gpu hang problem
      
      * remove the duplicate template instantialize
      
      * Copy vector to cpu
      
      * add CopyToCPU
      
      * xxx
      
      * final version: fix the problem of all reduce
      
      * remove mixvector dependence
      
      * fix
      
      * merge
      
      * fix code
      
      * fix by CI
      728c0624
  21. 20 2月, 2022 1 次提交
  22. 17 2月, 2022 1 次提交
  23. 29 1月, 2022 1 次提交
    • C
      [PTen] Tidy pten core headers (#39188) · dd990981
      Chen Weihang 提交于
      * open header for custom kernel
      
      * add core utils
      
      * tidy core code
      
      * tify header
      
      * tidy include
      
      * tidy namespace
      
      * resolve conflit
      
      * fix unittest and coverage
      
      * remove platform using
      
      * resolve conflict
      
      * resolve conflict
      
      * fix digamma namespace error
      
      * fix xpu full kernel error
      
      * fix xpu full kernel error
      
      * polish details
      
      * add place for lib storage
      dd990981
  24. 27 1月, 2022 1 次提交
  25. 26 1月, 2022 1 次提交
  26. 25 1月, 2022 2 次提交
  27. 24 1月, 2022 1 次提交
    • [Refactoring Tensor PR #5] replace storage with pten allocation (#39085) · a56e16a7
      石晓伟 提交于
      * updates callers, test=develop
      
      * updates tensor, test=develop
      
      * fixes errors, test=develop
      
      * remove some dtypes, test=develop
      
      * fix errors in the base storage modification, test=develop
      
      * fixes a bug, test=develop
      
      * fixes the bugs in push the whole, test=develop
      
      * updates, test=develop
      
      * update
      
      * update, test=develop
      
      * fixes the mac-py3 CI, test=develop
      
      * remove the storage impl, test=develop
      
      * updates some codes, test=develop
      
      * update, test=develop
      
      * updates pten allocation, test=develop
      a56e16a7
  28. 20 1月, 2022 2 次提交
  29. 18 1月, 2022 1 次提交
  30. 17 1月, 2022 1 次提交
  31. 15 1月, 2022 1 次提交
  32. 14 1月, 2022 1 次提交
  33. 11 1月, 2022 1 次提交
    • Z
      【PTen】Add dot and matmul grad kernel in pten (#38713) · be817719
      zyfncg 提交于
      * refactor matmul directory in pten
      
      * fix merge conflict
      
      * add dot_grad kernel
      
      * add dot_grad kernel in pten
      
      * add matmul_grad kernel
      
      * update the code
      
      * delete useless code in fluid
      
      * fix some bug of running matmul grad kernel
      
      * fix merge conflict
      
      * refactor some code
      
      * refactor code
      be817719
  34. 10 1月, 2022 1 次提交
    • Z
      [Unify Tensors PR #5] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea
      Zhanlue Yang 提交于
      * Added shared_ptr<Allocation> member & corresponding interfaces to Storage
      
      * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly
      
      * Fixed issues with storage offset
      
      * Used place to malloc allocation for TensorStorage
      
      * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor
      
      * Fixed issues with place
      
      * Added comments
      
      * Moved mutable_data with stream argument to DenseTensor
      
      * Added set_offset interface
      
      * Fixed CI issues,test=allcases
      
      * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor
      
      * Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor
      
      * Modified framework::Tensor to inherit from DenseTensor
      
      * Reverted changes too pten_layout() interface
      
      * Removed friend classes
      
      * Rearranged cfunction calls from tensor.data<void>() to tensor.data()
      
      * Fixed CI issues
      
      * Fixed lite issues
      
      * Fixed data() interface issues,test=allcases
      
      * Resolved IsInitialized() issues
      
      * Fixed ResetHolder() issues
      
      * Fixed MKLDNN & Storage issues
      
      * Resolved ShareBufferWith() issues
      
      * Fixed LoD issues
      5c73a6ea
  35. 04 1月, 2022 1 次提交
    • Z
      [Unify Tensors PR #3]Port framework::Tensor members & interfaces to... · dfdc9960
      Zhanlue Yang 提交于
      [Unify Tensors PR #3]Port framework::Tensor members & interfaces to pten::DenseTensor, test=allcases (#38473)
      
      * Added shared_ptr<Allocation> member & corresponding interfaces to Storage
      
      * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly
      
      * Fixed issues with storage offset
      
      * Used place to malloc allocation for TensorStorage
      
      * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor
      
      * Fixed issues with place
      
      * Added comments
      
      * Moved mutable_data with stream argument to DenseTensor
      
      * Added set_offset interface
      
      * Fixed CI issues,test=allcases
      
      * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor
      
      * Reverted changes too pten_layout() interface
      
      * Removed friend classes
      dfdc9960
  36. 30 12月, 2021 1 次提交
  37. 22 12月, 2021 1 次提交