1. 06 9月, 2023 1 次提交
    • C
      [AutoParallel] Generate spmd rule and reshard impl in phi api (#56831) · e9364a38
      Chen Weihang 提交于
      * add spmd and reshard code gen
      
      * add backward reshard code gen
      
      * test matmul forward success
      
      * polish test impl
      
      * add unsafe mutable value
      
      * polish details and add test
      
      * fix unittest time out
      
      * fix typo
      
      * refactor reshard input generate impl
      
      * resolve conflict with develop
      
      * fix compile error
      e9364a38
  2. 05 9月, 2023 1 次提交
    • G
      [Auto Parallel]: Support std::vector<phi::Tensor> input and output for DistTensor. (#56602) · d2fedeac
      Ghost Screaming 提交于
      * [WIP] Support std::vector<phi::Tensor> input and output for DistTensor.
      Concat forward and backward are verified.
      
      * Polish code for new dist tensor implementation.
      
      * Fix bug of DistTensor upgrade. Add support functions for std::vector<Tensor> -> std::vector<Tensor>.
      
      * Add support for DistTensor type of std::vector<phi::Tensor> as input or output of operators.
      Following testcases are passed.
      1. concat: std::vector<phi::Tensor> -> phi::Tensor
      2. unbind: phi::Tensor -> std::vector<phi::Tensor>
      3. broadcast_tensors: std::vector<phi::Tensor> -> std::vector<phi::Tensor>
      
      * Polish code. Remove useless comments.
      
      * Add update_loss_scaling in skip_op_lists.
      
      * Polish code.
      d2fedeac
  3. 04 9月, 2023 1 次提交
  4. 01 9月, 2023 1 次提交
    • C
      [clang-tidy] No.34,36 enable... · 17e4be21
      cyberslack_lee 提交于
      [clang-tidy] No.34,36 enable performance-noexcept-move-constructor,modernize-use-transparent-functors (#56261)
      
      * fix
      
      * fix
      
      * CI
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * CI
      
      * fix
      
      * CI
      17e4be21
  5. 30 8月, 2023 1 次提交
    • H
      Add paddle custom flags support (#56256) · 2ef4ec71
      huangjiyi 提交于
      * update
      
      * repalce gflags header
      
      * replace DEFINE_<type> with PD_DEFINE_<type>
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update cmake
      
      * add :: before some paddle namespace
      
      * fix link error
      
      * fix CI-Py3
      
      * allow commandline parse
      
      * fix SetFlagsFromEnv
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix CI-Coverage-build
      
      * fix CI-Windows-build
      
      * fix CI-Inference
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix inference api test
      
      * fix infer_ut test
      
      * revert infer_ut gflags usage
      
      * update
      
      * fix inference
      
      * remove flags export macro
      
      * revert inference demo_ci gflags usage
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug when turn on WITH_GFLAGS
      
      * turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * update
      
      * update and add unittest
      
      * add unittest
      
      * fix conflict
      
      * rerun ci
      
      * update
      
      * resolve conflict
      2ef4ec71
  6. 28 8月, 2023 1 次提交
    • C
      [AutoParallel] Simplify PADDLE_WITH_DISTRIBUTE marco using (#56361) · 62c78e26
      Chen Weihang 提交于
      * simplify with dist marco
      
      * polish error message format
      
      * fix vtable error
      
      * fix cmake error
      
      * fix winsock redefined error
      
      * fix windows compile error
      
      * fix windows conpile failed
      
      * fix merge error
      
      * fix vec compile error
      
      * add port.h into test_cpu_vec
      
      * fix merge error
      
      * try to fix winsock error
      62c78e26
  7. 25 8月, 2023 1 次提交
  8. 22 8月, 2023 1 次提交
    • C
      [AutoParallel] Polish dist tensor design (#56368) · 8495377a
      Chen Weihang 提交于
      * polish dist teensor design
      
      * adjust constructor
      
      * polish details
      
      * polish details design
      
      * fix compile error
      
      * refactor init tensor impl
      
      * fix reshard test
      
      * polish details
      
      * add unittest for coverage
      8495377a
  9. 21 8月, 2023 1 次提交
  10. 18 8月, 2023 1 次提交
  11. 16 8月, 2023 1 次提交
    • C
      [AutoParallel] Dygraph basic impl for semi auto parallel (#55698) · 7039bef3
      Chen Weihang 提交于
      * add phi forward api gen impl
      
      * add phi backward gen code
      
      * polish api code gen impl
      
      * polish code gen impl
      
      * remove auto_paralel namespace
      
      * add dygraph forward impl
      
      * add for_auto_parallel cond
      
      * fix code gen errors
      
      * add dygraph backward impl
      
      * resolve conflict with develop
      
      * refactor dist api gen impl
      
      * revert origin api gen impl
      
      * replace template for override func
      
      * fix dnnl marco error
      
      * revert third_party change
      
      * add with distributed marco
      
      * Update grad_tensor_holder.cc details
      
      * merge dist tensor constructor
      
      * change test tensor to replicate
      
      * fx typo
      
      * resolve conflict with develop
      
      * fix out dim error
      7039bef3
  12. 15 8月, 2023 1 次提交
  13. 14 8月, 2023 1 次提交
  14. 07 8月, 2023 1 次提交
  15. 04 8月, 2023 1 次提交
  16. 03 8月, 2023 1 次提交
  17. 31 7月, 2023 1 次提交
  18. 24 7月, 2023 1 次提交
  19. 19 7月, 2023 1 次提交
    • H
      [NewIR]Add feed with place op (#55343) · 8e9e0659
      hong 提交于
      * add feed with place op
      
      * remove useless unitest
      
      * udpate mkldnn
      
      * update
      
      * add enable_static
      
      * remove useless test case
      
      * register int and doubel type
      
      * fix bug
      8e9e0659
  20. 17 7月, 2023 1 次提交
  21. 29 6月, 2023 1 次提交
    • H
      Refactor op info parser (#54859) · f18d538b
      hong 提交于
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case`
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attrite print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish  code  and add python test
      
      * add test
      
      * fix test error
      
      * add env flag
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * fix build_static bug
      
      * fix type test error
      
      * udpate cmake
      
      * disable test in windows
      
      * update
      
      * update
      
      * fix bug
      
      * split file
      
      * fix conflict
      
      * polish code and fix conflict
      
      * support place transformer
      
      * finish bug
      
      * add gpu flags
      
      * fix with cuda macro
      
      * add fetch kernel
      
      * support fetch var in new ir
      
      * fix bug
      
      * polish code
      
      * change array equal to np.testing
      
      * support feed in new ir
      
      * update
      
      * fix bug
      
      * try to hack combine op
      
      * add scope guard
      
      * revert atan2 op
      
      * add scope guard
      
      * update
      
      * polish code
      
      * update
      
      * refactor build kernel context
      
      * fix unitest bug
      
      * polish code
      
      * use original order
      
      * remove useless code
      
      * polish code
      
      * fix bug
      f18d538b
  22. 27 6月, 2023 1 次提交
    • H
      New ir support data transfer (#54763) · b58869fa
      hong 提交于
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case`
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attrite print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish  code  and add python test
      
      * add test
      
      * fix test error
      
      * add env flag
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * fix build_static bug
      
      * fix type test error
      
      * udpate cmake
      
      * disable test in windows
      
      * update
      
      * update
      
      * fix bug
      
      * split file
      
      * fix conflict
      
      * polish code and fix conflict
      
      * support place transformer
      
      * finish bug
      
      * add gpu flags
      
      * fix with cuda macro
      
      * update
      
      * add scope guard
      
      * polish code
      b58869fa
  23. 13 6月, 2023 1 次提交
  24. 26 5月, 2023 1 次提交
    • Y
      [PHI Decoupling]Create PHI shared lib (#53735) · da50a009
      YuanRisheng 提交于
      * create phi so
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * add file
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * perfect so
      
      * fix py3 bugs
      
      * delete all static target in phi
      
      * fix windows bugs
      
      * fix py3 bugs
      
      * fix ci bugs
      
      * fix windows bugs
      
      * fix bugs: gflags can't be linked by dynamic and static lib
      
      * fix bugs that can not load 3rd party
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix conflict
      
      * fix xpu bugs
      
      * fix mac compile bugs
      
      * fix psgpu bugs
      
      * fix inference failed
      
      * deal with conflict
      
      * fix LIBRARY_PATH bug
      
      * fix windows bugs
      
      * fix onednn error
      
      * fix windows compile bugs
      
      * fix windows compile bugs
      
      * fix test_cuda_graph_static_mode_error aborted
      
      * fix windows bugs
      
      * fix mac-python3 error
      
      * fix hip compile bugs
      
      * change mode to static
      
      * change to static mode
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix bugs
      
      * add static flag
      
      * add PADDLE_API
      
      * change position of PADDLE_API
      
      * fix windows bugs
      
      * change mode to dynamic lib
      
      * fix windows static bugs
      
      * deal with conflict
      
      * fix windows unit bug
      
      * fix coverage
      
      * deal with conflict
      
      * fix windows-inference
      
      * fix py3 bugs
      
      * fix bugs when compile type_info
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix windows openblas
      
      * fix xpu bugs
      
      * fix enforce_test in windows
      
      * update code according comment
      
      * fix windows cmake bug
      
      * fix windows bugs
      
      * fix windows bugs
      
      * delete cinn unittest
      
      * fix cinn bugs
      
      ---------
      Co-authored-by: HappyHeavyRain's avatarlzydev <1528794076@qq.com>
      da50a009
  25. 23 5月, 2023 1 次提交
  26. 15 5月, 2023 1 次提交
  27. 24 4月, 2023 1 次提交
  28. 20 4月, 2023 1 次提交
  29. 13 4月, 2023 1 次提交
    • H
      [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h (#52651) · 5664ea26
      HongyuJia 提交于
      * [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h
      
      * Add logging.h for profiler.cc
      
      * Add logging.h for gloo_utils.h
      
      * Add logging.h for addmm_kernel_impl.h
      
      * Add logging.h for addmm_grad_kernel_impl.h
      
      * Add logging.h for p_send_kernel.cu
      
      * Add logging.h for determinant_grad_kernel_impl.h
      
      * Add logging.h for p_recv_kernel.cu
      
      * Add logging.h for elementwise_grad_base.h
      
      * Add logging.h for transfer_layout_kernel.cc
      
      * Add logging.h for eigvals_kernel.cc and index_select_impl.h
      
      * Add logging.h for all files in kernel directory
      
      * Add logging.h for xpu_info.cc
      
      * Add logging.h for xpu
      5664ea26
  30. 11 4月, 2023 1 次提交
  31. 10 4月, 2023 1 次提交
    • H
      [Opt Performance] Optimize custom operator performance (#52597) · 01247e33
      HongyuJia 提交于
      * [Opt Performance] Optimize custom operator performance, reconstruct python API auto-gen, add cache and use const inference
      
      * opt AutoGradMeta implementation
      
      * remove profiler codes
      
      * fix unit test
      
      * change year, 2021->2023
      
      * fix int64_t parse bug
      01247e33
  32. 03 4月, 2023 1 次提交
  33. 31 3月, 2023 1 次提交
  34. 27 3月, 2023 2 次提交
  35. 23 3月, 2023 1 次提交
  36. 21 3月, 2023 1 次提交
  37. 20 3月, 2023 3 次提交