1. 04 9月, 2023 3 次提交
  2. 02 9月, 2023 1 次提交
  3. 01 9月, 2023 1 次提交
  4. 31 8月, 2023 2 次提交
    • L
      use macro instead of functor (#56726) · 5425ad7f
      LiYuRio 提交于
      5425ad7f
    • C
      [AutoParallel] Adapt static spmd rules for dynamic graph (#56367) · 54fcd9a9
      Chen Weihang 提交于
      * move matmul spmd rules into phi
      
      * add basic infer spmd utils
      
      * addspmd factory
      
      * fix compile error
      
      * add unittest
      
      * refine infer spmd test and utils
      
      * debug infer spmd test
      
      * adapt python test
      
      * poish details
      
      * change to vector attr arg
      
      * revert needless change
      
      * update matmul spmd rule test
      
      * remove original rule
      
      * polish details
      
      * fix marco error
      
      * add comment
      
      * pass backward test
      
      * fix compile error
      
      * add cmake rule for spmd_rules_test
      
      * add dist meta tensor
      
      * update pybind impl
      
      * add marco for rules
      54fcd9a9
  5. 30 8月, 2023 4 次提交
    • H
      Add paddle custom flags support (#56256) · 2ef4ec71
      huangjiyi 提交于
      * update
      
      * repalce gflags header
      
      * replace DEFINE_<type> with PD_DEFINE_<type>
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update cmake
      
      * add :: before some paddle namespace
      
      * fix link error
      
      * fix CI-Py3
      
      * allow commandline parse
      
      * fix SetFlagsFromEnv
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix CI-Coverage-build
      
      * fix CI-Windows-build
      
      * fix CI-Inference
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix inference api test
      
      * fix infer_ut test
      
      * revert infer_ut gflags usage
      
      * update
      
      * fix inference
      
      * remove flags export macro
      
      * revert inference demo_ci gflags usage
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug when turn on WITH_GFLAGS
      
      * turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * update
      
      * update and add unittest
      
      * add unittest
      
      * fix conflict
      
      * rerun ci
      
      * update
      
      * resolve conflict
      2ef4ec71
    • N
    • W
      [NewIR]Gen ops_api.cc for static mode (#56653) · 59b2ad39
      WangZhen 提交于
      59b2ad39
    • G
      [Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5
      Ghost Screaming 提交于
      * for verify
      
      fluid operator support new comm library
      
      * u
      
      * u
      
      * u
      
      * compatiable new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.
      
      * Remove useless comments in process_group.py
      
      * Polish code style.
      
      * Fix some problems.
      
      * Remove use fluid api in phi comm_context_manager.
      
      * Add PPADDLE_WITH_CUDA and PADDLE_WITH_NCCL micro judgement.
      
      * Fix bug of HIP architecture.
      
      * Fix some problems.
      1. remove useless loggings.
      2. Fix conditional compilation for HIP.
      3. Fix problems of test_pass_generation_pipeline.py. It calls paddle.distributed.init_parallel_env() at first,
      then auto.Engine calls _init_comm(), which will calls process_group.instantiate(). However, init_parallel_env() will call
      paddle.distributed.barrier(), it will call CreateNCCLEnvCache and create corresponding NCCLCommContext. But dev_id is not
      set, as a result, NCCLCommContext's dev_ctx is not initialized.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code.
      
      * Revert compatiable upgrade for communication operators. Their upgrades
      will be submitted in another PR.
      
      * Remove StaticTCPStore.
      
      * Remove useless modification.
      
      * Remove useless set_cuda_device_id.
      
      * Polish code.
      
      * Remove fluid header files in phi files.
      
      * Remove useless comments.
      
      * Fix problems of hip arch.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code style.
      
      ---------
      Co-authored-by: TaoTao Li's avatarhitywt <yuwentao126@126.com>
      ade51aa5
  6. 29 8月, 2023 3 次提交
  7. 28 8月, 2023 3 次提交
  8. 25 8月, 2023 7 次提交
  9. 24 8月, 2023 3 次提交
  10. 23 8月, 2023 1 次提交
    • L
      Integrate TRT qdq layers (#54803) · ae84c603
      Leo Chen 提交于
      * Integrate quantize/dequantize linear and add config for explicit quantization
      
      * Fix the build error
      
      * Add macro for TRT version < 8.0
      
      * Remove qdq UT from windows
      
      * Fix UT failure
      
      * Check TRT version in qdq UT
      
      * Test tensorrt_explicit_enabled API
      
      * Disable QDQ UT if TRT version < 8.5
      
      * Add quantization postfix into public APIs
      
      * Apply code formatter
      
      * Fix the UT failure for explicit quantization
      
      * Apply code formatter on modified files
      
      * Correct the year in copyright
      ae84c603
  11. 22 8月, 2023 1 次提交
    • C
      [AutoParallel] Polish dist tensor design (#56368) · 8495377a
      Chen Weihang 提交于
      * polish dist teensor design
      
      * adjust constructor
      
      * polish details
      
      * polish details design
      
      * fix compile error
      
      * refactor init tensor impl
      
      * fix reshard test
      
      * polish details
      
      * add unittest for coverage
      8495377a
  12. 21 8月, 2023 3 次提交
  13. 18 8月, 2023 2 次提交
  14. 17 8月, 2023 4 次提交
  15. 16 8月, 2023 2 次提交
    • N
      2951521a
    • C
      [AutoParallel] Dygraph basic impl for semi auto parallel (#55698) · 7039bef3
      Chen Weihang 提交于
      * add phi forward api gen impl
      
      * add phi backward gen code
      
      * polish api code gen impl
      
      * polish code gen impl
      
      * remove auto_paralel namespace
      
      * add dygraph forward impl
      
      * add for_auto_parallel cond
      
      * fix code gen errors
      
      * add dygraph backward impl
      
      * resolve conflict with develop
      
      * refactor dist api gen impl
      
      * revert origin api gen impl
      
      * replace template for override func
      
      * fix dnnl marco error
      
      * revert third_party change
      
      * add with distributed marco
      
      * Update grad_tensor_holder.cc details
      
      * merge dist tensor constructor
      
      * change test tensor to replicate
      
      * fx typo
      
      * resolve conflict with develop
      
      * fix out dim error
      7039bef3