1. 30 Aug, 2023 4 commits
    • Add paddle custom flags support (#56256) · 2ef4ec71
      Committed by huangjiyi
      * update
      
      * replace gflags header
      
      * replace DEFINE_<type> with PD_DEFINE_<type>
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update cmake
      
      * add :: before some paddle namespace
      
      * fix link error
      
      * fix CI-Py3
      
      * allow commandline parse
      
      * fix SetFlagsFromEnv
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix CI-Coverage-build
      
      * fix CI-Windows-build
      
      * fix CI-Inference
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix inference api test
      
      * fix infer_ut test
      
      * revert infer_ut gflags usage
      
      * update
      
      * fix inference
      
      * remove flags export macro
      
      * revert inference demo_ci gflags usage
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug when turn on WITH_GFLAGS
      
      * turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * update
      
      * update and add unittest
      
      * add unittest
      
      * fix conflict
      
      * rerun ci
      
      * update
      
      * resolve conflict
    • [NewIR]Gen ops_api.cc for static mode (#56653) · 59b2ad39
      Committed by WangZhen
    • [Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5
      Committed by Ghost Screaming
      * for verify
      
      fluid operator support new comm library
      
      * u
      
      * u
      
      * u
      
      * compatible new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.
      
      * Remove useless comments in process_group.py
      
      * Polish code style.
      
      * Fix some problems.
      
      * Remove use fluid api in phi comm_context_manager.
      
      * Add PADDLE_WITH_CUDA and PADDLE_WITH_NCCL macro judgement.
      
      * Fix bug of HIP architecture.
      
      * Fix some problems.
      1. Remove useless logging.
      2. Fix conditional compilation for HIP.
      3. Fix problems in test_pass_generation_pipeline.py. It first calls paddle.distributed.init_parallel_env(),
      then auto.Engine calls _init_comm(), which calls process_group.instantiate(). However, init_parallel_env() calls
      paddle.distributed.barrier(), which calls CreateNCCLEnvCache and creates the corresponding NCCLCommContext. Because dev_id is not
      set at that point, NCCLCommContext's dev_ctx is not initialized.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code.
      
      * Revert compatible upgrade for communication operators. Their upgrades
      will be submitted in another PR.
      
      * Remove StaticTCPStore.
      
      * Remove useless modification.
      
      * Remove useless set_cuda_device_id.
      
      * Polish code.
      
      * Remove fluid header files in phi files.
      
      * Remove useless comments.
      
      * Fix problems of hip arch.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code style.
      
      ---------
      Co-authored-by: hitywt <yuwentao126@126.com>
  2. 29 Aug, 2023 3 commits
  3. 28 Aug, 2023 3 commits
  4. 25 Aug, 2023 7 commits
  5. 24 Aug, 2023 3 commits
  6. 23 Aug, 2023 1 commit
    • Integrate TRT qdq layers (#54803) · ae84c603
      Committed by Leo Chen
      * Integrate quantize/dequantize linear and add config for explicit quantization
      
      * Fix the build error
      
      * Add macro for TRT version < 8.0
      
      * Remove qdq UT from windows
      
      * Fix UT failure
      
      * Check TRT version in qdq UT
      
      * Test tensorrt_explicit_enabled API
      
      * Disable QDQ UT if TRT version < 8.5
      
      * Add quantization postfix into public APIs
      
      * Apply code formatter
      
      * Fix the UT failure for explicit quantization
      
      * Apply code formatter on modified files
      
      * Correct the year in copyright
  7. 22 Aug, 2023 1 commit
    • [AutoParallel] Polish dist tensor design (#56368) · 8495377a
      Committed by Chen Weihang
      * polish dist tensor design
      
      * adjust constructor
      
      * polish details
      
      * polish details design
      
      * fix compile error
      
      * refactor init tensor impl
      
      * fix reshard test
      
      * polish details
      
      * add unittest for coverage
  8. 21 Aug, 2023 3 commits
  9. 18 Aug, 2023 2 commits
  10. 17 Aug, 2023 4 commits
  11. 16 Aug, 2023 2 commits
    • 2951521a
    • [AutoParallel] Dygraph basic impl for semi auto parallel (#55698) · 7039bef3
      Committed by Chen Weihang
      * add phi forward api gen impl
      
      * add phi backward gen code
      
      * polish api code gen impl
      
      * polish code gen impl
      
      * remove auto_parallel namespace
      
      * add dygraph forward impl
      
      * add for_auto_parallel cond
      
      * fix code gen errors
      
      * add dygraph backward impl
      
      * resolve conflict with develop
      
      * refactor dist api gen impl
      
      * revert origin api gen impl
      
      * replace template for override func
      
      * fix dnnl macro error
      
      * revert third_party change
      
      * add with distributed macro
      
      * Update grad_tensor_holder.cc details
      
      * merge dist tensor constructor
      
      * change test tensor to replicate
      
      * fix typo
      
      * resolve conflict with develop
      
      * fix out dim error
  12. 15 Aug, 2023 2 commits
  13. 14 Aug, 2023 2 commits
  14. 11 Aug, 2023 2 commits
    • [NewIR]Polish IR code (#56087) · 8cf4b1c2
      Committed by YuanRisheng
      * perfect code
      
      * delete __all__
    • [PRIM][IR]fix comment for vjp (#56137) · 533b62ff
      Committed by Charles-hit
      * [prim][newir] add basic framework for primitive
      
      * support desctensor in new ir
      
      * add vjp interface
      
      * support vjp in new ir
      
      * support vjp in new ir
      
      * polish vjp interface
      
      * fix stop_gradients set
      
      * fix vjp dispatch
      
      * add comment
      
      * add vjp test for new ir
      
      * add test for tanh vjp
      
      * [prim][newir] add basic framework for primitive
      
      * support desctensor in new ir
      
      * support vjp in new ir
      
      * support vjp in new ir
      
      * polish vjp interface
      
      * fix stop_gradients set
      
      * fix vjp dispatch
      
      * add comment
      
      * add vjp test for new ir
      
      * add test for tanh vjp
      
      * add eager and static backend to wrap lower level api
      
      * support call_vjp pybind
      
      * polish code and add test for vjp
      
      * remove useless code
      
      * polish code
      
      * remove useless code
      
      * support mean vjp
      
      * add test for mean vjp and support has_vjp function
      
      * fix call_vjp
      
      * polish code
      
      * add primitive ops set for backend
      
      * add vjp test for tanh_
      
      * fix inference CI
      
      * fix inference ci
      
      * modify fluid cmake
      
      * remove useless deps
      
      * add cmake
      
      * fix comment
      
      * fix test
      
      * polish code
      
      * modify backward stop_gradients
      
      * modify static_backend.cc
      
      * remove useless code
      
      ---------
      Co-authored-by: cxxly <chenxx_id@163.com>
      Co-authored-by: zhangbo9674 <zhangbo54@baidu.com>
  15. 10 Aug, 2023 1 commit