1. 04 9月, 2023 6 次提交
  2. 02 9月, 2023 4 次提交
  3. 01 9月, 2023 7 次提交
  4. 31 8月, 2023 8 次提交
  5. 30 8月, 2023 8 次提交
    • H
      Add paddle custom flags support (#56256) · 2ef4ec71
      huangjiyi 提交于
      * update
      
      * repalce gflags header
      
      * replace DEFINE_<type> with PD_DEFINE_<type>
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update cmake
      
      * add :: before some paddle namespace
      
      * fix link error
      
      * fix CI-Py3
      
      * allow commandline parse
      
      * fix SetFlagsFromEnv
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix CI-Coverage-build
      
      * fix CI-Windows-build
      
      * fix CI-Inference
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix inference api test
      
      * fix infer_ut test
      
      * revert infer_ut gflags usage
      
      * update
      
      * fix inference
      
      * remove flags export macro
      
      * revert inference demo_ci gflags usage
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug when turn on WITH_GFLAGS
      
      * turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * update
      
      * update and add unittest
      
      * add unittest
      
      * fix conflict
      
      * rerun ci
      
      * update
      
      * resolve conflict
      2ef4ec71
    • N
    • W
      [NewIR]Gen ops_api.cc for static mode (#56653) · 59b2ad39
      WangZhen 提交于
      59b2ad39
    • K
      [NewIR] add_arg_mapping_for_fetch (#56752) · 1692af99
      kangguangli 提交于
      * add_arg_mapping_for_fetch
      
      * fix
      
      * fix
      1692af99
    • G
      [Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5
      Ghost Screaming 提交于
      * for verify
      
      fluid operator support new comm library
      
      * u
      
      * u
      
      * u
      
      * compatiable new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.
      
      * Remove useless comments in process_group.py
      
      * Polish code style.
      
      * Fix some problems.
      
      * Remove use fluid api in phi comm_context_manager.
      
      * Add PPADDLE_WITH_CUDA and PADDLE_WITH_NCCL micro judgement.
      
      * Fix bug of HIP architecture.
      
      * Fix some problems.
      1. remove useless loggings.
      2. Fix conditional compilation for HIP.
      3. Fix problems of test_pass_generation_pipeline.py. It calls paddle.distributed.init_parallel_env() at first,
      then auto.Engine calls _init_comm(), which will calls process_group.instantiate(). However, init_parallel_env() will call
      paddle.distributed.barrier(), it will call CreateNCCLEnvCache and create corresponding NCCLCommContext. But dev_id is not
      set, as a result, NCCLCommContext's dev_ctx is not initialized.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code.
      
      * Revert compatiable upgrade for communication operators. Their upgrades
      will be submitted in another PR.
      
      * Remove StaticTCPStore.
      
      * Remove useless modification.
      
      * Remove useless set_cuda_device_id.
      
      * Polish code.
      
      * Remove fluid header files in phi files.
      
      * Remove useless comments.
      
      * Fix problems of hip arch.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code style.
      
      ---------
      Co-authored-by: TaoTao Li's avatarhitywt <yuwentao126@126.com>
      ade51aa5
    • C
      [IR] Rigister LegacyKernelOp into KernelDialect (#56680) · ded10442
      chen2016013 提交于
      * Register LegacyKernelDialect & Rigister LegacyKernelOp
      
      * fix code style
      
      * delete LegacyKernelDialect ,register LegacyKernelOp into PaddleKernelDialect
      
      * fix bug
      
      * change as reviewed comments
      
      * bug fix
      
      * bug fix
      
      * try to restart coverage CI
      
      * pass legacy op to kernel pass
      
      * fix code style
      
      * fix code style
      
      * fix code style
      ded10442
    • R
      c5786be1
    • N
      [clang-tidy][task 61] enable `hicpp-exception-baseclass` and fix existing errors (#55847) · 31a96888
      Nyakku Shigure 提交于
      * [clang-tidy] enable `hicpp-exception-baseclass` and fix existing errors
      
      * config
      
      * update error format to pass the ci check (at least 20 chars)
      31a96888
  6. 29 8月, 2023 7 次提交
    • R
      [CustomDevice] Not reset pass_builder (#56755) · 220f13bd
      ronnywang 提交于
      220f13bd
    • Z
      [NewIR] support c_sync_calc_stream/c_sync_comm_stream/send_v2/recv_v2 (#56557) · 0ce66c1c
      zhaoyingli 提交于
      * [AutoParallel][NewIR] support calc_sync/comm_sync/send_v2/recv_v2
      
      * pre-commit
      
      * rm unittest
      
      * tiny fix
      
      * api_gen support send_v2's output is empty
      
      * fix format
      
      * python_c_gen support send_v2
      0ce66c1c
    • F
      Fix instant variable oom in paddle2cinn (#56662) · df9d9c59
      Fisher 提交于
      When using paddle2cinn, CompilationContext.with_instantiate_variables should be set to false, otherwise CINN will instant and manage variables memory, this leads to double the memory usage, which eventually leads to out of memory error.
      This PR will set CompilationContext.with_instantiate_variables to false before context pass to constructing the graph compiler.
      df9d9c59
    • C
      Vjp autogen for grad list op(split) (#56720) · 128f95a1
      Chen Zhiyang 提交于
      * add vjp code gen for SplitOp
      
      * change vjp manual file name
      128f95a1
    • L
      [New-IR] add pass registry (#56729) · 9999e849
      Leo Chen 提交于
      * add pass registry
      
      * add pass registry macro
      9999e849
    • S
      Remove need_move_to_phi (#56371) · daac3829
      Sonder 提交于
      * remove flag
      
      * open static build flag
      
      * add searchsorted to list
      
      * add register info for fused layernorm
      
      * fix fused_layernorm_kernel output registe info
      
      * fix stft registe info
      
      * add include
      
      * fix registe info
      
      * add skip fake init for fused_layernorm:residual_out
      
      * fix error
      
      * add distributed_fused_lamb_init to StaticBuildBlackList
      
      * set static_build flag to false
      daac3829
    • G
      [Fluid] move lars_momentum to phi (#55798) · b0c2ee26
      gouzil 提交于
      * [Fluid] move lars_momentum to phi
      
      * add sig
      
      * fix optional Output
      
      * off check_dygraph
      
      * fix input
      
      * fix operator[]
      
      * fix
      
      * try fix AllocateTmpTensor
      
      * fix
      
      * fix type
      
      * Update paddle/phi/kernels/gpu/lars_momentum_kernel.cu
      
      * fix type
      
      * rollback
      
      * Add Registration
      
      * try fix win
      
      * try fix win
      
      * try use double
      
      * try use operator *(float,const Derived &)
      
      * try auto
      
      * fix
      
      * fix
      
      * fix
      
      * fix dtype
      
      * fix type
      
      * fix index
      b0c2ee26