1. 06 9月, 2023 2 次提交
  2. 05 9月, 2023 7 次提交
  3. 04 9月, 2023 9 次提交
  4. 01 9月, 2023 7 次提交
  5. 31 8月, 2023 7 次提交
  6. 30 8月, 2023 6 次提交
    • K
      [NewIR] fix logical op infermeta (#56711) · 987cb97e
      kangguangli 提交于
      * fix logical op infermeta
      
      * add test
      
      * adpat inplace api
      987cb97e
    • H
      Add paddle custom flags support (#56256) · 2ef4ec71
      huangjiyi 提交于
      * update
      
      * repalce gflags header
      
      * replace DEFINE_<type> with PD_DEFINE_<type>
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update cmake
      
      * add :: before some paddle namespace
      
      * fix link error
      
      * fix CI-Py3
      
      * allow commandline parse
      
      * fix SetFlagsFromEnv
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix CI-Coverage-build
      
      * fix CI-Windows-build
      
      * fix CI-Inference
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix inference api test
      
      * fix infer_ut test
      
      * revert infer_ut gflags usage
      
      * update
      
      * fix inference
      
      * remove flags export macro
      
      * revert inference demo_ci gflags usage
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug when turn on WITH_GFLAGS
      
      * turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * update
      
      * update and add unittest
      
      * add unittest
      
      * fix conflict
      
      * rerun ci
      
      * update
      
      * resolve conflict
      2ef4ec71
    • R
    • G
      [Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5
      Ghost Screaming 提交于
      * for verify
      
      fluid operator support new comm library
      
      * u
      
      * u
      
      * u
      
      * compatiable new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.
      
      * Remove useless comments in process_group.py
      
      * Polish code style.
      
      * Fix some problems.
      
      * Remove use fluid api in phi comm_context_manager.
      
      * Add PPADDLE_WITH_CUDA and PADDLE_WITH_NCCL micro judgement.
      
      * Fix bug of HIP architecture.
      
      * Fix some problems.
      1. remove useless loggings.
      2. Fix conditional compilation for HIP.
      3. Fix problems of test_pass_generation_pipeline.py. It calls paddle.distributed.init_parallel_env() at first,
      then auto.Engine calls _init_comm(), which will calls process_group.instantiate(). However, init_parallel_env() will call
      paddle.distributed.barrier(), it will call CreateNCCLEnvCache and create corresponding NCCLCommContext. But dev_id is not
      set, as a result, NCCLCommContext's dev_ctx is not initialized.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code.
      
      * Revert compatiable upgrade for communication operators. Their upgrades
      will be submitted in another PR.
      
      * Remove StaticTCPStore.
      
      * Remove useless modification.
      
      * Remove useless set_cuda_device_id.
      
      * Polish code.
      
      * Remove fluid header files in phi files.
      
      * Remove useless comments.
      
      * Fix problems of hip arch.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code style.
      
      ---------
      Co-authored-by: TaoTao Li's avatarhitywt <yuwentao126@126.com>
      ade51aa5
    • G
    • iSerendipity's avatar
      【complex op】No.6 add complex support for logical_and/or/xor/not (#56323) · 5cbf5bd4
      iSerendipity 提交于
      * 【complex op】No.6 add complex support for logical_and/or/xor/not
      
      * fix dtype check
      
      * modify the docs
      
      * add special condition for not raise when x.dtype is complex
      
      * add random generate for complex dtype
      
      * fix generate for complex
      
      * fix
      
      * fix
      
      * add corner case for complex type
      
      * fix ut
      
      * fix ut
      5cbf5bd4
  7. 29 8月, 2023 2 次提交
    • Z
      [NewIR] support c_sync_calc_stream/c_sync_comm_stream/send_v2/recv_v2 (#56557) · 0ce66c1c
      zhaoyingli 提交于
      * [AutoParallel][NewIR] support calc_sync/comm_sync/send_v2/recv_v2
      
      * pre-commit
      
      * rm unittest
      
      * tiny fix
      
      * api_gen support send_v2's output is empty
      
      * fix format
      
      * python_c_gen support send_v2
      0ce66c1c
    • S
      Remove need_move_to_phi (#56371) · daac3829
      Sonder 提交于
      * remove flag
      
      * open static build flag
      
      * add searchsorted to list
      
      * add register info for fused layernorm
      
      * fix fused_layernorm_kernel output registe info
      
      * fix stft registe info
      
      * add include
      
      * fix registe info
      
      * add skip fake init for fused_layernorm:residual_out
      
      * fix error
      
      * add distributed_fused_lamb_init to StaticBuildBlackList
      
      * set static_build flag to false
      daac3829