- 30 8月, 2023 17 次提交
-
-
由 张春乔 提交于
* xdoc * Update python/paddle/tensor/einsum.py * Update einsum.py * Apply suggestions from code review * Update einsum.py * Apply suggestions from code review
-
由 Nyakku Shigure 提交于
-
由 xuxinyi389 提交于
* fix bugs of tp * fix bugs of tp * fix bugs * fix bugs * fix bugs of md5
-
由 ronnywang 提交于
-
由 WangZhen 提交于
-
由 kangguangli 提交于
* add_arg_mapping_for_fetch * fix * fix
-
由 Ghost Screaming 提交于
* for verify fluid operator support new comm library * u * u * u * compatiable new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter. * Remove useless comments in process_group.py * Polish code style. * Fix some problems. * Remove use fluid api in phi comm_context_manager. * Add PPADDLE_WITH_CUDA and PADDLE_WITH_NCCL micro judgement. * Fix bug of HIP architecture. * Fix some problems. 1. remove useless loggings. 2. Fix conditional compilation for HIP. 3. Fix problems of test_pass_generation_pipeline.py. It calls paddle.distributed.init_parallel_env() at first, then auto.Engine calls _init_comm(), which will calls process_group.instantiate(). However, init_parallel_env() will call paddle.distributed.barrier(), it will call CreateNCCLEnvCache and create corresponding NCCLCommContext. But dev_id is not set, as a result, NCCLCommContext's dev_ctx is not initialized. * Fix some problems. * Polish code. * Polish code. * Revert compatiable upgrade for communication operators. Their upgrades will be submitted in another PR. * Remove StaticTCPStore. * Remove useless modification. * Remove useless set_cuda_device_id. * Polish code. * Remove fluid header files in phi files. * Remove useless comments. * Fix problems of hip arch. * Fix some problems. * Polish code. * Polish code style. --------- Co-authored-by: hitywt <yuwentao126@126.com>
-
由 chen2016013 提交于
* Register LegacyKernelDialect & Rigister LegacyKernelOp * fix code style * delete LegacyKernelDialect ,register LegacyKernelOp into PaddleKernelDialect * fix bug * change as reviewed comments * bug fix * bug fix * try to restart coverage CI * pass legacy op to kernel pass * fix code style * fix code style * fix code style
-
由 cyber-pioneer 提交于
* support prim all in new ir * process makefile * fix rule bug * polish case * fix flag * fix rules bug
-
由 Nyakku Shigure 提交于
-
由 ronnywang 提交于
-
由 Nyakku Shigure 提交于
* [clang-tidy] enable `hicpp-exception-baseclass` and fix existing errors * config * update error format to pass the ci check (at least 20 chars)
-
由 yoyoIcy 提交于
-
由 gouzil 提交于
-
由 iSerendipity 提交于
* 【complex op】No.6 add complex support for logical_and/or/xor/not * fix dtype check * modify the docs * add special condition for not raise when x.dtype is complex * add random generate for complex dtype * fix generate for complex * fix * fix * add corner case for complex type * fix ut * fix ut
-
由 LoneRanger 提交于
* fix sample code * fix bug * fix bug * Update regularizer.py * Update __init__.py * Update decorator.py * fix code-style
-
由 张春乔 提交于
* weight_norm_hook * Update weight_norm_hook.py * Update weight_norm_hook.py * Update python/paddle/nn/utils/weight_norm_hook.py * Update python/paddle/nn/utils/weight_norm_hook.py * Update python/paddle/nn/utils/weight_norm_hook.py Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com> * xdoc * Apply suggestions from code review * Apply suggestions from code review --------- Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>
-
- 29 8月, 2023 23 次提交
-
-
由 ronnywang 提交于
-
由 zhaoyingli 提交于
* [AutoParallel][NewIR] support calc_sync/comm_sync/send_v2/recv_v2 * pre-commit * rm unittest * tiny fix * api_gen support send_v2's output is empty * fix format * python_c_gen support send_v2
-
由 Fisher 提交于
When using paddle2cinn, CompilationContext.with_instantiate_variables should be set to false, otherwise CINN will instant and manage variables memory, this leads to double the memory usage, which eventually leads to out of memory error. This PR will set CompilationContext.with_instantiate_variables to false before context pass to constructing the graph compiler.
-
由 Chen Zhiyang 提交于
* add vjp code gen for SplitOp * change vjp manual file name
-
由 张春乔 提交于
* Update dlpack.py * Apply suggestions from code review * Apply suggestions from code review * xdoc * Apply suggestions from code review * Apply suggestions from code review
-
由 Leo Chen 提交于
* add pass registry * add pass registry macro
-
由 张春乔 提交于
* 240 * fix bugs * fix bugs
-
由 Sonder 提交于
* remove flag * open static build flag * add searchsorted to list * add register info for fused layernorm * fix fused_layernorm_kernel output registe info * fix stft registe info * add include * fix registe info * add skip fake init for fused_layernorm:residual_out * fix error * add distributed_fused_lamb_init to StaticBuildBlackList * set static_build flag to false
-
由 duanyanhui 提交于
* support cum & multinomial for dcu * rm commt
-
由 ronnywang 提交于
-
由 ronnywang 提交于
-
由 iSerendipity 提交于
-
由 小飞猪 提交于
[xdoctest][task 200] reformat example code with google style in `python/paddle/tensor/creation.py` (#56685) * [Doctest]fix No.200, test=docs_preview * fix output * add retain_grads * fix style
-
由 张春乔 提交于
-
由 xiaoguoguo626807 提交于
* modify test comp grad * modify test comp grad
-
由 Xavier ZXY 提交于
-
由 gouzil 提交于
* [Fluid] move lars_momentum to phi * add sig * fix optional Output * off check_dygraph * fix input * fix operator[] * fix * try fix AllocateTmpTensor * fix * fix type * Update paddle/phi/kernels/gpu/lars_momentum_kernel.cu * fix type * rollback * Add Registration * try fix win * try fix win * try use double * try use operator *(float,const Derived &) * try auto * fix * fix * fix * fix dtype * fix type * fix index
-
由 lzy 提交于
-
由 zhangbo9674 提交于
This reverts commit 1409e4ec.
-
由 cyberslack_lee 提交于
* fix * fix
-
由 xiaoye 提交于
[clang-tidy] No. 53,54 enable cppcoreguidelines-c-copy-assignment-signature and bugprone-use-after-move (#56601)
-
由 Nyakku Shigure 提交于
-
由 小飞猪 提交于
[xdoctest][task 181-183] reformat example code with google style in `sparse/multiary.py`,`distributed/auto_parallel/*` (#56665) * [Doctest]fix No.181-183, test=docs_preview * add env skip
-