- 05 9月, 2023 18 次提交
-
-
由 Ghost Screaming 提交于
* [WIP] Support std::vector<phi::Tensor> input and output for DistTensor. Concat forward and backward are verified. * Polish code for new dist tensor implementation. * Fix bug of DistTensor upgrade. Add support functions for std::vector<Tensor> -> std::vector<Tensor>. * Add support for DistTensor type of std::vector<phi::Tensor> as input or output of operators. Following testcases are passed. 1. concat: std::vector<phi::Tensor> -> phi::Tensor 2. unbind: phi::Tensor -> std::vector<phi::Tensor> 3. broadcast_tensors: std::vector<phi::Tensor> -> std::vector<phi::Tensor> * Polish code. Remove useless comments. * Add update_loss_scaling in skip_op_lists. * Polish code.
-
由 gouzil 提交于
* [clang-tidy] replenish cppcoreguidelines-narrowing-conversions * fix * fix
-
由 Wennie396 提交于
* replace amp.use_pure_fp16 with amp.dtype and amp.level * old api still use use_pure_fp16 * test_fuse_adamw_pass still use use_pure_fp16 * add test case tuning recompute with amp(float16,o2) * reset new test case properties TIMEOUT 60 * set smaller value of batch_size and batch_num * deepcopy dist_context fix _rename_input problem * fix loss name after cast * set tuning.enable=True and use engine._tune() * restore some changes in _rename_input()/_rename_output() * add self.amp_dtype for _cast_loss() in auto_parallel_amp.py * fix insert op index in _cast_loss()
-
由 gouzil 提交于
* [Fluid] move lars_momentum_xpu to phi * Empty-Commit;test=kunlun;
-
由 tianshuo78520a 提交于
-
由 tianshuo78520a 提交于
* Fix openblas * fix bug * Fix * Fix log * Fix log * Fix * Fix
-
由 huangjiyi 提交于
* update * update * fix empty arg * update test * update * fix bug * update
-
由 ronnywang 提交于
-
由 小飞猪 提交于
[xdoctest][task 184-185] reformat example code with google style in `distributed/auto_parallel/static/*` (#56666) * [Doctest]fix No.184,185, test=docs_preview * add env skip * fix @staticmethod * fix * add xdoctest for v2 * fix
-
由 KongAKun 提交于
* Fix styles of code * update the GPU option * add the GPU setup * remove the note * update the code
-
由 Ryan 提交于
* add docx * fix conversation * fix * fix code style * fix code style * Thank SigureMo * finished? * add ... and fix line * add skip doctest
-
由 张春乔 提交于
* Update private_helper_function.py * xdoc * Apply suggestions from code review * format and skip a to_static case * add a missing space --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 iSerendipity 提交于
[xdoctest][task 224-225] reformat example code with google style in `python/paddle/distributed/fleet` (#56815) * [Doctest]fix No.224-225, test=docs_preview * fix the AttributeError
-
由 Charles-hit 提交于
-
由 jiangfan06 提交于
-
由 jiangfan06 提交于
-
由 xiaoye 提交于
[clang-tidy] No. 57,58 cppcoreguidelines-explicit-virtual-functions clang-analyzer-core.NonNullParamChecker (#56649)
-
由 xiaoye 提交于
-
- 04 9月, 2023 20 次提交
-
-
由 WangZhen 提交于
-
由 tianhaodongbd 提交于
* add rotate_half in fused_rope * add position_ids in fused_rope * modified examples about fused_rope * add set_device in examples
-
由 Sonder 提交于
* remove has_structed_kerenl and has_fluid_kernel * add test_fused_layernorm_op to STATIC_BUILD_TESTS list * open static_build flag * remove distributed_fused_lamb_init from StaticBuildBlackList * use initialized replacing IsInitialized * recover codes * delete useless codes * close the flag
-
由 ShenLiang 提交于
-
由 Yuanle Liu 提交于
-
由 niuliling123 提交于
* add num_splist for flash_attn_bwd and FlashAttnUnpaddedGradKernel * Add assertTrue * Update submodule to a specific commit
-
由 wanghuancoder 提交于
* disable strided split
-
由 zhaoyingli 提交于
* [NewIR]add c_allreduce_sum/c_identity/c_reduce_sum/c_embedding/c_embedding_grad * rm VLOG * rm c_identity from LegacyOpList * rm VLOG * rm c_reduce_sum
-
由 tianshuo78520a 提交于
* Fix eigen build * Fix * Fix
-
由 zhangbo9674 提交于
* add code * add code * refine code * add code * fix bug * fix bug * fix bug * add code * add ut * polish code * fix bug * refine code * fix bug * refine code * fix bug * refine code * fix bug * refine code * fix bug * refine code * add code * fix bug * fix bug * fix bug * fix bug * fix bug * refine code
-
由 WangZhen 提交于
-
由 ming1753 提交于
* Modify MarkTrtEngineOutputs API
-
由 huangjiyi 提交于
* update * fix bug * fix bug * fix bug * fix bug * rerun ci * turn off shared_phi
-
由 huangjiyi 提交于
* update * update * update
-
由 duanyanhui 提交于
-
由 BiynXu 提交于
Change FlattenLoops in the elementwise schedule to Fuse
-
由 LiYuRio 提交于
-
由 engineer1109 提交于
-
由 wanghuancoder 提交于
* fix contiguous
-
由 wanghuancoder 提交于
-
- 03 9月, 2023 1 次提交
-
-
由 Chen Zhiyang 提交于
* add reference of lbfgs * add reference of lbfgs * new ir op test v1.0 * fix new ir optest bug1.0 * modify two testcase bug * add new ir white list & pass test_mean_op.py * rename white list * add new_ir_guard * new ir sum op test all pass * rename backward.grad as ir_backward.grad * check place for new ir * fix test_build_model env bug * fix test_prim_program backward bug * change backward to ir_backward in check_appr * add check_new_ir flag for mkldnn * clean --------- Co-authored-by: Nwangruting <wangruting@baidu.com>
-
- 02 9月, 2023 1 次提交
-
-
由 Leo Chen 提交于
* polish code of pass and executor * update ut
-