- 28 11月, 2021 1 次提交
-
-
由 zmx 提交于
-
- 26 11月, 2021 4 次提交
-
-
由 zyfncg 提交于
slice_grad op在选择kernel过程中出现错误,问题原因是在获取use_mkldnn属性时,map中未找到该键值,所以抛出out_of_range异常 本PR在map获取use_mkldnn属性数据前增加了是否存在该键值的判断逻辑,从而避免出现上述异常
-
由 zhouweiwei2014 提交于
cherry-pick #36714
-
由 zmx 提交于
* fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * [heterps]bug fix for _run_from_dataset * fix heter_server.cc * fix launch_utils.py * fix heter_section_worker.cc * fix. test=develop * fix. test=develop
-
由 LielinJiang 提交于
-
- 25 11月, 2021 4 次提交
-
-
由 Steffy-zxf 提交于
* fix data parallel when VOCAB var in program * fix ci coverage
-
由 kuizhiqing 提交于
-
由 zhangkaihuo 提交于
目前的fused_attention_op不支持attn_mask=None的输入,本PR对此进行了补充,并补充了相应的单测逻辑。
-
由 pangyoki 提交于
Cherry-pick PR 37420, fix inplace bug when the first grad_var(loss_grad) is inplace var (#37420) (#37488) fix inplace bug,Cherry pick PR #37420
-
- 24 11月, 2021 1 次提交
-
-
由 Li Min 提交于
Add support for bias is none for fused_attention op.
-
- 23 11月, 2021 9 次提交
-
-
由 lilong12 提交于
-
由 zmx 提交于
* bug fix for DeserializeSelectedRows. test=develop (#36520) * fix SerializeSelectedRows (#36543) * bug fix for DeserializeSelectedRows. test=develop * fix bug for SerializeSelectedRows. test=develop * update. test=develop * [Heterps]Refactor Heter Pipeline Parameter Server (#36845) * change username * fix * fix * fix * fix * fix * update * update * update unittests * fix * update * fix * update * fix * fix * fix * update * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update send_and_recv op. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix unit. notest,test=coverage * fix ut. notest, test=coverage * update. notest,test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix. notest, test=coverage * fix. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * add func. notest, test=coverage * fix ut. notest, test=coverage * fix. test=develop * fix. test=develop * Fix unit test for send_and_recv_cpu & send_and_recv_gpu (#37129) * [heterps]fix ut for heter_pipeline_trainer.cc (#37136) * fix ut. test=develop * fix ut. test=develop * [heterps]bug fix for local training with --heter_worker_num (#37166) * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * [heterps]Refactor heterogenous worker (#37244) * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * refactor heter trainer. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * [heterps]add heterps mode judgement (#37298) * [heterps]change default executor for heter trainer (#37314) * fix pslib. test=develop * add device to train_from_dataset. test=develop * refine fleet.stop_worker. test=develop * fix ut. test=develop * fix ut. test=develop * fix executor & ut. test=develop * fix executor & ut. test=develop * fix executor & ut. test=develop * [heterps]remove api for heter pipeline ps (#37396) * fix api. test=develop * fix api. test=develop * fix code style. test=release/2.2 * fix CMakeLists. test=develop (#37454)
-
由 zhupengyang 提交于
-
由 wangguanqun 提交于
* save/load in ps runtime(the_one_ps) (#36097) * add trainer desc config to distributed strategy * code style modified * data_feed set lod * fix bug * code style * fix bug * save load * save load * save unittest * add unittest of the_one_ps * unittest * add todo in communicator sendsparse * fix bug in save_inference_model (#37362)
-
由 sneaxiy 提交于
* enhance scatter err msg check * fix ci error
-
由 0x45f 提交于
[Dy2stat]Allow users to switch eval/train mode when using @to_static to decorate a function (#37383) (#37432) 本PR之前使用@to_static装饰一个单独的function时,对于生成的Program无法切换train/eval模式,只能运行在train模式下。这也就导致动转静后用户多次调用function显存会一直增长。 本PR之后,使用@to_static装饰一个单独的function时,可以通过function.train()或者function.eval()的方式来切换train/eval模式。
-
由 Wilber 提交于
-
由 JingZhuangzhuang 提交于
-
由 JingZhuangzhuang 提交于
-
- 22 11月, 2021 3 次提交
-
-
由 ceci3 提交于
* fix a quantization bug Co-authored-by: NXGZhang <46363693+XGZhang11@users.noreply.github.com>
-
由 Siming Dai 提交于
* Add paddle.incubate.graph_send_recv API * fix bug in CudaAtomicMin and CudaAtomicMax * add empty line
-
由 Li Min 提交于
fix bug to support dropout eval grad computing. cherry-pick #37305.
-
- 19 11月, 2021 3 次提交
-
-
由 0x45f 提交于
该PR使得动转静模块能够正确转换如下的for i in [1, 2, 3]语句。
-
由 0x45f 提交于
set net.forward to original forward function in flops when net is a dy2stat model.
-
由 Liu-xiandong 提交于
* fix cusparse compile bug in CUDA11.2, test=develop * modify sparse_attention docs, test=document_fix (#36554) * modify sparse_attention docs, test=develop * add warning * add warning ,test=document_fix
-
- 17 11月, 2021 4 次提交
-
-
由 Wangzheee 提交于
* fix_qkv_plugin: half_scale * [Paddle-Inference] fix_qkv_plugin: fix half scale
-
由 Wangzheee 提交于
-
由 Wangzheee 提交于
-
由 JingZhuangzhuang 提交于
-
- 16 11月, 2021 3 次提交
-
-
由 zhangkaihuo 提交于
修复了fused_transformer_encoder_layer fine-tune过程发现的一些问题: fused_attention_op添加attn_mask=None的支持:PR pre_layer_norm处理问题:PR 参数处理,计算错误的问题:PR add_bias计算错误问题:PR 添加pure fp16的支持:PR
-
由 zyfncg 提交于
修复了一维Tensor在使用省略号(...)索引时维度检测异常的问题。
-
由 石晓伟 提交于
Co-authored-by: NPei Yang <peiyang@baidu.com>
-
- 15 11月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
* add mlperf optimization PRs * update
-
- 10 11月, 2021 1 次提交
-
-
由 Jack Zhou 提交于
* fix rnn grad bug when num_layers is set 2 and dropout_prob is set 0 * add more test for rnn
-
- 08 11月, 2021 2 次提交
-
-
由 Weilong Wu 提交于
Renamed the variable and function Removed the original template function Removed the tests_properties in CMakeLists.txt
-
由 zyfncg 提交于
att,Fix issue:36902
-
- 01 11月, 2021 2 次提交
-
-
由 Liu-xiandong 提交于
* fix cusparse compile bug in CUDA11.2, test=develop * fix bug
-
由 Feng Xing 提交于
-
- 30 10月, 2021 1 次提交
-
-
由 Yiqun Liu 提交于
Cherry-pick #36525
-
- 29 10月, 2021 1 次提交
-
-
由 Wilber 提交于
-