- 25 11月, 2021 2 次提交
-
-
由 zhangkaihuo 提交于
目前的fused_attention_op不支持attn_mask=None的输入,本PR对此进行了补充,并补充了相应的单测逻辑。
-
由 pangyoki 提交于
Cherry-pick PR 37420, fix inplace bug when the first grad_var(loss_grad) is inplace var (#37420) (#37488) fix inplace bug,Cherry pick PR #37420
-
- 24 11月, 2021 1 次提交
-
-
由 Li Min 提交于
Add support for bias is none for fused_attention op.
-
- 23 11月, 2021 9 次提交
-
-
由 lilong12 提交于
-
由 zmx 提交于
* bug fix for DeserializeSelectedRows. test=develop (#36520) * fix SerializeSelectedRows (#36543) * bug fix for DeserializeSelectedRows. test=develop * fix bug for SerializeSelectedRows. test=develop * update. test=develop * [Heterps]Refactor Heter Pipeline Parameter Server (#36845) * change username * fix * fix * fix * fix * fix * update * update * update unittests * fix * update * fix * update * fix * fix * fix * update * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update send_and_recv op. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix unit. notest,test=coverage * fix ut. notest, test=coverage * update. notest,test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix. notest, test=coverage * fix. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * add func. notest, test=coverage * fix ut. notest, test=coverage * fix. test=develop * fix. test=develop * Fix unit test for send_and_recv_cpu & send_and_recv_gpu (#37129) * [heterps]fix ut for heter_pipeline_trainer.cc (#37136) * fix ut. test=develop * fix ut. test=develop * [heterps]bug fix for local training with --heter_worker_num (#37166) * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * [heterps]Refactor heterogenous worker (#37244) * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * refactor heter trainer. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * [heterps]add heterps mode judgement (#37298) * [heterps]change default executor for heter trainer (#37314) * fix pslib. test=develop * add device to train_from_dataset. test=develop * refine fleet.stop_worker. test=develop * fix ut. test=develop * fix ut. test=develop * fix executor & ut. test=develop * fix executor & ut. test=develop * fix executor & ut. test=develop * [heterps]remove api for heter pipeline ps (#37396) * fix api. test=develop * fix api. test=develop * fix code style. test=release/2.2 * fix CMakeLists. test=develop (#37454)
-
由 zhupengyang 提交于
-
由 wangguanqun 提交于
* save/load in ps runtime(the_one_ps) (#36097) * add trainer desc config to distributed strategy * code style modified * data_feed set lod * fix bug * code style * fix bug * save load * save load * save unittest * add unittest of the_one_ps * unittest * add todo in communicator sendsparse * fix bug in save_inference_model (#37362)
-
由 sneaxiy 提交于
* enhance scatter err msg check * fix ci error
-
由 0x45f 提交于
[Dy2stat]Allow users to switch eval/train mode when using @to_static to decorate a function (#37383) (#37432) 本PR之前使用@to_static装饰一个单独的function时,对于生成的Program无法切换train/eval模式,只能运行在train模式下。这也就导致动转静后用户多次调用function显存会一直增长。 本PR之后,使用@to_static装饰一个单独的function时,可以通过function.train()或者function.eval()的方式来切换train/eval模式。
-
由 Wilber 提交于
-
由 JingZhuangzhuang 提交于
-
由 JingZhuangzhuang 提交于
-
- 22 11月, 2021 3 次提交
-
-
由 ceci3 提交于
* fix a quantization bug Co-authored-by: NXGZhang <46363693+XGZhang11@users.noreply.github.com>
-
由 Siming Dai 提交于
* Add paddle.incubate.graph_send_recv API * fix bug in CudaAtomicMin and CudaAtomicMax * add empty line
-
由 Li Min 提交于
fix bug to support dropout eval grad computing. cherry-pick #37305.
-
- 19 11月, 2021 3 次提交
-
-
由 0x45f 提交于
该PR使得动转静模块能够正确转换如下的for i in [1, 2, 3]语句。
-
由 0x45f 提交于
set net.forward to original forward function in flops when net is a dy2stat model.
-
由 Liu-xiandong 提交于
* fix cusparse compile bug in CUDA11.2, test=develop * modify sparse_attention docs, test=document_fix (#36554) * modify sparse_attention docs, test=develop * add warning * add warning ,test=document_fix
-
- 17 11月, 2021 4 次提交
-
-
由 Wangzheee 提交于
* fix_qkv_plugin: half_scale * [Paddle-Inference] fix_qkv_plugin: fix half scale
-
由 Wangzheee 提交于
-
由 Wangzheee 提交于
-
由 JingZhuangzhuang 提交于
-
- 16 11月, 2021 3 次提交
-
-
由 zhangkaihuo 提交于
修复了fused_transformer_encoder_layer fine-tune过程发现的一些问题: fused_attention_op添加attn_mask=None的支持:PR pre_layer_norm处理问题:PR 参数处理,计算错误的问题:PR add_bias计算错误问题:PR 添加pure fp16的支持:PR
-
由 zyfncg 提交于
修复了一维Tensor在使用省略号(...)索引时维度检测异常的问题。
-
由 石晓伟 提交于
Co-authored-by: NPei Yang <peiyang@baidu.com>
-
- 15 11月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
* add mlperf optimization PRs * update
-
- 10 11月, 2021 1 次提交
-
-
由 Jack Zhou 提交于
* fix rnn grad bug when num_layers is set 2 and dropout_prob is set 0 * add more test for rnn
-
- 08 11月, 2021 2 次提交
-
-
由 Weilong Wu 提交于
Renamed the variable and function Removed the original template function Removed the tests_properties in CMakeLists.txt
-
由 zyfncg 提交于
att,Fix issue:36902
-
- 01 11月, 2021 2 次提交
-
-
由 Liu-xiandong 提交于
* fix cusparse compile bug in CUDA11.2, test=develop * fix bug
-
由 Feng Xing 提交于
-
- 30 10月, 2021 1 次提交
-
-
由 Yiqun Liu 提交于
Cherry-pick #36525
-
- 29 10月, 2021 2 次提交
-
-
由 Wilber 提交于
-
由 Feiyu Chan 提交于
2. add complex data type support for paddle.shape at graph assembly.
-
- 28 10月, 2021 6 次提交
-
-
由 0x45f 提交于
-
由 pangyoki 提交于
Cherry-pick PR #36511
-
由 zhaoyingli 提交于
-
由 Ligoml 提交于
* fix device docs;test=document_fix * update __init__.py
-
由 pangyoki 提交于
* add paddle.version.cuda and paddle.version.cudnn API * fix little bug * fix bug * add doc string * fix mkdir error * fix windows path * fix new paddle/version path * fix unittest * fix format
-
由 XGZhang 提交于
-