- 07 1月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Cherry-pick] Add the forward QR operator * Add a check for multiplex op * Improve multiplex based on reviews
-
- 06 1月, 2022 3 次提交
-
-
由 Haohongxiang 提交于
* optimize backward (#37055) * update * update * update * modify code style
-
由 Wilber 提交于
Co-authored-by: Nfeng_shuai <fengshuai03@baidu.com>
-
由 jakpiase 提交于
[CHERRY-PICK] Fix for matmul_v2 oneDNN op broadcasting when inputs dims have different lengths (#38733) * fix for matmul_v2 broadcasting * resolved conflicts
-
- 05 1月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 30 12月, 2021 1 次提交
-
-
由 zyfncg 提交于
本PR修复了以下两个tensor切片索引bug: 1.修复Tensor索引赋值调用set_value op出现的显存泄露问题,该问题主要是由Inplace策略的使用不当导致,本PR中已完成修复。 2.修复使用多个None类型索引时结果维度异常的问题
-
- 29 12月, 2021 4 次提交
-
-
由 Guoxia Wang 提交于
-
由 Guoxia Wang 提交于
-
由 Guoxia Wang 提交于
* support data_format='NHWC' for prelu channel mode (#37019) * support data_format='NHWC' for prelu channel mode * fix prelu weight shape for NHWC of static mode (#38310)
-
由 Shang Zhizhou 提交于
-
- 21 12月, 2021 1 次提交
-
-
由 huangxu96 提交于
Add python interface of subgraph: 1. all_sub_graphs() 2. get_sub_graph(idx)
-
- 13 12月, 2021 3 次提交
-
-
由 Shang Zhizhou 提交于
-
由 zhangbo9674 提交于
优化layer.to实现逻辑,相关pr: Remove additional warnning in layer.to ( #36700) Refine param conversion logic in layer.to ( #36862) Fix Layer.to() of device bug ( #37156)
-
由 Wangzheee 提交于
* fix_ele_convert: IElementWiseLayer can broadcast * fix_ele_convert
-
- 10 12月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
fix: when ceil_model==true && Padding_algo!=SAME, (x-size)/stride != int, this convert is wrong (#37929) (#38033) Co-authored-by: Nfeng_shuai <fengshuai03@baidu.com>
-
- 08 12月, 2021 1 次提交
-
-
由 wenbin 提交于
-
- 03 12月, 2021 1 次提交
-
-
由 baoachun 提交于
-
- 29 11月, 2021 1 次提交
-
-
由 wuhuanzhou 提交于
cherry-pick #37536 修复pass_desc.proto在编译时产生依赖问题。
-
- 28 11月, 2021 1 次提交
-
-
由 zmx 提交于
-
- 26 11月, 2021 3 次提交
-
-
由 zyfncg 提交于
slice_grad op在选择kernel过程中出现错误,问题原因是在获取use_mkldnn属性时,map中未找到该键值,所以抛出out_of_range异常 本PR在map获取use_mkldnn属性数据前增加了是否存在该键值的判断逻辑,从而避免出现上述异常
-
由 zhouweiwei2014 提交于
cherry-pick #36714
-
由 zmx 提交于
* fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * [heterps]bug fix for _run_from_dataset * fix heter_server.cc * fix launch_utils.py * fix heter_section_worker.cc * fix. test=develop * fix. test=develop
-
- 25 11月, 2021 2 次提交
-
-
由 zhangkaihuo 提交于
目前的fused_attention_op不支持attn_mask=None的输入,本PR对此进行了补充,并补充了相应的单测逻辑。
-
由 pangyoki 提交于
Cherry-pick PR 37420, fix inplace bug when the first grad_var(loss_grad) is inplace var (#37420) (#37488) fix inplace bug,Cherry pick PR #37420
-
- 24 11月, 2021 1 次提交
-
-
由 Li Min 提交于
Add support for bias is none for fused_attention op.
-
- 23 11月, 2021 7 次提交
-
-
由 lilong12 提交于
-
由 zmx 提交于
* bug fix for DeserializeSelectedRows. test=develop (#36520) * fix SerializeSelectedRows (#36543) * bug fix for DeserializeSelectedRows. test=develop * fix bug for SerializeSelectedRows. test=develop * update. test=develop * [Heterps]Refactor Heter Pipeline Parameter Server (#36845) * change username * fix * fix * fix * fix * fix * update * update * update unittests * fix * update * fix * update * fix * fix * fix * update * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update send_and_recv op. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix unit. notest,test=coverage * fix ut. notest, test=coverage * update. notest,test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix. notest, test=coverage * fix. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * add func. notest, test=coverage * fix ut. notest, test=coverage * fix. test=develop * fix. test=develop * Fix unit test for send_and_recv_cpu & send_and_recv_gpu (#37129) * [heterps]fix ut for heter_pipeline_trainer.cc (#37136) * fix ut. test=develop * fix ut. test=develop * [heterps]bug fix for local training with --heter_worker_num (#37166) * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * [heterps]Refactor heterogenous worker (#37244) * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * refactor heter trainer. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * [heterps]add heterps mode judgement (#37298) * [heterps]change default executor for heter trainer (#37314) * fix pslib. test=develop * add device to train_from_dataset. test=develop * refine fleet.stop_worker. test=develop * fix ut. test=develop * fix ut. test=develop * fix executor & ut. test=develop * fix executor & ut. test=develop * fix executor & ut. test=develop * [heterps]remove api for heter pipeline ps (#37396) * fix api. test=develop * fix api. test=develop * fix code style. test=release/2.2 * fix CMakeLists. test=develop (#37454)
-
由 zhupengyang 提交于
-
由 wangguanqun 提交于
* save/load in ps runtime(the_one_ps) (#36097) * add trainer desc config to distributed strategy * code style modified * data_feed set lod * fix bug * code style * fix bug * save load * save load * save unittest * add unittest of the_one_ps * unittest * add todo in communicator sendsparse * fix bug in save_inference_model (#37362)
-
由 sneaxiy 提交于
* enhance scatter err msg check * fix ci error
-
由 JingZhuangzhuang 提交于
-
由 JingZhuangzhuang 提交于
-
- 22 11月, 2021 2 次提交
-
-
由 Siming Dai 提交于
* Add paddle.incubate.graph_send_recv API * fix bug in CudaAtomicMin and CudaAtomicMax * add empty line
-
由 Li Min 提交于
fix bug to support dropout eval grad computing. cherry-pick #37305.
-
- 17 11月, 2021 4 次提交
-
-
由 Wangzheee 提交于
* fix_qkv_plugin: half_scale * [Paddle-Inference] fix_qkv_plugin: fix half scale
-
由 Wangzheee 提交于
-
由 Wangzheee 提交于
-
由 JingZhuangzhuang 提交于
-
- 16 11月, 2021 2 次提交
-
-
由 zhangkaihuo 提交于
修复了fused_transformer_encoder_layer fine-tune过程发现的一些问题: fused_attention_op添加attn_mask=None的支持:PR pre_layer_norm处理问题:PR 参数处理,计算错误的问题:PR add_bias计算错误问题:PR 添加pure fp16的支持:PR
-
由 zyfncg 提交于
修复了一维Tensor在使用省略号(...)索引时维度检测异常的问题。
-