- 13 2月, 2023 1 次提交
-
-
由 chalsliu 提交于
* Support build with gcc12 for CUDA less than 12.0 * fix version message test=document_fix
-
- 07 2月, 2023 1 次提交
-
-
由 zqw_1997 提交于
* 2.4:modify cmake file for cuda11.8 compile * fix small mistake * mistake resolved
-
- 06 2月, 2023 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 02 2月, 2023 1 次提交
-
-
由 zhangkaihuo 提交于
cherry-pick some PR about optimize sparse kernel and fix some bug: #47736 #47703 #47604 #46679 #48439 #49009 #49734
-
- 30 12月, 2022 1 次提交
-
-
由 Chenxiao Niu 提交于
* [MLU] fix compute error of dropout op (#45923) * [MLU] add mergedAdam kernel. (#45965) * [MLU] add int64 support for mlu one_hot_v2 (#46313) * [MLU] fix profiler compile failure (#46208) * [MLU] add barrier_op kernel. (#46417) * [MLU] fluid: add mluop (#46429) * [MLU] add huber_loss kernel. (#46455) * [MLU] add mlu kernel for add_reduce_max_grad (#45651) Co-authored-by: Nliupeiyu <liupeiyu@cambricon.com> * [MLU] add_fluid_mluop_yolo_box (#46573) * [MLU] fix phi::Tensor compile error of mlu. (#46649) * [MLU] add fluid MLUOps prior_box (#46585) * [MLU] fix cmake error (#46772) * [MLU]fix unittest of sync_bn (#46797) * [MLU] add masterparam support for mlu adamw. (#46804) * [MLU] add int64 support for allgather. (#46830) * [MLU] fix compile error & add mlu blacklist function. (#47439) * [MLU] fix softmax_with_cross_entropy failed in 370-X8. * [MLU] fix cncl stuck caused by multiple initializations. * [MLU] fix code style check. Co-authored-by: Nqipengh <huangqipeng@cambricon.com> Co-authored-by: Ncifar10 <41565156+cifar10@users.noreply.github.com> Co-authored-by: Lux et Veritas <1004239791@qq.com> Co-authored-by: Nliupeiyu <liupeiyu@cambricon.com> Co-authored-by: Nronnywang <ronny1996@163.com>
-
- 27 12月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 08 11月, 2022 1 次提交
-
-
由 chalsliu 提交于
-
- 21 10月, 2022 1 次提交
-
-
由 JingZhuangzhuang 提交于
* Add infer prune function * add fusion op
-
- 14 10月, 2022 1 次提交
-
-
由 xiaoxiaohehe001 提交于
-
- 11 10月, 2022 2 次提交
-
-
由 Sławomir Siwek 提交于
* [PHI] Migrate gelu kernels (#45596) * gaussian random * mkldnn to onednn renaming * fix merge conflicts * remove fluid code * onednn renaming * gelu fwd * sort activations * gelu gradient * remove unused macros * merge conflicts * fix merge conflicts * remove extra contraint from gelu op * [PHI] relu6_grad kernel (#46501) * Relu6 * remove fluid handler * add individual kernel signature * coding style * replace bounded_relu with clip * whitespace * code style
-
由 YuanRisheng 提交于
* fix concat bug * fix ci bugs * fix ci bugs
-
- 20 9月, 2022 1 次提交
-
-
由 houj04 提交于
* [XPU] update xdnn activations. (#46246) * [XPU] update xpu cmake. test=kunlun
-
- 19 9月, 2022 1 次提交
-
-
由 zhaocaibei123 提交于
-
- 17 9月, 2022 2 次提交
-
-
由 Yuanle Liu 提交于
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fix gloo compile warning * adapt for nn fl-ps
-
- 14 9月, 2022 1 次提交
-
-
由 JingZhuangzhuang 提交于
-
- 07 9月, 2022 1 次提交
-
-
由 houj04 提交于
* [XPU] update xdnn to 0906. test=kunlun * [XPU] update xdnn to 0907. test=kunlun
-
- 06 9月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* [Phi] fix phi compile option for jetson * Update phi.cmake
-
- 01 9月, 2022 2 次提交
-
-
由 houj04 提交于
-
由 zhangyikun02 提交于
-
- 25 8月, 2022 1 次提交
-
-
由 danleifeng 提交于
* update brpc version;test=develop
-
- 24 8月, 2022 1 次提交
-
-
由 zhangxiaoci 提交于
* update xccl lib * change user name to pass cla check
-
- 19 8月, 2022 1 次提交
-
-
由 houj04 提交于
-
- 15 8月, 2022 2 次提交
-
-
由 zhangyikun02 提交于
-
由 houj04 提交于
* [XPU] add some collective ops. test=kunlun * use XPUOpTestWrapper. test=kunlun * skip kl1 for collective ops. fix typo: deivce -> device. test=kunlun
-
- 10 8月, 2022 1 次提交
-
-
由 z8hanghuan 提交于
* fix bug of adaptive pool2d_grad, *test=kunlun * fix bug of adaptive pool2d_grad, *test=kunlun * fix bug of adaptive pool2d_grad, *test=kunlun
-
- 05 8月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* move mkldnn activation kernel * fix compile bugs * fix compile bugs * deal with conflict * fix compile bugs * fix windows compile bugs * mkldnn unittest fix * change mutable to alloc * fix unittest bugs * modify code according comment
-
- 04 8月, 2022 2 次提交
-
-
由 dongfangshenzhu 提交于
* add merged_momentum *test=kunlun * add merged_momentum *test=kunlun * add fp16 to merged_momentum,*test=kunlun
-
由 jiangcheng 提交于
-
- 03 8月, 2022 1 次提交
-
-
由 Hui Zhang 提交于
* c++ property deserialization * fix for comment * more error info * fix exception info * fix ci * fix compile * fix layer test ci
-
- 02 8月, 2022 1 次提交
-
-
由 zhangyikun02 提交于
-
- 01 8月, 2022 4 次提交
-
-
由 danleifeng 提交于
Co-authored-by: seemingwang <zsasuke@qq.com> Co-authored-by: NDesmonDay <908660116@qq.com> Co-authored-by: Nseemingwang <seemingwang@users.noreply.github.com> Co-authored-by: NThunderbrook <a754913769@163.com> Co-authored-by: Nxuewujiao <105861147+xuewujiao@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-sys-hic-k8s-v100-box-a225-0693.yq01.baidu.com> Co-authored-by: NThunderbrook <52529258+Thunderbrook@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-inf-hic-k8s-a100-ab2-0009.yq01.baidu.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com> Co-authored-by: Nyaoxuefeng <yaoxuefeng@baidu.com> Co-authored-by: Nlxsbupt <luoxsbupt@163.com> Co-authored-by: Nmiaoli06 <106585574+miaoli06@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-inf-hic-k8s-a100-ab2-0008.yq01.baidu.com> Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com> Co-authored-by: Nqingshui <qshuihu@gmail.com> Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
-
由 heliqi 提交于
-
由 Aganlengzi 提交于
-
由 Allen Guo 提交于
* support ninja * fix mkldnn on windows * fix mkldnn on windows up1 * up2 * up3 * fix gflags * BUILD_BYPRODUCTS_OPTION -> BUILD_BYPRODUCTS_ARGS * use CMAKE_COMMAND * up x
-
- 29 7月, 2022 1 次提交
-
-
由 houj04 提交于
-
- 28 7月, 2022 1 次提交
-
-
由 houj04 提交于
* [XPU] add top_k op. test=kunlun * [XPU] add top_k op. test=kunlun * use PADDLE_ENFORCE_XDNN_NOT_NULL to check pointer. test=kunlun
-
- 27 7月, 2022 3 次提交
-
-
由 niuliling123 提交于
* Replaced add_custom_command with add_custom_target in xpu_kp_cmake
-
由 Leo Chen 提交于
-
由 z8hanghuan 提交于
* fix bug of elementwise_add_grad, *test=kunlun * fix bug, *test=kunlun * rm pooling_t, *test=kunlun * fix bug of ew_add_grad when inplace, *test=kunlun
-