- 02 8月, 2022 10 次提交
-
-
由 Weilong Wu 提交于
* [Eager] call final_state_slice under eager mode * rm useless comments * use eager final state instead intermidiate state * update fill_constant yaml * update fill_constant yaml * modify wrapped_infermeta_gen logic to fix special case * fix slice in manipulation * use fill_constant_ * modify slice infermeta * rm final_state_conv2d * use final_state_slice * use final_state_slice only * polish slice, use final state * add paddle_throw for SplitInferMeta * rm fill_constant_ temply * recover array_equal, not allclose * recover original code
-
由 Yulong Ao 提交于
* [Phi] Move Qr to the Phi * [Phi] Regiter the cpu grad kernel for qr * [Phi] Share the cuda kernels to lstsq * [Phi] Remove some improper inlcude files * [Phi] Modify codes based on the reviews * [Phi] Remove unecessary files and add the cuda_only comment * [Phi] Remove the unecessary include file * [Phi] Remove qr_op.cu and lstsq_op.cu
-
由 xiaoguoguo626807 提交于
* manuel_fused_gemm_epilogue
-
由 houj04 提交于
* [XPU] fp16 for layer_norm op. test=kunlun
-
由 WangZhen 提交于
* Fix to_static error when call to_static to convert a method of a common class * raise typerror when class no inherits from layer * Fix @to_static
-
由 xiongkun 提交于
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error * fix logging risk * fix logging possible risk * write trainer_desc file
-
由 Ruibiao Chen 提交于
* Skip inplace for coalesce_tensor_op outputs * Fix typos * Add UTs * Fix typos
-
由 ccrrong 提交于
* add yaml and unittest * update yaml * update backward yaml and unittest * update yaml * add Yolov3LossGradInferMeta * update yolov3_loss_op.cc * fix bug * code format
-
由 kuizhiqing 提交于
-
- 01 8月, 2022 10 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
由 zhiboniu 提交于
-
由 danleifeng 提交于
Co-authored-by: seemingwang <zsasuke@qq.com> Co-authored-by: NDesmonDay <908660116@qq.com> Co-authored-by: Nseemingwang <seemingwang@users.noreply.github.com> Co-authored-by: NThunderbrook <a754913769@163.com> Co-authored-by: Nxuewujiao <105861147+xuewujiao@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-sys-hic-k8s-v100-box-a225-0693.yq01.baidu.com> Co-authored-by: NThunderbrook <52529258+Thunderbrook@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-inf-hic-k8s-a100-ab2-0009.yq01.baidu.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com> Co-authored-by: Nyaoxuefeng <yaoxuefeng@baidu.com> Co-authored-by: Nlxsbupt <luoxsbupt@163.com> Co-authored-by: Nmiaoli06 <106585574+miaoli06@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-inf-hic-k8s-a100-ab2-0008.yq01.baidu.com> Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com> Co-authored-by: Nqingshui <qshuihu@gmail.com> Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
-
由 zhouweiwei2014 提交于
-
由 Roc 提交于
-
由 LiYuRio 提交于
-
由 zhangkaihuo 提交于
* fix test and doc
-
由 Thomas Young 提交于
* update unstack_op * update unstack_op * update unstack_op * fix unstack test * update unstack * update with remote * fix unstack_test.py * temp_save_change_nms_op * add nms test * update nms fix * update unstack_op * temp save change * finish fix nms_op * pass nms test * fix CI * fix ops test * save change * fix code style * fix code style * fix ci and codestyle * fix ci Co-authored-by: NShiningZhang <zhang_liang1991@126.com>
-
由 levi131 提交于
* update code format * add ymal and test * update for comments
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Add kernel declarations * Copy kernel implementation code * Transfer kernel implementation code * Register new kernels * Remove old kernels * Add lu_unpack_sig * Fix bugs * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * Update include files * lu_impl.h->lu_kernel_impl.h * Transfer infermeta * Add yaml and update python api * Add check_eager Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-
- 30 7月, 2022 1 次提交
-
-
由 zhiboniu 提交于
* phi_prior_box * add float[] support * phi_prior_box_optest * update
-
- 29 7月, 2022 18 次提交
-
-
由 Ainavo 提交于
* add paddle.nn.functional.pairwise_distance (cattidea/Paddle#273) * remove the test case for undefined behavior Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 zhangkaihuo 提交于
* add sparse SyncBatchNorm
-
由 Haohongxiang 提交于
* migrate lstsq op * update * fix bugs for CIs * update * fix bugs * add uts * update * update * update * fix bugs of jip * fix bugs of hip * update * update according to review * update * update * update * update
-
由 caozhou 提交于
-
由 QingshuChen 提交于
* add some fp16 op for kunlun resnet50 model *test=kunlun * tmp *test=kunlun
-
由 zhiboniu 提交于
-
由 Aganlengzi 提交于
* add FLAGS_enable_api_kernel_fallback * deal with more cases * add ut for coverage
-
由 thunder95 提交于
* add count_nonzero api * remove grad test
-
由 Wang Bojun 提交于
* add yaml and utests of phi softplus add yaml of softplus fix softplus bug in phi * update utests * bug fix * bug fix for test_layers * layer api match * match def and doc in ops.py * doc polish * fix unwanted modified of thresholded_relu * style imporve
-
由 ccrrong 提交于
* skip cast trt convert when input dtype is bool
-
由 JZ-LIANG 提交于
* fixed bug for pass & engine * fixed bug for benchmark GPT-3 * add tuner & profiler * add algorithms & config
-
由 Leo Chen 提交于
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
-
由 Allen Guo 提交于
-
由 Jiabin Yang 提交于
-
由 fwenguang 提交于
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * lu_impl.h->lu_kernel_impl.h Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-
由 Yulong Ao 提交于
* [Phi] Add yaml for assign_value * [Phi] Fix the bug of the assign api and modify the unittest * [Phi] Fix the bug when the tensor does not have the backend info * [Phi] Replace the functional-style cast init by the brace-init * [Phi] Cast the data explicitly
-
由 houj04 提交于
-
- 28 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-