- 03 8月, 2022 4 次提交
-
-
由 zhiboniu 提交于
* add cmake enforce * transfer multiclass_nms3 to phi
-
由 zhiboniu 提交于
* phi_matrix_nms * remove old kernels and add optest check_eager * reoder args * reoder args in infermate * update * get back legacy dygraph
-
由 sneaxiy 提交于
* add use_hierarchical_allreduce * support hierarchical allreduce for more cases
-
由 zhiboniu 提交于
* phi_edit_distance * fix
-
- 02 8月, 2022 7 次提交
-
-
由 Leo Chen 提交于
-
由 seemingwang 提交于
* move renorm op * change python api * change op class func * alloc data * remove comments * fix grad arguments * fix python argument * fix python argument * change unit-test * remove shape func registration * recover extra-arguments * recover shape functor
-
由 Wilber 提交于
* multihead matmul add fp16 * fix windows error * fix rocm error * fix rocm error
-
由 Yulong Ao 提交于
* [Phi] Move Qr to the Phi * [Phi] Regiter the cpu grad kernel for qr * [Phi] Share the cuda kernels to lstsq * [Phi] Remove some improper inlcude files * [Phi] Modify codes based on the reviews * [Phi] Remove unecessary files and add the cuda_only comment * [Phi] Remove the unecessary include file * [Phi] Remove qr_op.cu and lstsq_op.cu
-
由 houj04 提交于
* [XPU] fp16 for layer_norm op. test=kunlun
-
由 ccrrong 提交于
* add yaml and unittest * update yaml * update backward yaml and unittest * update yaml * add Yolov3LossGradInferMeta * update yolov3_loss_op.cc * fix bug * code format
-
由 mengqingchun02 提交于
* support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun
-
- 01 8月, 2022 7 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
由 zhiboniu 提交于
-
由 Thomas Young 提交于
* update unstack_op * update unstack_op * update unstack_op * fix unstack test * update unstack * update with remote * fix unstack_test.py * temp_save_change_nms_op * add nms test * update nms fix * update unstack_op * temp save change * finish fix nms_op * pass nms test * fix CI * fix ops test * save change * fix code style * fix code style * fix ci and codestyle * fix ci Co-authored-by: NShiningZhang <zhang_liang1991@126.com>
-
由 Jacek Czaja 提交于
* - fix * - another fix * lint
-
由 levi131 提交于
* update code format * add ymal and test * update for comments
-
由 Xiaoxu Chen 提交于
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Add kernel declarations * Copy kernel implementation code * Transfer kernel implementation code * Register new kernels * Remove old kernels * Add lu_unpack_sig * Fix bugs * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * Update include files * lu_impl.h->lu_kernel_impl.h * Transfer infermeta * Add yaml and update python api * Add check_eager Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-
- 30 7月, 2022 1 次提交
-
-
由 zhiboniu 提交于
* phi_prior_box * add float[] support * phi_prior_box_optest * update
-
- 29 7月, 2022 9 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile
-
由 Haohongxiang 提交于
* migrate lstsq op * update * fix bugs for CIs * update * fix bugs * add uts * update * update * update * fix bugs of jip * fix bugs of hip * update * update according to review * update * update * update * update
-
由 QingshuChen 提交于
* add some fp16 op for kunlun resnet50 model *test=kunlun * tmp *test=kunlun
-
由 zhiboniu 提交于
-
由 Jacek Czaja 提交于
* - Unit tests to be debugged - fix - refactor - diagnostic - more diagnostic - fix - Fix number two - fix - fix - fix - alpha added - more fixes - compilation fix - removed diagnostic code - cosmetic fixes * lint
-
由 Leo Chen 提交于
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * lu_impl.h->lu_kernel_impl.h Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-
由 ming1753 提交于
* fused_fc_elementwise_layernorm support fp16 * fused_fc_elementwise_layernorm support double
-
由 houj04 提交于
-
- 28 7月, 2022 10 次提交
-
-
由 HongyuJia 提交于
* test_activation_op unitest error, yaml & activation.py in_dygraph_mode incomplete * fix test_activation_op unitest error, add yaml and dygraph test * fix code style with pre-commit * try to fix namespace error of abs in activation_functor.h * fix namespace error of abs
-
由 Xiaoxu Chen 提交于
* migrate dirichlet op kernel to phi * fix dirichlet sample memory leak
-
由 Haohongxiang 提交于
-
由 Chenxiao Niu 提交于
-
由 Charles-hit 提交于
* Move frame OP to phi、add frame OP yaml config and supplement single test * add Header file of in_dygraph_mode * Modify variable name and FrameGradInferMeta multiplex UnchangedInferMeta * move seq2col to phi
-
由 Charles-hit 提交于
* Move api(lgamma) from legacy_api.yaml to api.yaml * Move api(lgamma) from legacy_api.yaml to api.yaml * Move api(lgamma) from legacy_api.yaml to api.yaml * modify code style * add x to X mapping * add definition of lgamma * delete redundant lgamma definitions * Modify code comments * Modify ops.py code format * add lgamma single test and lgamma api in fluid * Optimized lgamma unittest
-
由 z8hanghuan 提交于
-
由 LiYuRio 提交于
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Fix: Move out_grad to first * Register new kernels * Remove old kernels * Move out_grad to last * Fix bugs * Transfer infermeta * Add yaml files * Add blank line * Fix code style * Optimize directory structure Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-
由 houj04 提交于
* [XPU] add top_k op. test=kunlun * [XPU] add top_k op. test=kunlun * use PADDLE_ENFORCE_XDNN_NOT_NULL to check pointer. test=kunlun
-
- 27 7月, 2022 2 次提交