- 05 8月, 2022 5 次提交
-
-
由 joanna.wozna.intel 提交于
-
由 duanboqiang 提交于
* migrate kernel * fix sig order * remove header files * remove header * remove header * modify logits grad
-
由 carryyu 提交于
* add fused_multi_transformer post_layer_norm * add test post_layer_norm
-
由 Zhang Jun 提交于
* update trt workspace size param * update * update * update * use int64_t * use int64_t * upate * update
-
由 Feiyu Chan 提交于
* move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft * make stft_op use fft from phi/kernels/funcs, clean code
-
- 04 8月, 2022 7 次提交
-
-
由 Sławomir Siwek 提交于
* Add unit tests * matmul_v2 + activation * matmuls + elementwise_add * matmul_v2 postops * transform matmul to v2 * opcompat * fix fusing matmul with multipe outs * add shape constraints * remove unused vars * change pass order * - Unit tests to be debugged - fix - refactor - diagnostic - more diagnostic - fix - Fix number two - fix - fix - fix - alpha added - more fixes - compilation fix - removed diagnostic code - cosmetic fixes * lint * add alpha constraint * merge matmul refactor * trigger CI * - fix * - another fix * code style * add support for matmul+elementwise_add+activation * code style * fix bfloat16 bugs * change append_binary to append_sum Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
-
由 jakpiase 提交于
-
由 dongfangshenzhu 提交于
* add merged_momentum *test=kunlun * add merged_momentum *test=kunlun * add fp16 to merged_momentum,*test=kunlun
-
由 zhiboniu 提交于
* phi_fill_diagonal_tensor * delete extra lines * update * add legacy api test * rename sig
-
由 zhiboniu 提交于
* phi_generate_proposals_v2 * remove old kernels * optest add eager_check * del lod * update * update * update test_detection with_lod * update nms_util * remove old nms_util.h
-
由 xiaoting 提交于
* fix conflicts * mv unused file * revert backward.h * revert lu_unpack kernel * rm .cu file * Update lu_unpack_kernel.cc * format phi yaml
-
由 sneaxiy 提交于
-
- 03 8月, 2022 14 次提交
-
-
由 Guanghua Yu 提交于
-
由 zhiboniu 提交于
* phi_fill_diagonal * remove old kernels * update * update attr args * refix * update
-
由 z8hanghuan 提交于
* add sequence_unpad for xpu,*test=kunlun * add sequence_unpad, *test=kunlun * fix bug in testcase,should not be sequence_pad,*test=kunlun
-
由 Thomas Young 提交于
* save change * save change by YSL * save change by YSL * change by YSL * test pre commit * Revert "test pre commit" This reverts commit eee5e116331186cc544de871b4a5174a6431f17c. * fix code style * fix ctest * temp save * save change * change by YSL * final change by ysl * fix ci * fix code style * delete unuse code * change by ysl
-
由 seemingwang 提交于
* move repeat interleave * fix api name * recover op registration * fix arguments order * fix * fix infermeta * fix infermeta * fix header * fix infermeta * fix * fix * fix dtype * log&test * test * remove logs * fix * remove logs * combine files * combine * combine files * fix cuda place
-
由 fwenguang 提交于
-
由 wangzhen38 提交于
* fix_auc in instag task * fix_auc in instag task * fix_auc in instag task * fix opmaker error * optmize default value of ins_tag * fix multiary * fix multiary * reback ctr_metric_bundle * reback ctr_metric_bundle * for ci auc_test * code style * code style fix * suit paddle * suit paddle * update code example * complete Auc unittest * set asdispensable * set optional * compatable and code style * fix cu * fix cu * for ci win * update unittests of auc * update code doc by reviews * change code doc by reviews
-
由 wuyefeilin 提交于
* mv kernel * mv infershape * mv yaml * update some * maintain decoe_jpeg in old dygraph * fix as review * rm decode_jpeg_op.cu * update for rocm
-
由 wuyefeilin 提交于
-
由 wuyefeilin 提交于
* mv eigvalsh op
-
由 zhiboniu 提交于
* add cmake enforce * transfer multiclass_nms3 to phi
-
由 zhiboniu 提交于
* phi_matrix_nms * remove old kernels and add optest check_eager * reoder args * reoder args in infermate * update * get back legacy dygraph
-
由 sneaxiy 提交于
* add use_hierarchical_allreduce * support hierarchical allreduce for more cases
-
由 zhiboniu 提交于
* phi_edit_distance * fix
-
- 02 8月, 2022 7 次提交
-
-
由 Leo Chen 提交于
-
由 seemingwang 提交于
* move renorm op * change python api * change op class func * alloc data * remove comments * fix grad arguments * fix python argument * fix python argument * change unit-test * remove shape func registration * recover extra-arguments * recover shape functor
-
由 Wilber 提交于
* multihead matmul add fp16 * fix windows error * fix rocm error * fix rocm error
-
由 Yulong Ao 提交于
* [Phi] Move Qr to the Phi * [Phi] Regiter the cpu grad kernel for qr * [Phi] Share the cuda kernels to lstsq * [Phi] Remove some improper inlcude files * [Phi] Modify codes based on the reviews * [Phi] Remove unecessary files and add the cuda_only comment * [Phi] Remove the unecessary include file * [Phi] Remove qr_op.cu and lstsq_op.cu
-
由 houj04 提交于
* [XPU] fp16 for layer_norm op. test=kunlun
-
由 ccrrong 提交于
* add yaml and unittest * update yaml * update backward yaml and unittest * update yaml * add Yolov3LossGradInferMeta * update yolov3_loss_op.cc * fix bug * code format
-
由 mengqingchun02 提交于
* support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun
-
- 01 8月, 2022 7 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
由 zhiboniu 提交于
-
由 Thomas Young 提交于
* update unstack_op * update unstack_op * update unstack_op * fix unstack test * update unstack * update with remote * fix unstack_test.py * temp_save_change_nms_op * add nms test * update nms fix * update unstack_op * temp save change * finish fix nms_op * pass nms test * fix CI * fix ops test * save change * fix code style * fix code style * fix ci and codestyle * fix ci Co-authored-by: NShiningZhang <zhang_liang1991@126.com>
-
由 Jacek Czaja 提交于
* - fix * - another fix * lint
-
由 levi131 提交于
* update code format * add ymal and test * update for comments
-
由 Xiaoxu Chen 提交于
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Add kernel declarations * Copy kernel implementation code * Transfer kernel implementation code * Register new kernels * Remove old kernels * Add lu_unpack_sig * Fix bugs * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * Update include files * lu_impl.h->lu_kernel_impl.h * Transfer infermeta * Add yaml and update python api * Add check_eager Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-