- 04 9月, 2023 1 次提交
-
-
由 tianhaodongbd 提交于
* add rotate_half in fused_rope * add position_ids in fused_rope * modified examples about fused_rope * add set_device in examples
-
- 28 8月, 2023 1 次提交
-
-
由 kangguangli 提交于
* register set_value in new ir * fix * register set_value_grad * fix * fix * remove debug info * add unittest * fix * fix * fix * fix * fix * resolve comments
-
- 03 8月, 2023 1 次提交
-
-
由 Wang Xin 提交于
-
- 26 7月, 2023 1 次提交
-
-
由 tianhaodongbd 提交于
-
- 24 7月, 2023 1 次提交
-
-
由 JYChen 提交于
* add IndexPutGradInfermeta to fix backward error in static-mode * codestyle
-
- 19 7月, 2023 1 次提交
-
-
由 sneaxiy 提交于
* fix mea seg fault develop * fix bias_grad seg fault
-
- 12 7月, 2023 1 次提交
-
-
由 Wang Xin 提交于
* [clang-tidy] enable readability-container-size-empty check * fix test_custom_kernel Failed * add clang-tid-10 in dockerfile * add clang-tidy in dockerfile * fix bug
-
- 03 7月, 2023 1 次提交
-
-
由 niuliling123 提交于
-
- 29 6月, 2023 1 次提交
-
-
由 niuliling123 提交于
* style * more * update ctest * Update legacy_backward.yaml * Update legacy_ops.yaml * Update legacy_ops.yaml * update * update * update for move
-
- 25 5月, 2023 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 15 5月, 2023 1 次提交
-
-
由 niuliling123 提交于
* update * Update backward.h * Update composite_backward_api.h * Update tensor_utils.cc * Update backward.cc * update * stype * update * add ctest * code stype
-
- 13 4月, 2023 1 次提交
-
-
由 zhangyuqin1998 提交于
-
- 10 4月, 2023 1 次提交
-
-
由 Wang Xin 提交于
* add autogen code support for affine_grid op * update op_compat.yaml for affine_grid * update op_compat.yaml for affine_grid * fix AffineGridGradInferMeta * fix CI error * update AffineGridInferMeta
-
- 09 4月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
-
- 04 4月, 2023 1 次提交
-
-
由 lzydev 提交于
* autogen embedding * deal * fix bug in CompatMetaTensor::share_lod
-
- 30 3月, 2023 1 次提交
-
-
由 Ainavo 提交于
* support auto generate for prelu * op_compat 中增加输入参数 * del attrs ; add kernel data_type * add PreluGradInferMeta
-
- 27 3月, 2023 1 次提交
-
-
由 ZhangDY-6483 提交于
-
- 24 3月, 2023 1 次提交
-
-
由 ZhangDY-6483 提交于
* first version, notest * return final rst, notest * use infinity() instead of max * ut structure * start up of ut * generate lse * update * add depense * reconstruct cmake * move file * add memory efficient attention and fix blasimpl * update * update cmake * add namespace * update cmake * use .cu * update for pad3d * bug fix * bug fix * update * bug fix * update enforce * add test case * merge the lse pad * fix kernel_fn of backward * fix PADDLE_ENFORCE_EQ and phi_api * fix PADDLE_ENFORCE * fix PADDLE_ENFORCE * rerun coverage * fix memory efficient attention test * rerun ci * add cuda version condition * add cuda version condition * delete WIP test * replace PADDLE_ENFORCE * edit the namespace of datatype in multiple.cc * rerun * rerun --------- Co-authored-by: Nliuyuang <liuyuang@baidu.com>
-
- 01 3月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
* flash attn * seed * almost * softmax * fix workspace * add unitest; linux only * fix setup * fix datatype include * fix setup typo * fix def scope * new error api * use paddle fork * fix attr bug; complete ut * update flash hash * fix rng reset * fix offset * fix comments
-
- 09 1月, 2023 1 次提交
-
-
由 HappyHeavyRain 提交于
* support the drop_empty_grad in backward * change code according to yunfei's review suggestion
-
- 01 11月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* standard_api * add hardtanh
-
- 31 10月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* standard api * fix ci bugs * fix ci bugs * fix ce bugs
-
- 10 10月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta * WIP: move rnn kernrl to phi * Change the code generation to avoid converting from intializer list to tuple of heterogeneous types. This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types. * fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results. * fix ci bugs * fix ci bugs * fix ci bugs * modify code according comment Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
-
- 07 9月, 2022 2 次提交
- 05 9月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* [OpAttr]ksize of pool2d support Tensor type * fix unittest * add unittest
-
- 31 8月, 2022 1 次提交
-
-
由 Li Min 提交于
-
- 09 8月, 2022 1 次提交
-
-
由 duanboqiang 提交于
* add margin infer * migrate yaml * modify unittests script
-
- 05 8月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
* move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft * make stft_op use fft from phi/kernels/funcs, clean code
-
- 04 8月, 2022 1 次提交
-
-
由 zhiboniu 提交于
* phi_fill_diagonal_tensor * delete extra lines * update * add legacy api test * rename sig
-
- 03 8月, 2022 4 次提交
-
-
由 zhiboniu 提交于
* phi_fill_diagonal * remove old kernels * update * update attr args * refix * update
-
由 Thomas Young 提交于
* save change * save change by YSL * save change by YSL * change by YSL * test pre commit * Revert "test pre commit" This reverts commit eee5e116331186cc544de871b4a5174a6431f17c. * fix code style * fix ctest * temp save * save change * change by YSL * final change by ysl * fix ci * fix code style * delete unuse code * change by ysl
-
由 wuyefeilin 提交于
-
由 wuyefeilin 提交于
* mv eigvalsh op
-
- 02 8月, 2022 1 次提交
-
-
由 ccrrong 提交于
* add yaml and unittest * update yaml * update backward yaml and unittest * update yaml * add Yolov3LossGradInferMeta * update yolov3_loss_op.cc * fix bug * code format
-
- 01 8月, 2022 3 次提交
-
-
由 Thomas Young 提交于
* update unstack_op * update unstack_op * update unstack_op * fix unstack test * update unstack * update with remote * fix unstack_test.py * temp_save_change_nms_op * add nms test * update nms fix * update unstack_op * temp save change * finish fix nms_op * pass nms test * fix CI * fix ops test * save change * fix code style * fix code style * fix ci and codestyle * fix ci Co-authored-by: NShiningZhang <zhang_liang1991@126.com>
-
由 levi131 提交于
* update code format * add ymal and test * update for comments
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Add kernel declarations * Copy kernel implementation code * Transfer kernel implementation code * Register new kernels * Remove old kernels * Add lu_unpack_sig * Fix bugs * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * Update include files * lu_impl.h->lu_kernel_impl.h * Transfer infermeta * Add yaml and update python api * Add check_eager Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-
- 29 7月, 2022 1 次提交
-
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * lu_impl.h->lu_kernel_impl.h Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-
- 28 7月, 2022 1 次提交
-
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Fix: Move out_grad to first * Register new kernels * Remove old kernels * Move out_grad to last * Fix bugs * Transfer infermeta * Add yaml files * Add blank line * Fix code style * Optimize directory structure Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-