- 05 6月, 2023 3 次提交
-
-
由 PommesPeter 提交于
* feat: added polygamma init code * feat: added polygamma unittest code * test: added more test cases * refactor: added forward impl * refactor: added backward impl * test: updated cases * refactor: updated test cases * refactor: added more case and fixed some bugs * test: updated ref func * refactor: updated code style * refactor: move the code * refactor: updated test * refactor: updated test * docs: updated en doc Co-authored-by: Nzachary sun <70642955+sunzhongkai588@users.noreply.github.com> * docs: updated math eq --------- Co-authored-by: Nzachary sun <70642955+sunzhongkai588@users.noreply.github.com>
-
由 wangzhen38 提交于
-
由 Asthestarsfalll 提交于
* optimize logsumexp in small data scale * fix * fix * add #pragma once * swith to use aligned_vector and support arbitrarily shape * fix store * fix store * refine for special cases * try * fix * update * fix * fix all_reduce * try * fix rocm bug * fix rocm bug * fix rocm bug * fix rocm bug * fix rocm bug * fix rocm bug * fix rocm bug * fix rocm bug
-
- 02 6月, 2023 2 次提交
-
-
由 Difer 提交于
* add fp&bf16 bernoulli * add check_dtype & fix error * fix rocm error
-
由 Zhang Ting 提交于
* support master_grad for adam and momentum Co-authored-by: zhangting_2017@163.com <zhangting2020>
-
- 30 5月, 2023 1 次提交
-
-
由 Yiqun Liu 提交于
* Reimplement the check_nan_inf function as check_numerics kernel. * Remove the cpu implemention to phi. * Add ifdef for the including of omp.h. * Move the use of FLAGS_check_nan_inf_level out of header file. * Implement a common PrintAndThrowError function. * Fix the error using of __NVCC__, which should be instead with __CUDA_ARCH__. * Add dependency of phi. * Polish codes and unittest.
-
- 26 5月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* create phi so * fix ci bugs * fix py3 bugs * add file * fix py3 bugs * fix windows bugs * perfect so * fix py3 bugs * delete all static target in phi * fix windows bugs * fix py3 bugs * fix ci bugs * fix windows bugs * fix bugs: gflags can't be linked by dynamic and static lib * fix bugs that can not load 3rd party * fix ci bugs * fix compile bugs * fix py3 bugs * fix conflict * fix xpu bugs * fix mac compile bugs * fix psgpu bugs * fix inference failed * deal with conflict * fix LIBRARY_PATH bug * fix windows bugs * fix onednn error * fix windows compile bugs * fix windows compile bugs * fix test_cuda_graph_static_mode_error aborted * fix windows bugs * fix mac-python3 error * fix hip compile bugs * change mode to static * change to static mode * fix ci bugs * fix py3 bugs * fix windows bugs * fix bugs * add static flag * add PADDLE_API * change position of PADDLE_API * fix windows bugs * change mode to dynamic lib * fix windows static bugs * deal with conflict * fix windows unit bug * fix coverage * deal with conflict * fix windows-inference * fix py3 bugs * fix bugs when compile type_info * fix compile bugs * fix py3 bugs * fix windows bugs * fix windows openblas * fix xpu bugs * fix enforce_test in windows * update code according comment * fix windows cmake bug * fix windows bugs * fix windows bugs * delete cinn unittest * fix cinn bugs --------- Co-authored-by: lzydev <1528794076@qq.com>
-
- 25 5月, 2023 1 次提交
-
-
由 zhangkaihuo 提交于
-
- 24 5月, 2023 1 次提交
-
-
由 Winters Montagne 提交于
Removed unnecessary header files introduced
-
- 23 5月, 2023 2 次提交
-
-
由 Zhang Zheng 提交于
* [AMP OP&Test] Support float16 in selu * fix
-
由 cyberslack_lee 提交于
-
- 22 5月, 2023 1 次提交
-
-
由 Tian Zheng 提交于
* Add GPU kernel for multiclass_nms3 op * Make multiclass_nms3 gpu kernel output consistent with cpu kernel * Fix API incompatibility * Fix unittests on builds without CUDA * Fix ROCM build * Remove fluid headers; Use default atol for unittest * Change function and variable naming * Add comments; Reduce redundant code * Use paddle test framework
-
- 19 5月, 2023 3 次提交
-
-
由 Galaxy1458 提交于
-
由 Galaxy1458 提交于
-
由 Danyang Zhang 提交于
* delete bf16 of cross entropy * delete bf16 of cross entropy
-
- 18 5月, 2023 3 次提交
-
-
由 Charles-hit 提交于
* add meshgrid,expand_as, prod and grad bf16 kernel * fix bf16 for optest * modify code style * fix amp test
-
由 co63oc 提交于
-
由 LoneRanger 提交于
-
- 17 5月, 2023 1 次提交
-
-
由 LyndonKong 提交于
* Add i1 and i1e op * resolve merge conflicts
-
- 16 5月, 2023 5 次提交
-
-
由 co63oc 提交于
-
由 Difer 提交于
* add bf16 for mode * remove random seed 666 * try to fix op_type error * test for me * try to fix op_type * fix redundancy code * add fp,bf for lastdim * fix some error * simplify code * fix shape error * optype error * fix skipif bf16
-
由 Winters Montagne 提交于
* modify lerp_kernel.cu * pre-commit * fix some CI issues * fix some CI issues * fix some CI issues * fix some CI issues * fix some CI issues * fix some CI issues * fix some CI issues * fix some CI issues * Add files via upload fix some CI issues
-
由 huangjiyi 提交于
* update * fix bug * test * test * update * update mutable_data * fix bug * update * fix bug * update output type reg * update * update
-
由 gouzil 提交于
* [phi]mv StftKernel to phi * [phi] fix KernelSignature * [phi]fix arr error * [phi] Disable check_dygraph * [phi]fix include * [phi] rewrite mutable_data, add output register * [phi] fix Alloc * [phi] fix Alloc again * [phi] fix mutable_data * [phi] fix onesided_out Resize
-
- 15 5月, 2023 3 次提交
-
-
由 zhouweiwei2014 提交于
-
由 niuliling123 提交于
* update * Update backward.h * Update composite_backward_api.h * Update tensor_utils.cc * Update backward.cc * update * stype * update * add ctest * code stype
-
由 zhangyuqin1998 提交于
* move OneHotRawKernel to legacy * fix
-
- 12 5月, 2023 6 次提交
-
-
由 PommesPeter 提交于
* added base code for i0 and i0e * added grad base code for i0 and i0e * added i0 and i0e python code * added ops and backward yaml config * added i0 and i0e cpu kernel, but not test. * added i0 and i0e code and unitest files * added test files * added i0/i0e gpu implementation code * updated code style * updated code style * fixed unitests code * updated i0 with eigen3 * fixed bug and added more test cases * refactor: fixed static graph bug * refactor: removed i0 and i0e from op_compat * refactor: updated code style * refactor: updated op_compat.yaml * refactor: updated op_compat.yaml * refactor: fixed op name mapping and optimize unittest case * refactor: manually implement i0 / i0e * refactor: added grad kernel for i0 / i0e,didn't finish * Update math.py * refactor: added equation to doc in English and added comments for computing i0 / i0e gradient * refactor: removed eigen implementation * refactor: finished i0 / i0e cpu and gpu op * refactor: updated code style * fix: find a bug but not fix * fix: incorrect unittest cases * update: updated code style and remove my file * update: updated unittest case * fix: fixed sign error * fix: fixed mistakes when merging * refactor: updated code style * refactor: remove unused code * refactor: updated code style
-
由 Leo Chen 提交于
-
由 xiaoguoguo626807 提交于
* add rules * modify silu_grad input * modify kernel signature * modify kernel signature * code style * review
-
由 Wang Xin 提交于
* sequence_mask functionalization * fix sequence_mask test
-
由 huangjiyi 提交于
* update * update
-
由 傅剑寒 提交于
-
- 10 5月, 2023 4 次提交
-
-
由 傅剑寒 提交于
* add index_put api * fix value broadcast in backward and add test case in static * add timeout=120s for index_put * add op_compat for index_put * add inplace index_put test * add test case when index tensor in indices is int32 when indices.size less than x.dims * add index_put api backward in cpu place * add backward test case * refactor code to delete some duplicated code * replace reshape with resize for decrease extra memcpy * add datatype flag in backward yaml * fix bug in documentation * Update python/paddle/tensor/manipulation.py --------- Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
-
由 co63oc 提交于
-
由 Yiqun Liu 提交于
-
- 09 5月, 2023 1 次提交
-
-
由 Zhan Rongrui 提交于
-
- 08 5月, 2023 2 次提交
-
-
由 GGBond8488 提交于
* add complex support for optest * add complex grad test * append one * move some debug info * move some debug info * move some debug info * move some debug info * add more complex test * Fix naming ambiguity * Revert "add more complex test" This reverts commit dbcb0516b8e53ba42e2d6089878a39b395345969. * change backward gradient, add TODO
-
由 Zhang Zheng 提交于
-