- 14 6月, 2023 1 次提交
-
-
由 sneaxiy 提交于
* fix A100 CUDA12 ut * fix ci uts * fix test_sync_batch_norm_op * fix sync bn op ut again by separating 2 files * fix codestyle ci * combine other PRs * fix codestyle * fix codestyle ci
-
- 13 6月, 2023 2 次提交
- 12 6月, 2023 4 次提交
-
-
由 Hui Zhang 提交于
* fix for log xxx * add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward * fix docstring * fix bug * fix bugs * fix bugs * fix bugs * fix bugs * fix bug * using cast * fix test * fix api * fix other bugs * fix ci bug for not using dygraph guard * add bfloat16 test * fix ut * bf16
-
由 YuanRisheng 提交于
* add phi option in cmake * modify default option * fix windows bugs * perfect log * change default mode * change phi so on for py3
-
由 Zhang Jun 提交于
* support bias's rank equal to input's rank
-
由 Nyakku Shigure 提交于
-
- 10 6月, 2023 1 次提交
-
-
由 HongyuJia 提交于
-
- 09 6月, 2023 6 次提交
-
-
由 pangengzheng 提交于
* support add(x_float32, bfloa16_) or add(x_float32, y_float16) * polish * fix test
-
由 lzydev 提交于
* auto generate code for elementwise_max * auto generate code for elementwise_max * fix composite ops * fix bug of fmax
-
由 jiangfan06 提交于
-
由 zhangbo9674 提交于
* refine code * refine code * refine code * refine code * refine code * refine code * refine code * fix bug * refine code * refine code * refine code * refine code * refine code * delete unused code * delete unused code * refine code
-
由 Nyakku Shigure 提交于
* bump ruff to 0.0.271 and update config * exclude third_party * bump ruff to 0.0.272 * refine config
-
由 houj04 提交于
-
- 08 6月, 2023 3 次提交
-
-
由 wz1qqx 提交于
-
由 ykkk2333 提交于
-
由 Yiqun Liu 提交于
* Add outputs to check_numerics_kernel. * Add check_numerics to yaml. * Add API and unittest. * Add check_nan_inf_level as argument of check_numerics_kernel. * Add more unittests. * Fix static API implementation and unittest. * Move the implementation of check_numerics to paddle.amp. * Fix import error.
-
- 07 6月, 2023 2 次提交
-
-
由 Charles-hit 提交于
-
由 RedContritio 提交于
-
- 06 6月, 2023 3 次提交
-
-
由 YuanRisheng 提交于
-
由 houj04 提交于
-
由 Zhang Zheng 提交于
* Fix compilation error by using thrust * fix
-
- 05 6月, 2023 9 次提交
-
-
由 PommesPeter 提交于
* feat: added polygamma init code * feat: added polygamma unittest code * test: added more test cases * refactor: added forward impl * refactor: added backward impl * test: updated cases * refactor: updated test cases * refactor: added more case and fixed some bugs * test: updated ref func * refactor: updated code style * refactor: move the code * refactor: updated test * refactor: updated test * docs: updated en doc Co-authored-by: Nzachary sun <70642955+sunzhongkai588@users.noreply.github.com> * docs: updated math eq --------- Co-authored-by: Nzachary sun <70642955+sunzhongkai588@users.noreply.github.com>
-
由 gouzil 提交于
-
由 wangzhen38 提交于
-
由 houj04 提交于
-
由 umiswing 提交于
-
由 huangjiyi 提交于
Support code generation for op conv2d_transpose, conv3d_transpose, depthwise_conv2d_transpose (#54242)
-
由 huangjiyi 提交于
* update * update eager_gen * update * rm intermediate
-
由 Asthestarsfalll 提交于
* optimize logsumexp in small data scale * fix * fix * add #pragma once * swith to use aligned_vector and support arbitrarily shape * fix store * fix store * refine for special cases * try * fix * update * fix * fix all_reduce * try * fix rocm bug * fix rocm bug * fix rocm bug * fix rocm bug * fix rocm bug * fix rocm bug * fix rocm bug * fix rocm bug
-
由 Chen Weihang 提交于
-
- 03 6月, 2023 2 次提交
-
-
由 RedContritio 提交于
* remove reduce_sum_op.h * support auto generate for static op reduce_sum * remove reduce_sum_op in CMakeLists.txt
-
由 Scotty 提交于
-
- 02 6月, 2023 7 次提交
-
-
由 kangguangli 提交于
* add vector type support for program translator * polish * support basic attribute type * resolve conflicts * add verify for combine/slice and unittests * polish * support more type in attribute translator * modify by reviews * fix merge mistakes * refine code * refine code * add interface * fix: op name normalization * fix typo * refactor input translator * fix merge conflicts * fix op normalizer bug * refactor attribute translator * fix bug * refactor output translator * fix typo * fix * fix approval error * fix coverage * fix op_compat parser * fix merge conflicts * fix merge conflicts * fix merge conflicts * fix merge conflicts * fix merge conflicts * revert some changes --------- Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>
-
由 RedContritio 提交于
-
由 RedContritio 提交于
* decouple reduce_any_op.h and reduce_op.h from reduce_any_op.cc * support auto generate for static op reduce_any
-
由 RedContritio 提交于
-
由 RedContritio 提交于
-
由 YuanRisheng 提交于
* fix mac typeinfo bugs * add file * move code to cc * fix compile bugs
-
由 Difer 提交于
* add fp&bf16 bernoulli * add check_dtype & fix error * fix rocm error
-