- 19 Oct, 2022 14 commits
-
-
Committed by WangZhen
* Fix recurrent op eager deletion pass error in dy2st
* Polish code
* Refine error message
-
Committed by will-jl944
-
Committed by xiongkun
* [Dy2Static] Remove GradTransformer
  1. fix einsum infershape bugs.
  2. remove grad_transformer and unify paddle.grad and paddle.static.gradient.
  3. add dygraph_and_dy2static_only decorator for dy2static.
* fix bugs
* rename
-
Committed by zlsh80826
-
Committed by Aurelius84
* [Dy2Stat]Polish @to_static temporary file directory
* [Dy2Stat]Polish @to_static temporary file directory
* refine temp.name
* fix typo
* fix typo
-
Committed by Hui Zhang
* cond infer apply exec separate
* fix bugs
* fix as comment
-
Committed by Leo Chen
* clean unused code: piece.cc/h
* clean usage
-
Committed by zlsh80826
-
Committed by zlsh80826
-
Committed by wanghuancoder
-
Committed by zlsh80826
* Add missing fp32 config and reduce the testing combination
* Reduce trt matmul pass test max examples
-
Committed by Li-fAngyU
-
Committed by Nyakku Shigure
* [CodeStyle][py2] fix a decode error caused by 47036
* add a comment
* add a unittest for Block._rename_var
* add test_block_rename_var to static_mode_white_list
-
Committed by Roc
-
- 18 Oct, 2022 23 commits
-
-
Committed by weishengying
-
Committed by YangZhou
* update audio api examples
* fix format
* format
* fix
* test api
* fix format
* fix static check error
* fix doc error
* fix ci
* fix api error
* update api.spec
* fix ci
* fix typo in window gaussian
-
Committed by zhoutianzi666
* Rewrite strided_slice converter using shape tensor
* clean code
-
Committed by zhouweiwei2014
-
Committed by caozhou
* add parallel tuner
* add unittest
* fix unittest
* set timeout of unittest
* set unittest timeout
* fix auto_mode setting
* update unittest
* sync from develop and update unittest
* remove unused import
* update unittest
* update cmakelist
* add unittests
-
Committed by ronnywang
-
Committed by LiYuRio
-
Committed by seemingwang
* add embedding range check
* change head file
* change head file
* fix
-
Committed by liu zhengxi
-
Committed by Wang Bojun
* first version, accuracy corrected
* disable debug print
* use blockReduceSum in phi
* add UT
* add opCompat
* code style
* code refine
* bug fix
* code refine
* test fix
* bugfix
* code style fix
* code style
* code-style
* code-style
* code-style
-
Committed by Sławomir Siwek
* git
* style
* leave default relu in kernel
* style
* cleanup FCMKLDNN pattern
* merge conflicts
* update develop
* update develop
* add const
* rename to oneDNN and adjust attributes
* whitespace
-
Committed by caozhou
* add cost interface
* update interface and add unittest
* update unittest
* update interface
-
Committed by Nyakku Shigure
-
Committed by Hui Zhang
* cond infer apply exec separate
* fix bugs
-
Committed by Wilber
-
Committed by zhouweiwei2014
-
Committed by xiaoxiaohehe001
-
Committed by Nyakku Shigure
* [CodeStyle][py2] remove `compat` module (to_text)
* remove some unnecessary decode
* remove to_text definition and unittest
* Revert "remove to_text definition and unittest"
  This reverts commit a6b69cb8dca8b9b031ce10ea32d1040e7e0dd267.
* remove an assertion
* empty commit
-
Committed by Weilong Wu
-
Committed by houj04
* [XPU] update xpu cmake to 1016. test=kunlun
* fix special case of transpose op. test=kunlun
-
Committed by zyfncg
* support generating code of opmaker for backward op invoke forward op
* support code-gen of opmaker for sparse op
* refine logic of choosing phi kernel
* fix compile bug
* fix code_gen bug
* fix bug
* fix kernel signature code-gen
* fix compile bug of VarType
* fix compile bug of VarType
* fix test_sparse_conv_op
* fix test_sparse_norm_op
-
Committed by HongyuJia
-
Committed by zhaoyingli
* [AutoParallel] add callbacks
* fix unittest
* fix dist_context
* fix engine
* fix cmakelist
* fix unittest's returns
* fix cmakelist
-
- 17 Oct, 2022 3 commits
-
-
Committed by Ghost Screaming
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong.
* Support allow_partial switch, which can be configured in pipeline_configs. If the tensors sent from different hosts are not the same, they shouldn't be sent partially and then concatenated into a whole tensor.
* Change name allow_partial to enable_partial_send_recv.
* Add global variable _enable_partial_send_recv
-
Committed by Ghost Screaming
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong.
* support pure bfloat16
* support bf16 linear
* update PR to pass CI
* tiny fix where_grad_kernel.cu
* Support bfloat16 type for reducer and sharding.
* Fix some bugs.
* Polish code.
* Polish code.
* Add bfloat16 datatype in fill_grad kernels.

Co-authored-by: sneaxiy <sneaxiy@126.com>
-