- 10 Nov, 2022 20 commits
-
-
Committed by Sylwester Fraczek
* migrate prelu * remove cache * review fixes
-
Committed by WangZhen
Get grads types from cpp for adam to speed up
-
Committed by LiYuRio
-
Committed by YuanRisheng
* standard api * fix sparse bugs * fix xpu bugs, test=kunlun * remove hard code for custom unittest * open ci, test=kunlun * deal with conflict
-
Committed by Wang Xin
* remove fluid/framework/generator.h from phi * fix PR-CI-Kunlun-KP-Build fail
-
Committed by PuQing
* rm fluid gpu_launch_config * fix type
-
Committed by zhangxin81
* add roformer pass&&plugin(novarlen)
-
Committed by huangjiyi
[PHI Decoupling] remove "paddle/fluid/platform/float16.h" and "paddle/fluid/platform/for_range.h" in phi. (#47817) * rm "paddle/fluid/platform/float16.h" in phi * rm "paddle/fluid/platform/for_range.h" in phi
-
Committed by huangjiyi
[PHI Decoupling] remove dependency on "paddle/fluid/platform/errors.h" and "paddle/fluid/platform/fast_divmod.h" in phi. (#47815) * rm "paddle/fluid/platform/errors.h" in phi * rm "paddle/fluid/platform/fast_divmod.h" in phi
-
Committed by zhouweiwei2014
-
Committed by huangjiyi
-
Committed by pangyoki
change cudnn error to cuda error if compiled cuda version is incompatible with installed cuda version (#47743) * fix cudnn error * fix * fix * fix
-
Committed by james
* XPU support eager mode * add unittest for XPU eager mode * minor bugfix * minor bugfix, test=kunlun * correct copyright info * 1. remove unused vars/funcs 2. ProcessGroupBKCL inherit from ProcessGroupStream * bugfix for fp16 in eager mode multi-card, test=kunlun * rebase & fix a few issues * use new processgroup interface, test=kunlun * fix compile issue, test=kunlun
-
Committed by wenbin
* skip_merge_layernorm * add UT * modify comments
-
Committed by zyfncg
* add ci check for code-gen script * update
-
Committed by jiangcheng
* fix paddle with cinn cannot link relu op bug * change cmake activation_op to generator_op
-
Committed by RichardWooSJTU
* add fuse_multi_transformer_layer_pass
-
Committed by Charles-hit
-
Committed by Wen Sun
* refactor: send, recv, send_partial, recv_partial * refactor: rm useless const ref
-
Committed by zhangbo9674
-
- 09 Nov, 2022 20 commits
-
-
Committed by huangjiyi
* rm "paddle/fluid/platform/dynload/cudnn.h" in phi * rm "paddle/fluid/platform/dynload/mklml.h" in phi * rm "paddle/fluid/platform/dynload/rocblas.h" in phi * replace "paddle::platform::dynload::" with "phi::dynload::" in phi * revert "blas_impl.cu.h"
-
Committed by HongyuJia
-
Committed by Leo Chen
* remove locked allocator * fix ut * add header file
-
Committed by joanna.wozna.intel
-
Committed by WangZhen
* Get params and grads in cpp to avoid gpu idle time * Using python param instead of cpp return param to fix test_asp_optimize_dynamic.py * Get grads from cpp and construct params_grads on python * Check meta and remove comments
-
Committed by Wang Xin
* remove framework/data_type.h from phi * fix CI fail: map proto::VarType to phi::DataType * refactor code to add more detailed comments
-
Committed by Paulina Gacek
* Analysis API interface for disabling fc passes * Unit tests corrected * Python API added * test runs only when PADDLE_WITH_MKLDNN * Fc op changed to relu in matmul_op_test * Disable fc passes in tests where acc drops * code formatting * Unit test for analysisConf added * Unit test gpu added * fc passes disabled when iterations=0 in gru test * style * passes disabled when fp32 in gru test * fc passes disabled in lstm test * Import from inference, not fluid in doc
-
Committed by Tony Cao
* fix flake8 CodeStyle E266 * fix comments
-
Committed by jakpiase
-
Committed by Sławomir Siwek
-
Committed by Jacek Czaja
* first commit - more fixes - compilation fix - compilation fix - fix - another fix - yet another fix - Fix - fix to fused ops - compilation fix - compilation fix - another compilation fix - another fix - fix - fix - fix - fix - yet another fix - fix - fix - cosmetic fix :- lint - Revert some changes (to be brought back later) - fix to build - Added prototype of slice - fix compilation fix - compilation fix - fix - fix - Fix - fix fix modified: cmake/flags.cmake * lint * rerun of CI * - Fix * - lint * - lint2
-
Committed by huangjiyi
-
Committed by zyfncg
* generate static graph code of some op * polish code * fix bug * update default value
-
Committed by huangjiyi
-
Committed by cyber-pioneer
-
Committed by Chen Weihang
* move fluid op generator into fluid * remove parsed op * resolve sig undef error * append python interp find logic * remove dup code
-
Committed by Wen Sun
-
Committed by LiYuRio
-
Committed by FlyingQianMM
* fix scale kernel configuration error where input numel is 0 * fix code style * add unit test case for scale op when numel of input x is zero * fix ci codestyle check * add cpu and gpu unit test case for scale op when numel of input x is zero * add uninitialized judgment for input of scale
-