- 10 Nov, 2022, 9 commits
-
Committed by pangyoki
change cudnn error to cuda error if compiled cuda version is incompatible with installed cuda version (#47743) * fix cudnn error * fix * fix * fix
-
Committed by james
* XPU support eager mode * add unittest for XPU eager mode * minor bugfix * minor bugfix, test=kunlun * correct copyright info * 1. remove unused vars/funcs 2. ProcessGroupBKCL inherit from ProcessGroupStream * bugfix for fp16 in eager mode multi-card, test=kunlun * rebase & fix a few issues * use new processgroup interface, test=kunlun * fix compile issue, test=kunlun
-
Committed by wenbin
* skip_merge_layernorm * add UT * modify comments
-
Committed by zyfncg
* add ci check for code-gen script * update
-
Committed by jiangcheng
* fix paddle with cinn cannot link relu op bug * change cmake activation_op to generator_op
-
Committed by RichardWooSJTU
* add fuse_multi_transformer_layer_pass
-
Committed by Charles-hit
-
Committed by Wen Sun
* refactor: send, recv, send_partial, recv_partial * refactor: rm useless const ref
-
Committed by zhangbo9674
-
- 09 Nov, 2022, 23 commits
-
Committed by huangjiyi
* rm "paddle/fluid/platform/dynload/cudnn.h" in phi * rm "paddle/fluid/platform/dynload/mklml.h" in phi * rm "paddle/fluid/platform/dynload/rocblas.h" in phi * replace "paddle::platform::dynload::" with "phi::dynload::" in phi * revert "blas_impl.cu.h"
-
Committed by HongyuJia
-
Committed by Leo Chen
* remove locked allocator * fix ut * add header file
-
Committed by joanna.wozna.intel
-
Committed by WangZhen
* Get params and grads in cpp to avoid gpu idle time * Using python param instead of cpp return param to fix test_asp_optimize_dynamic.py * Get grads from cpp and construct params_grads on python * Check meta and remove comments
-
Committed by Wang Xin
* remove framework/data_type.h from phi * fix CI fail: map proto::VarType to phi::DataType * refactor code to add more detailed comments
-
Committed by Paulina Gacek
* Analysis API interface for disabling fc passes * Unit tests corrected * Python API added * test runs only when PADDLE_WITH_MKLDNN * Fc op changed to relu in matmul_op_test * Disable fc passes in tests where acc drops * code formatting * Unit test for analysisConf added * Unit test gpu added * fc passes disabled when iterations=0 in gru test * style * passes disabled when fp32 in gru test * fc passes disabled in lstm test * Import from inference, not fluid in doc
-
Committed by Tony Cao
* fix flake8 CodeStyle E266 * fix comments
-
Committed by jakpiase
-
Committed by Sławomir Siwek
-
Committed by Jacek Czaja
* first commit - more fixes - compilation fix - compilation fix - fix - another fix - yet another fix - Fix - fix to fused ops - compilation fix - compilation fix - another compilation fix - another fix - fix - fix - fix - fix - yet another fix - fix - fix - cosmetic fix :- lint - Revert some changes (to be brought back later) - fix to build - Added prototype of slice - fix compilation fix - compilation fix - fix - fix - Fix - fix fix modified: cmake/flags.cmake * lint * rerun of CI * - Fix * - lint * - lint2
-
Committed by huangjiyi
-
Committed by zyfncg
* generate static graph code of some op * polish code * fix bug * update default value
-
Committed by huangjiyi
-
Committed by cyber-pioneer
-
Committed by Chen Weihang
* move fluid op generator into fluid * remove parsed op * resolve sig undef error * append python interp find logic * remove dup code
-
Committed by Wen Sun
-
Committed by LiYuRio
-
Committed by FlyingQianMM
* fix scale kernel configuration error where input numel is 0 * fix code style * add unit test case for scale op when numel of input x is zero * fix ci codestyle check * add cpu and gpu unit test case for scale op when numel of input x is zero * add uninitialized judgment for input of scale
-
Committed by Wangzheee
* upgrade scale and slice op convert for Paddle-TensorRT
-
Committed by zhangkaihuo
-
Committed by wanghuancoder
* refine python call error report
-
- 08 Nov, 2022, 8 commits
-
Committed by ronnywang
-
Committed by Wilber
-
Committed by LiYuRio
-
Committed by zhouweiwei2014
* [Zero-Dim] support input 0D Tensor for sundry api * fix comment
-
Committed by Sławomir Siwek
* softplus+activation * fc + elementwise_add test refactored * rename MKLDNN to OneDNN * fc+activation tests refactored * remove softplus ut * whitespace * whitespace * codestyle * codestyle * add more cases to fc+act * remove softplus+hard_sigmoid pass * remove softplus + hard_sigmoid UT * add approximate for gelu * swish beta range * new codestyle * reduce number of tests
-
Committed by zhangyikun02
-
Committed by zhangyikun02
-
Committed by Kaipeng Deng
-