- 11 Nov, 2022 2 commits
-
-
Committed by zyfncg
* generate static graph code for some ops by yaml * remove deleted files * update cmake * update cmake * update cmake
-
Committed by Yuanle Liu
-
- 10 Nov, 2022 13 commits
-
-
Committed by Sylwester Fraczek
* migrate prelu * remove cache * review fixes
-
Committed by WangZhen
Get grads types from cpp for adam to speed up
-
Committed by LiYuRio
-
Committed by YuanRisheng
* standard api * fix sparse bugs * fix xpu bugs, test=kunlun * remove hard code for custom unittest * open ci, test=kunlun * deal with conflict
-
Committed by zhangxin81
* add roformer pass&&plugin(novarlen)
-
Committed by james
* XPU support eager mode * add unittest for XPU eager mode * minor bugfix * minor bugfix, test=kunlun * correct copyright info * 1. remove unused vars/funcs 2. ProcessGroupBKCL inherit from ProcessGroupStream * bugfix for fp16 in eager mode multi-card, test=kunlun * rebase & fix a few issues * use new processgroup interface, test=kunlun * fix compile issue, test=kunlun
-
Committed by wenbin
* skip_merge_layernorm * add UT * modify comments
-
Committed by zyfncg
* add ci check for code-gen script * update
-
Committed by jiangcheng
* fix paddle with cinn cannot link relu op bug * change cmake activation_op to generator_op
-
Committed by RichardWooSJTU
* add fuse_multi_transformer_layer_pass
-
Committed by Charles-hit
-
Committed by Wen Sun
* refactor: send, recv, send_partial, recv_partial * refactor: rm useless const ref
-
Committed by zhangbo9674
-
- 09 Nov, 2022 16 commits
-
-
Committed by HongyuJia
-
Committed by Leo Chen
* remove locked allocator * fix ut * add header file
-
Committed by joanna.wozna.intel
-
Committed by WangZhen
* Get params and grads in cpp to avoid gpu idle time * Using python param instead of cpp return param to fix test_asp_optimize_dynamic.py * Get grads from cpp and construct params_grads on python * Check meta and remove comments
-
Committed by Paulina Gacek
* Analysis API interface for disabling fc passes * Unit tests corrected * Python API added * test runs only when PADDLE_WITH_MKLDNN * Fc op changed to relu in matmul_op_test * Disable fc passes in tests where acc drops * code formatting * Unit test for analysisConf added * Unit test gpu added * fc passes disabled when iterations=0 in gru test * style * passes disabled when fp32 in gru test * fc passes disabled in lstm test * Import from inference, not fluid in doc
-
Committed by Tony Cao
* fix flake8 CodeStyle E266 * fix comments
-
Committed by jakpiase
-
Committed by Sławomir Siwek
-
Committed by Jacek Czaja
* first commit - more fixes - compilation fix - compilation fix - fix - another fix - yet another fix - Fix - fix to fused ops - compilation fix - compilation fix - another compilation fix - another fix - fix - fix - fix - fix - yet another fix - fix - fix - cosmetic fix :- lint - Revert some changes (to be brought back later) - fix to build - Added prototype of slice - fix compilation fix - compilation fix - fix - fix - Fix - fix fix modified: cmake/flags.cmake * lint * rerun of CI * - Fix * - lint * - lint2
-
Committed by zyfncg
* generate static graph code of some op * polish code * fix bug * update default value
-
Committed by cyber-pioneer
-
Committed by Chen Weihang
* move fluid op generator into fluid * remove parsed op * resolve sig undef error * append python interp find logic * remove dup code
-
Committed by Wen Sun
-
Committed by LiYuRio
-
Committed by Wangzheee
* upgrade scale and slice op convert for Paddle-TensorRT
-
Committed by wanghuancoder
* refine python call error report
-
- 08 Nov, 2022 9 commits
-
-
Committed by Wilber
-
Committed by LiYuRio
-
Committed by Sławomir Siwek
* softplus+activation * fc + elementwise_add test refactored * rename MKLDNN to OneDNN * fc+activation tests refactored * remove softplus ut * whitespace * whitespace * codestyle * codestyle * add more cases to fc+act * remove softplus+hard_sigmoid pass * remove softplus + hard_sigmoid UT * add approximate for gelu * swish beta range * new codestyle * reduce number of tests
-
Committed by zhangyikun02
-
Committed by zhangyikun02
-
Committed by Kaipeng Deng
-
Committed by ronnywang
-
Committed by zhoutianzi666
* allow fold fill_constant && allow nms3 into trt in int8 model * use unordered_map * fix CI failing
-
Committed by Paulina Gacek
* Split kernel registered, tests for uint/int added * Split quantized * Split output scales calculated only once * NearestInterp test fix reversed * DequantizeOutputs corrected
-