- 14 Nov, 2022 6 commits
-
-
Committed by zhouweiwei2014
* [Zero-Dim] support input 0D Tensor as scalar attribute for some api * fix doc
-
Committed by cyber-pioneer
-
Committed by HongyuJia
-
Committed by LiYuRio
-
Committed by Nyakku Shigure
[CodeStyle][F821] fix undefined variables due to missing imports, misspelled variable names (#47899) * `hann` -> `_hann` * `false` -> `False` * a missing passed argument `reduce_all` * some missing imports * `device_type` -> `heter_device_type` * `PKVClient` -> `KVClient` * fix some typos and missing imports
-
Committed by Nyakku Shigure
-
- 11 Nov, 2022 1 commit
-
-
Committed by czr-gc
* feat(ipu): add model_runtime backend support in IPU. * fix(ipu_executor): fix error message format. * fix(ipu_executor): fix format. * fix(ipu_executor): fix format again. * fix(ipu_executor): fix format again. * fix(ipu_executor): fix format again.
-
- 10 Nov, 2022 11 commits
-
-
Committed by zhangyikun02
-
Committed by YuanRisheng
* standard api * fix sparse bugs * fix xpu bugs, test=kunlun * remove hard code for custom unittest * open ci, test=kunlun * deal with conflict
-
Committed by zhangxin81
* add roformer pass&&plugin(novarlen)
-
Committed by zhaoyingli
-
Committed by caozhou
* add c_concat pass for reshard * add unittest
-
Committed by zhouweiwei2014
-
Committed by james
* XPU support eager mode * add unittest for XPU eager mode * minor bugfix * minor bugfix, test=kunlun * correct copyright info * 1. remove unused vars/funcs 2. ProcessGroupBKCL inherit from ProcessGroupStream * bugfix for fp16 in eager mode multi-card, test=kunlun * rebase & fix a few issues * use new processgroup interface, test=kunlun * fix compile issue, test=kunlun
-
Committed by wenbin
* skip_merge_layernorm * add UT * modify comments
-
Committed by Haohongxiang
-
Committed by Charles-hit
-
Committed by Nyakku Shigure
* [Fix][F821] fix TestUnpoolOpException * fix TestUnpoolOpException * fix TestUnpool3DOpException * remove unused variables * fix the regexp does not match the C++ traceback * add missing error message for gpu unpool_kernel * Revert "add missing error message for gpu unpool_kernel" This reverts commit 17ef7a127e1c3ee00f9102c37ad8cea35953f20c. * assertion indices_value_error errors are only reported on the CPU * for test * run test_exception in dygraph mode
-
- 09 Nov, 2022 7 commits
-
-
Committed by Paulina Gacek
* Analysis API interface for disabling fc passes * Unit tests corrected * Python API added * test runs only when PADDLE_WITH_MKLDNN * Fc op changed to relu in matmul_op_test * Disable fc passes in tests where acc drops * code formatting * Unit test for analysisConf added * Unit test gpu added * fc passes disabled when iterations=0 in gru test * style * passes disabled when fp32 in gru test * fc passes disabled in lstm test * Import from inference, not fluid in doc
-
Committed by Tony Cao
* fix flake8 CodeStyle E266 * fix comments
-
Committed by cyber-pioneer
-
Committed by Nyakku Shigure
* [CodeStyle][py2] remove unnecessary u-prefix in string literal * `"{}".format(x)` -> `x` * remove duplicated dtype literals * revert changes in data_feeder.py * remove u-prefix in data_feeder * revert remove duplicated dtype literals in data_feeder * remove unnecessary convert to str * for test * add some comments * refine comment * restore a removed str conversion * re-trigger all ci, empty commit
-
Committed by LiYuRio
-
Committed by FlyingQianMM
* fix scale kernel configuration error where input numel is 0 * fix code style * add unit test case for scale op when numel of input x is zero * fix ci codestyle check * add cpu and gpu unit test case for scale op when numel of input x is zero * add uninitialized judgment for input of scale
-
Committed by Wangzheee
* upgrade scale and slice op convert for Paddle-TensorRT
-
- 08 Nov, 2022 13 commits
-
-
Committed by JZ-LIANG
[Auto Parallel] Sharding Optimization: Partition Algorithm & Stage2 Parameter Bucket communication (#47180) * partition param by order * add logging * reorder opt * config * stage2 bucket * update unittest
-
Committed by zhouweiwei2014
* [Zero-Dim] support input 0D Tensor for sundry api * fix comment
-
Committed by Sławomir Siwek
* softplus+activation * fc + elementwise_add test refactored * rename MKLDNN to OneDNN * fc+activation tests refactored * remove softplus ut * whitespace * whitespace * codestyle * codestyle * add more cases to fc+act * remove softplus+hard_sigmoid pass * remove softplus + hard_sigmoid UT * add approximate for gelu * swish beta range * new codestyle * reduce number of tests
-
Committed by zhouweiwei2014
* [Zero-Dim] support input 0D Tensor for distribution api * fix comment
-
Committed by zhangyikun02
-
Committed by zhangyikun02
-
Committed by xiongkun
* fix tensor_array slice bugs in _getitem_impl_ * fix when var is a paddle.Tensor * code format
-
Committed by zhoutianzi666
* allow fold fill_constant && allow nms3 into trt in int8 model * use unordered_map * fix CI failing
-
Committed by Nyakku Shigure
* [CodeStyle][py2][U004] unnecessary explicit `object` inheritance in class definition * fix an increment
-
Committed by Paulina Gacek
* Split kernel registered, tests for uint/int added * Split quantized * Split output scales calculated only once * NearestInterp test fix reversed * DequantizeOutputs corrected
-
Committed by tianshuo78520a
* disable distributed xpu tests * test=kunlun * test=document_fix;test=kunlun * test=document_fix;test=kunlun * test=document_fix;test=kunlun * test=document_fix;test=kunlun
-
Committed by Charles-hit
* support pow_double_grad op * add unit test for pow double grad * fix pow double grad * optimize pow double grad kernel * fix pow double grad kernel
-
Committed by zhoutianzi666
* fix cast bug
-
- 07 Nov, 2022 2 commits
-
-
Committed by QingshuChen
* test=kunlun
-
Committed by zhaoyingli
* fp16 pass support assign op * choose assign op exec mode * add unittest * add cmakelist
-