- 09 Nov, 2022 13 commits
-
-
Committed by WangZhen
* Get params and grads in cpp to avoid gpu idle time
* Using python param instead of cpp return param to fix test_asp_optimize_dynamic.py
* Get grads from cpp and construct params_grads on python
* Check meta and remove comments
-
Committed by Paulina Gacek
* Analysis API interface for disabling fc passes
* Unit tests corrected
* Python API added
* test runs only when PADDLE_WITH_MKLDNN
* Fc op changed to relu in matmul_op_test
* Disable fc passes in tests where acc drops
* code formatting
* Unit test for analysisConf added
* Unit test gpu added
* fc passes disabled when iterations=0 in gru test
* style
* passes disabled when fp32 in gru test
* fc passes disabled in lstm test
* Import from inference, not fluid in doc
-
Committed by Tony Cao
* fix flake8 CodeStyle E266
* fix comments
-
Committed by jakpiase
-
Committed by Sławomir Siwek
-
Committed by Jacek Czaja
* first commit - more fixes - compilation fix - compilation fix - fix - another fix - yet another fix - Fix - fix to fused ops - compilation fix - compilation fix - another compilation fix - another fix - fix - fix - fix - fix - yet another fix - fix - fix - cosmetic fix :- lint - Revert some changes (to be brought back later) - fix to build - Added prototype of slice - fix compilation fix - compilation fix - fix - fix - Fix - fix fix modified: cmake/flags.cmake
* lint
* rerun of CI
* - Fix
* - lint
* - lint2
-
Committed by zyfncg
* generate static graph code of some op
* polish code
* fix bug
* update default value
-
Committed by cyber-pioneer
-
Committed by Chen Weihang
* move fluid op generator into fluid
* remove parsed op
* resolve sig undef error
* append python interp find logic
* remove dup code
-
Committed by Wen Sun
-
Committed by LiYuRio
-
Committed by Wangzheee
* upgrade scale and slice op convert for Paddle-TensorRT
-
Committed by wanghuancoder
* refine python call error report
-
- 08 Nov, 2022 14 commits
-
-
Committed by Wilber
-
Committed by LiYuRio
-
Committed by Sławomir Siwek
* softplus+activation
* fc + elementwise_add test refactored
* rename MKLDNN to OneDNN
* fc+activation tests refactored
* remove softplus ut
* whitespace
* whitespace
* codestyle
* codestyle
* add more cases to fc+act
* remove softplus+hard_sigmoid pass
* remove softplus + hard_sigmoid UT
* add approximate for gelu
* swish beta range
* new codestyle
* reduce number of tests
-
Committed by zhangyikun02
-
Committed by zhangyikun02
-
Committed by Kaipeng Deng
-
Committed by ronnywang
-
Committed by zhoutianzi666
* allow fold fill_constant && allow nms3 into trt in int8 model
* use unordered_map
* fix CI failing
-
Committed by Paulina Gacek
* Split kernel registered, tests for uint/int added
* Split quantized
* Split output scales calculated only once
* NearestInterp test fix reversed
* DequantizeOutputs corrected
-
Committed by jzhang533
* remove dependency on fluid/framework/eigen.h in phi
* more fixes according to PR-CI-Py3 failure
-
Committed by Charles-hit
* support pow_double_grad op
* add unit test for pow double grad
* fix pow double grad
* optimize pow double grad kernel
* fix pow double grad kernel
-
Committed by zhoutianzi666
* fix cast bug
-
Committed by TeFeng Chen
-
Committed by Tomasz Socha
* Fix undefined symbol: shm_open
* Fix for Windows
* Exclude APPLE
-
- 07 Nov, 2022 13 commits
-
-
Committed by Hui Zhang
* squeeze2 transpose2 fuse onednn
* format
* fix output shape
* fix conflict
* format
* format
* remove useless
* remove log
* simply pass
* fix comment
* fix
* fix msg
* fix error msg
* format
-
Committed by Wang Xin
-
Committed by QingshuChen
* test=kunlun
-
Committed by ykkk2333
add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun (#47368)
* add stat tool
* add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun
-
Committed by wanghuancoder
* refine python lib link
-
Committed by Yuanle Liu
-
Committed by ronnywang
-
Committed by WangZhen
* Get three grad lists in CPP to avoid gpu idle time
* Support legacy mode
-
Committed by HongyuJia
* move cudnn hardcode outside GetExpectedKernelType
* add header file
* debug
* update interpreter_util with hardcode
* update interpreter_util headerfile
* solve activation hardcode
* debug with CI
* add mkldnn_op_list header file
* temporarily uncomment mkldnn
* temporarily uncomment mkldnn
* delete sequence_softmax cudnn hardcode
* add hardcode to data_transfer.cc
* update data_transfer headerfile
* try fix segment fault
* update cudnn&miopen_helper
* reset HasAttr of DygraphExctnCtx
* debug, this commit should pass all CI
* debug should pass CI, temporarily disable activation
* debug should pass CI
* fix default_attr=nullptr bug
* clean debug code
* Call SetDnnFallback function in the base class
* activation fallback to plain kernel
* fix default GetExpectedKernelType find wrong kernel
* search cudnn kernel instead of fallback
* fix cudnn_handle bug
* remove tanh use_cudnn
* restore tanh use_cudnn
* debug tanh
* fix tanh bug
* delete activation cudnn kernel
* polish code
-
Committed by Qi Li
-
Committed by Wen Sun
-
Committed by Sławomir Siwek
* init changes
* bnorm
* method signature
* change order
* bnorm
* removed unused args
-
Committed by Sławomir Siwek
* remove fwd funcs
* migrate conv grads
-