- 08 11月, 2022 29 次提交
-
-
由 ronnywang 提交于
-
由 JZ-LIANG 提交于
[Auto Parallel] Sharding Optimization:Partition Algorithm & Stage2 Parameter Bucket communication (#47180) * partition param by order * add logging * reorder opt * config * stage2 bucket * update unitest
-
由 Wilber 提交于
-
由 LiYuRio 提交于
-
由 zhouweiwei2014 提交于
* [Zero-Dim] support input 0D Tensor for sundary api * fix comment
-
由 Sławomir Siwek 提交于
* softplus+activation * fc + elementwise_add test refactored * rename MKLDNN to OneDNN * fc+activation tests refactored * remove softplus ut * whitespace * whitespace * codestyle * codestyle * add more cases to fc+act * remove softplus+hard_sigmoid pass * remove softplus + hard_sigmoid UT * add approximate for gelu * swish beta range * new codestyle * reduce number of tests
-
由 zhouweiwei2014 提交于
* [Zero-Dim] support input 0D Tensor for distribution api * fix comment
-
由 zhangyikun02 提交于
-
由 zhangyikun02 提交于
-
由 shentanyue 提交于
-
由 Kaipeng Deng 提交于
-
由 Leo Guo 提交于
-
由 Nyakku Shigure 提交于
-
由 xiongkun 提交于
* fix tensor_array slice bugs in _getitem_impl_ * fix when var is a paddle.Tensor * code format
-
由 ronnywang 提交于
-
由 chalsliu 提交于
-
由 zhoutianzi666 提交于
* allow fold fill_constant && allow nms3 into trt in int8 model * use unordered_map * fix CI failing
-
由 houj04 提交于
* update AUTHOR. test=kunlun * update AUTHOR.
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition * fix an increment
-
由 zyfncg 提交于
-
由 Paulina Gacek 提交于
* Split kernel registered, tests for uint/int added * Split quantized * Split output scales calculated only once * NearestInterp test fix reversed * DequantizeOutputs corrected
-
由 jzhang533 提交于
* removing dependent to fluid/framework/eigen.h in phi * more fix according to PR-CI-Py3 fail
-
由 tianshuo78520a 提交于
* disable distributed xpu tests * test=kunlun * test=document_fix;test=kunlun * test=document_fix;test=kunlun * test=document_fix;test=kunlun * test=document_fix;test=kunlun
-
由 Charles-hit 提交于
* support pow_double_grad op * add unit test for pow double grad * fix pow double grad * optimize pow double grad kernel * fix pow double grad kernel
-
由 zhoutianzi666 提交于
* fix cast bug
-
由 Wang Xin 提交于
-
由 Chen Weihang 提交于
-
由 TeFeng Chen 提交于
-
由 Tomasz Socha 提交于
* Fix undefined symbol: shm_open * Fix for Windows * Exclude APLLE
-
- 07 11月, 2022 11 次提交
-
-
由 Yiqun Liu 提交于
* Define ConvRunner to wrapper the call of cudnn conv functions. * Use ConvKind in SearchAlgorithm.
-
由 Hui Zhang 提交于
* suqeeze2 transpose2 fuse onednn * format * fix output shape * fix conflict * format * format * remove useless * remove log * simply pass * fix comment * fix * fix msg * fix error msg * format
-
由 Wang Xin 提交于
-
由 Leo Chen 提交于
-
由 QingshuChen 提交于
*test=kunlun
-
由 tianshuo78520a 提交于
* test=cuda117 * test=cuda11 * test=document_fix;test=cuda117 * test=document_fix
-
由 Chen Weihang 提交于
-
由 zhaoyingli 提交于
* fp16 pass support assign op * choose assign op exec mode * add unittest * add cmakelist
-
由 pangyoki 提交于
-
由 ykkk2333 提交于
add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun (#47368) * add stat tool * add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun
-
由 wanghuancoder 提交于
* test_paddle_multiprocessing support eager tensor pickler
-