- 25 8月, 2022 6 次提交
-
-
由 hong 提交于
* optimizer conv alog speed * code polish * remove useless code * fix compile error * fix cpu compile error * not use cudnn alog t * add search cache max number * polish code * fix cache test bug * add groups data format to conv args * fix cache test bug * fix cudnn_deterministic bug * fix test switch auto tune bug * fix test swith autotune bug; * fix conv cache bug * fix cache test error * fix cache test bug * fix windows mac compile error * fix workspace search error * update cudnn cache * fix cache test bug; test=develop * fix autotune swith test error * polish code * oplish code
-
由 Rayman 提交于
-
由 handiz 提交于
-
由 JZ-LIANG 提交于
* support high order differential with data parallel overlap * update unitest
-
由 hong 提交于
* add python final state * fix bug * fix bugs * fix bug * fix bug * revert impl, final state mul not support selected rows * fix softmax use cudnn error * add softlable false unitest * revert loss.py
-
由 haosicheng 提交于
-
- 24 8月, 2022 10 次提交
-
-
由 ShenLiang 提交于
* fix utest * fix utest * fix utest * fix log * fix random utest
-
由 wangzhen38 提交于
* [DOC FIX]fix code demo of auc * [doc fix] fix doc of auc
-
由 ronnywang 提交于
-
由 WangZhen 提交于
-
由 mengqingchun02 提交于
* support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun
-
由 WangZhen 提交于
* Adapt minlength attr for bincount
-
由 zhangkaihuo 提交于
-
由 feifei-111 提交于
* fixed1 * fix 2 * fixre * test complext var * delete logic no needed * fix to_tensor_static * code style * del
-
由 ronnywang 提交于
-
由 xiongkun 提交于
* while support for python container. It is convenient to convert more dynamic graph codes into static graphs. * cond support python container
-
- 23 8月, 2022 9 次提交
-
-
由 pangyoki 提交于
-
由 zhaoyingli 提交于
* add quant pass
-
由 LiYuRio 提交于
-
由 JZ-LIANG 提交于
-
由 xiongkun 提交于
-
由 JZ-LIANG 提交于
* bugfix * remove scaling * support rescale_grad opt * add unitest
-
由 pangyoki 提交于
* new executor support program._graph * fix program._program if program is Program()
-
由 YuanRisheng 提交于
* move distribute_fpn_proposals * fix some code * fix yaml bugs * add set dtype * move proposal_impl to funcs * fix compile bugs
-
由 ronnywang 提交于
* [CustomDevice] add profiler apis * migrate CalculateEstOccupancy into cuda_tracer * update * add ut
-
- 22 8月, 2022 5 次提交
-
-
由 joanna.wozna.intel 提交于
* Add int8 support for matmul+elementwiae_add fuse * Corrections after review and ernie test fix
-
由 Sławomir Siwek 提交于
* merge conv_concat_relu to conv_act * fix typo * extend unit test * reuse existing gpd * codestyle * enforce mkldnn conv
-
由 zhoutianzi666 提交于
-
由 wanghuancoder 提交于
some python c api use final state
-
由 Jiabin Yang 提交于
* optimize python api to speed up eager exec * optimize python api to speed up eager exec * optimize python api to speed up eager exec
-
- 20 8月, 2022 2 次提交
-
-
由 wanghuancoder 提交于
* pylayer detach output tensor if it is equal with input * pylayer detach output tensor if it is equal with input
-
由 Sing_chan 提交于
* add max_p without test * add test of max_p * make max_p consistent with paddle.maximum
-
- 19 8月, 2022 8 次提交
-
-
由 pangyoki 提交于
* add forward inplace final state api * fix bug * fix reshape * fix coverage * add inplace info for erfinv, lerp, put_along_axis * fix put_along_axis infer_meta * fix format * update yaml * fix
-
由 xiongkun 提交于
-
由 Wang Bojun 提交于
* fix layernormTrt meanVar alloc bug
-
由 Ruibiao Chen 提交于
* Fix random op depenency and lr_shedule bugs for standalone executor * Fix CI errors * Fix CI errors * Fix CI errors
-
由 Wang Bojun 提交于
* add group_norm dyanmic plugin
-
由 HongyuJia 提交于
* polish default param of XXX_interp_test, the same default value with XXX_interp_np * set default value data_layout=NCHW, cause C++ end treat NCDHW the same way as NCHW
-
由 dongfangshenzhu 提交于
* add merged_momentum *test=kunlun * add merged_momentum *test=kunlun * add fp16 to merged_momentum,*test=kunlun * change dist_model.cc * add merged_momentum unittest and change momentum,test=kunlun * add merged_momentum unittest and change momentum,test=kunlun * add merged_momentum unittest and change momentum,test=kunlun * add merged_momentum unittest and change momentum,test=kunlun
-
由 Nyakku Shigure 提交于
[CodeStyle] use np.testing.assert_allclose instead of self.assertTrue(np.allclose(...)) (part 3) (#45251)
-