- 03 3月, 2023 10 次提交
-
-
由 Xiaoxu Chen 提交于
-
由 sneaxiy 提交于
-
由 YuanRisheng 提交于
* decouple memory copy * fix ci bugs * fix ci compile bugs * fix rocm compile * fix ci bugs
-
由 zhangkaihuo 提交于
-
由 wangxiaoning 提交于
* comp gather_nd_grad * fix * test no cinn * fix * fix cinn
-
由 zhouweiwei2014 提交于
-
由 JingZhuangzhuang 提交于
-
由 niuliling123 提交于
-
由 risemeup1 提交于
* patch on gloo/types.h * fix patch * change patch dir * add patch
-
由 ronnywang 提交于
* [CustomDevice] fix process_group_custom api * update * update * update * update
-
- 02 3月, 2023 29 次提交
-
-
由 Ruibiao Chen 提交于
* Check structed kernel for new executor static build * Update code * Ready for resnet50 * Move transfer_dtype to phi * Ready for transformer * Fix CI errors * Fix layer_norm InferMeta * Remove layer_norm infermeta fix
-
由 limingshu 提交于
* first commit * finish base work * modification for good * fix for cache setting and gather the algo and desc as one data for cache storage * fix for cache setting and gather the algo and desc as one data for cache storage * install pre-commit check
-
由 chenxiao120660 提交于
-
由 ahahahahahaha 提交于
-
由 Zhang Jun 提交于
-
由 xiongkun 提交于
-
由 xiaoxiaohehe001 提交于
* add_trt_tile * tile_trt
-
由 zyfncg 提交于
* fix performance drop in BF16 models * fix test_cpu_quantize_squash_pass
-
由 Charles-hit 提交于
* fix prim_op_test when python api outs is different with kernel sig * add elementwise op prim test * fix unit test * add bfloat16 for full in static prim api * empty-commit * close bf16 test * polish elementwise tests
-
由 qizhaoaoe 提交于
* fluid clean: remove parallel and parallel_helper api * fix: fix the import path. * fix DataParallel imports issue
-
由 Jiabin Yang 提交于
* fix attrs copy error * fix bert by fix slice error * fix op test
-
由 risemeup1 提交于
* fix gcc12 error * patch on device.cc * fix gcc error while compiling gloo
-
由 xiongkun 提交于
* [dy2static] bugfix: make stop_gradient a cache key 1. make stop_gradient cache key in dy2static. * fix ci errors * fix ci error * fix ci error * fix ci error
-
由 feng_shuai 提交于
-
由 wangshengxiang 提交于
-
由 easywaytolifebelief 提交于
* fluid clean: remove dygraph_utils._append_bias_in_dygraph * fix func name and imports
-
由 wangzhen38 提交于
-
由 HongyuJia 提交于
* polish codes according #50813 * [getCurrentCUDAStream] Add C++ API getCurrentCUDAStream * change get->Get * wrap with macro * use Get instead of get
-
由 Leo Chen 提交于
* register fp16 and bf16 kernel for uniform_random * fix compile * support selected_rows * add ut * revert cpu * fp16 test skip cpu
-
由 wangzhen38 提交于
* [cinn] concat_grad * [cinn] concat_grad * [cinn] concat_grad build success * [Add PGLBOX] fix unnitest * [Add PGLBOX] fix unnitest * [Add PGLBOX] fix codestyle * [cinn] update by comments * [cinn] update by comment * [cinn] add axis check
-
由 LoneRanger 提交于
-
由 zhangbo9674 提交于
* add dialect * add some interface for dialect * add some dialect interfaces for class Type * set WITH_NEWIR=OFF * refine code by comment * polish code * refine include style * refine log for debug
-
由 gaoziyuan 提交于
-
由 Roc 提交于
* add composite op hard swish * add test grad * update apis calling * update date range * add ut * tune off cinn for 0-d shape * skip cinn
-
由 haosicheng 提交于
-
由 Yuanle Liu 提交于
-
由 zyfncg 提交于
* split generated_op.cc into 4 src files * fix bug * fix compile on windows
-
由 jiangcheng 提交于
-
由 Vvsmile 提交于
-
- 01 3月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
* flash attn * seed * almost * softmax * fix workspace * add unitest; linux only * fix setup * fix datatype include * fix setup typo * fix def scope * new error api * use paddle fork * fix attr bug; complete ut * update flash hash * fix rng reset * fix offset * fix comments
-