- 07 Mar, 2023 2 commits
-
-
Committed by Jiabin Yang
-
Committed by zhangyikun02
-
- 06 Mar, 2023 13 commits
-
-
Committed by mhy
-
Committed by Paulina Gacek
* first approach
* test finished
* cpp test deleted
* CMakeLists corrected
* multi_gru_seq_fuse_pass rewritten
* dummy cout deleted
* review changes
* timeout extended
-
Committed by 傅剑寒
* add bf16 fp16 type support for interpolate
* add bf16 fp16 support for interpolate in phi on cpu
-
Committed by Wang Xin
-
Committed by Ruibiao Chen
* Remove InterpretercoreInferShapeContext
* Fix lod errors
-
Committed by Sławomir Siwek
-
Committed by Sonder
-
Committed by Yuanle Liu
-
Committed by 201716010711
-
Committed by Meteor Liu
* implement floor_grad by primitive logic
* implement floor_grad by primitive logic
* Merge branch 'develop' into floor_grad
-
Committed by niuliling123
-
Committed by Huang Jiyi
* move DeviceContextPool to phi
* add EmplaceExternalContextFunc
* update namespace
* update cmake
* fix bugs and create context_pool_impl.h
* replace platform::is_xxx_place
* fix bugs
* update generator
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix enforce usage
* Revert "fix enforce usage" This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27.
* fix bugs
* rm XPUDeviceContext and CustomDeviceContext
* fix bugs
* fix context init bug
* fix bugs after merge
* fix bugs
* fix name
* fix mutable_data
* update and fix bugs
* fix bugs
* update
* fix bugs
* fix name
* fix bugs
* merge
* fix bugs
* create context_pool in phi/backends
* create context_pool in phi/backends
* fix bugs
* fix xpu bugs
* fix rocm bugs
* fix bugs
* fix bugs
* fix bugs
* fix xpu bugs
* update
* update
* fix bugs
* fix bugs
-
Committed by Sławomir Siwek
* matmul refactored
* fc
* SetOutMemDescWithLogicalLayoutFusesSupport
* matmul_v2
* alpha support
* group repetitive funcs
* matmul utils
* execute matmul methods
* restore registered kernel names
* split header and impl files
* remove double negatives
* increase coverage
* add onednn tests to ctest
* remove fusion logic from base matmuls
-
- 05 Mar, 2023 1 commit
-
-
Committed by Xiaoxu Chen
-
- 03 Mar, 2023 11 commits
-
-
Committed by gouzil
* [phi] move jit kernels from fluid to phi
* [phi] fix paddle::phi err
* [phi] fix windows 'posix_memalign': identifier not found
* [phi] fix windows 'posix_memalign_free': identifier not found
* [phi] fix readme directory structure, fc_functor paddle::platform
-
Committed by Weilong Wu
* fix issue occurring in macOS M1 env
* use __arm64__ directive to distinguish M1 chip
-
Committed by tianshuo78520a
-
Committed by Xiaoxu Chen
-
Committed by YuanRisheng
* decouple memory copy
* fix ci bugs
* fix ci compile bugs
* fix rocm compile
* fix ci bugs
-
Committed by zhangkaihuo
-
Committed by wangxiaoning
* comp gather_nd_grad
* fix
* test no cinn
* fix
* fix cinn
-
Committed by zhouweiwei2014
-
Committed by JingZhuangzhuang
-
Committed by niuliling123
-
Committed by ronnywang
* [CustomDevice] fix process_group_custom api
* update
* update
* update
* update
-
- 02 Mar, 2023 13 commits
-
-
Committed by Ruibiao Chen
* Check structured kernel for new executor static build
* Update code
* Ready for resnet50
* Move transfer_dtype to phi
* Ready for transformer
* Fix CI errors
* Fix layer_norm InferMeta
* Remove layer_norm infermeta fix
-
Committed by limingshu
* first commit
* finish base work
* modification for good
* fix for cache setting and gather the algo and desc as one data for cache storage
* fix for cache setting and gather the algo and desc as one data for cache storage
* install pre-commit check
-
Committed by chenxiao120660
-
Committed by ahahahahahaha
-
Committed by Zhang Jun
-
Committed by xiongkun
-
Committed by xiaoxiaohehe001
* add_trt_tile
* tile_trt
-
Committed by zyfncg
* fix performance drop in BF16 models
* fix test_cpu_quantize_squash_pass
-
Committed by Charles-hit
* fix prim_op_test when Python API outputs differ from the kernel signature
* add elementwise op prim test
* fix unit test
* add bfloat16 for full in static prim api
* empty-commit
* close bf16 test
* polish elementwise tests
-
Committed by Jiabin Yang
* fix attrs copy error
* fix bert by fixing slice error
* fix op test
-
Committed by feng_shuai
-
Committed by wangshengxiang
-
Committed by HongyuJia
* polish code according to #50813
* [getCurrentCUDAStream] Add C++ API getCurrentCUDAStream
* change get->Get
* wrap with macro
* use Get instead of get
-