- 01 12月, 2022 2 次提交
-
-
由 zyfncg 提交于
* rename kernel for top_k, slogdeterminant, generate_proposals_v2 * fix bug
-
由 zhangyikun02 提交于
-
- 30 11月, 2022 7 次提交
-
-
由 Qi Li 提交于
-
由 zyfncg 提交于
* fix error log for yaml check * remove grad_op of increment
-
由 Netpunk 提交于
* migrate transpose_op.cu.h and gpu_utils.h * format code style * fix some problems * format code * reset tranpose_op.cc * test commit * recover transpose_op.h * delete transpose_op.h * adjust header files order in transpose_op.cc
-
由 Aurelius84 提交于
* [Perf]Fix interploate OutSize data transform problem * fix code style * fix grad * fix phi kernel
-
由 zyfncg 提交于
* rename some kernel name * fix compile problem
-
由 james 提交于
some legacy code still use xpu_wait() for stream sync -- it only syncs default stream. this PR replaces them with dev_ctx.Wait() to ensure that correct stream is always used
-
由 zhangyikun02 提交于
-
- 29 11月, 2022 12 次提交
-
-
由 HongyuJia 提交于
-
由 Paulina Gacek 提交于
* traspose2 kernel migrated * Got rid of mutable_data * x modification added * ops added in extra info file * Formatting fix * 2 fuse passes with tanpose2 commented * nr of outs changed in 2 passes, passes uncommented * Changes in passes reverted * transpose chnaged in operator.cc * MKLDNN check in operator.cc * Transpose fixes * Fix deleted from operato * template corrected Co-authored-by: NPaulina Gacek <paulinagacek@intel.com>
-
由 Nyakku Shigure 提交于
* isort all files * revert conflicting files * revert conflicting files * revert conflicting files
-
由 Asthestarsfalll 提交于
* migrate enforce_custom.h from fluid to phi * move to backends/custom/
-
由 Sławomir Siwek 提交于
-
由 Vvsmile 提交于
Optimize the implementation of the argsort operator
-
由 Sławomir Siwek 提交于
* cleanup unused code * unify is_int8 is_bfloat16 * Simplify matmul_v2 FWD kernel * remove RunKernel methods * remove import namespace * remove headers * clean fluid/phi cross imports * remove fluid axpy_handler * delete fluid methods * activations * OneDNNMemDesc * MKLDNNFormatForSize * MatchShapeToLayout * MKLDNNMemoryFormat * MKLDNNFormat * ReorderMKLDNNHandler * to_void_cast * review suggestions * interpolate * remove fluid depedency * init * ExecuteMatMulV2 * rm fluid kernel * matmul_grad * remove mutable_data * mul_grad * matmul fwd * add extra attr * temp disable passes * re-enable passes * workaround for matmul+act * fix for matmul+eltwise_add * fix typo * merge bugfix #48364 * remove merge conflict
-
由 haosicheng 提交于
-
由 Sławomir Siwek 提交于
-
由 HappyHeavyRain 提交于
* generate static graph code for lerp by yaml, test=develop * modify the op_compat.yaml of lerp, test=develop * generate static graph code for lerp by yaml, test=develop * modify the op_compat.yaml of lerp, test=develop * remove the 'attrs' of lerp, test=develop Signed-off-by: lizhiyu02 <1528794076@qq.com> Signed-off-by: lizhiyu02 <1528794076@qq.com>
-
由 zhangkaihuo 提交于
-
由 zhangkaihuo 提交于
-
- 28 11月, 2022 12 次提交
-
-
由 zyfncg 提交于
* generate static graph code for some operators * add some ops generate * revert npu gelu
-
由 huangjiyi 提交于
* decouple cudnn_desc.h from fluid * move cudnn_desc.h from fluid to phi * fix bugs * decouple cudnn_helper.h from fluid * fix bugs * move cudnn_helper.h from fluid to phi * add fluid cudnn_helper.h * move miopen_desc.h from fluid to phi * move miopen_helper.h from fluid to phi * fix bugs * move gpu_dnn.h from fluid to phi * fix bugs * update copyright year * simplify gpu_dnn.h in fluid * fix bugs * fix xpu build bug * fix compile bug * fix bug
-
由 Yiqun Liu 提交于
* Optimize the log of broadcast and decrease the log level. * Remove the redundant brackets. * Change op benchmark ci to test the tests module. * Remove the observe of elementwise and reduce_ops sub-directory.
-
由 YuanRisheng 提交于
* Fix onednn kernel bugs * fix gpu bugs
-
由 Asthestarsfalll 提交于
-
由 PuQing 提交于
-
由 Qi Li 提交于
* [NPU] apply npu_identity to conv bn and copy2cpu, test=develop * update npu identity to share data with x, test=develop * address review comments, test=develop
-
由 huangjiyi 提交于
* rm fluid “xpu_header.h” deps in phi * move part of xpu_op_list.h from fluid to phi * add fluid xpu_op_list deps * add glog deps for xpu_op_list in phi * fix PR-CI-Kunlun
-
由 zyfncg 提交于
* add fluid_op_name_map * rename some kernel name * add comments for op-kernel map * refine map name of op to kernel
-
由 MarDino 提交于
-
由 Thomas Young 提交于
* fix expand as op * fix bug
-
由 haosicheng 提交于
-
- 25 11月, 2022 7 次提交
-
-
由 wanghuancoder 提交于
* for xpu multi thread bug test
-
由 Wang Bojun 提交于
* group norm fp16 support
-
由 Chitsing KUI 提交于
* attr ready * op ip ready * start dynamic * end2end ok * input shape to map, stat by op * layer wip * first version ready * fix proto depds * fix profiler deps * fix flops typo, rm tuple shape
-
由 Roc 提交于
* support xpu scalar inplace * sharding for xpu Co-authored-by: Nheyanru <81976792+heyanru01@users.noreply.github.com>
-
由 wanghuancoder 提交于
-
由 sneaxiy 提交于
-
由 sneaxiy 提交于
* add bfloat16 support for more ops * fix ci compile * fix windows compile error * fix windows compile error * fix rocm compile error * fix ROCM compile error
-