- 15 9月, 2022 15 次提交
-
-
由 Jacek Czaja 提交于
* - mul & matmul changes - fix - bs16 correction of strides * - cosmetic fixes * - lint * - fix * - fix * - format -> mem_desc * - fix * - fix * - fix * - fix * - fix
-
由 Nyakku Shigure 提交于
-
由 Nyakku Shigure 提交于
-
由 傅剑寒 提交于
-
由 Wangzheee 提交于
-
由 zyfncg 提交于
* clear extra attrs of elementwise op in opmaker * fix op_debug_string_test * fix bug of grad_add * fix sort of runtime attrs
-
由 WangZhen 提交于
Support 0 shapes input Tensor for MKL slice kernel
-
由 limingshu 提交于
* first commit * fix some bugs in code * fix bugs * to optimize merge one dimension feature
-
由 Nyakku Shigure 提交于
-
由 weishengying 提交于
-
由 wanghuancoder 提交于
* saved_tensors_hooks
-
由 Li Min 提交于
-
由 Jiabin Yang 提交于
* make eager log readable * fix compile error * recover test * invoke ci again
-
由 zyfncg 提交于
* delete eigen header in data_type.h * fix complie bug * refactor
-
由 Sing_chan 提交于
-
- 14 9月, 2022 25 次提交
-
-
由 Nyakku Shigure 提交于
* trim trailing whitespace * fix `.cmake-format.py` * revert npu ut changes, avoid npu ci error
-
由 JingZhuangzhuang 提交于
* merge python lib * Update third_party.cmake * Update CMakeLists.txt
-
由 pangyoki 提交于
-
由 Chen Weihang 提交于
-
由 JingZhuangzhuang 提交于
* Delay TensorRT registry * Add unused define * Fix TensorRT test * fix function to reference * Update trt_plugin.h
-
由 Chen Weihang 提交于
-
由 Jiabin Yang 提交于
* support bmm and bmm_grad in xpu * add error removal * test=kunlun * refactor code for better structure * test=kunlun * add fp16 kernel for bmm * test=kunlun
-
由 Charles-hit 提交于
* add convert rules for fill_any_like op in paddle science * add unit test for fill_any_like op in paddle science * modify fill_any_like convert rule * modify fill_any_like convert rule dtype
-
由 wanghuancoder 提交于
-
由 zhangkaihuo 提交于
-
由 Li Min 提交于
-
由 Nyakku Shigure 提交于
* [CodeStyle] trim trailing whitespace in .md and .rst * empty commit, test=document_fix
-
由 Leo Guo 提交于
Migrate scale and scatter to phi, and modify the code style for roi_align_kernel. test=kunlun (#45938)
-
由 Zhang Jun 提交于
* update * update * update
-
由 Charles-hit 提交于
-
由 Chen Weihang 提交于
* add ci rule for blocking legacy kernel registration, test=document_fix * add a test case, test=document_fix * Revert "add a test case, test=document_fix" This reverts commit 239e08678b791aabb8fae48c4b74f2a8861e222d. * polish desc, test=document_fix
-
由 JZ-LIANG 提交于
* bugfix (#45332) * dist embedding support lookup table v1 * add unitest * customize wait_comm * group gradients * bugfix * update program
-
由 ykkk2333 提交于
-
由 Thunderbrook 提交于
-
由 Leo Chen 提交于
-
由 zhangbo9674 提交于
* support bfloat16 for amp_decorate * add check_finite for bf16 * fix bug * add ut * add ut * refine code
-
由 Xinger 提交于
-
由 Chen Weihang 提交于
* fix arm fp16 compile error * polish macro impl
-
由 xiongkun 提交于
* fix select_input with different shape errors: 1. select_input_with_buildin_type directly return non-undefinedvar branch when meeting undefined var 2. the output shape of select_input is inferred from inputs. * reverse the logic in select_input
-
由 sneaxiy 提交于
* fix distributed_fused_lamb nan * remove CUDA_ASSERT
-