- 30 9月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * add bfloat16 to selu_grad to pass CI * fix selu grad compilation error
-
- 29 9月, 2022 24 次提交
-
-
由 carryyu 提交于
-
由 Xinger 提交于
-
由 zhaoyingli 提交于
-
由 zhangyikun02 提交于
-
由 Zhang Zheng 提交于
* Move valid check from python to kernel * fix error throw * fix * invalid label check * fix * Revert "fix" This reverts commit 79fad6799cfa4b30423dbc84e67d7d843d22b84a. * Revert "invalid label check" This reverts commit 402a9707390ad5386b3222e85844b92d2e9b9fa4. * Revert "fix" This reverts commit 09ba3080ee0587447f875c19cdf060485f15ae3b. * Revert "fix error throw" This reverts commit a901bfcc2179d5c120ec29af766f392b122dab52. * Revert "Move valid check from python to kernel" This reverts commit baa03cc4ef82d8d45516c30dfb52bf5aead30748. * final fix * fix * fix
-
由 zmxdream 提交于
* add afs OpenWriter * update
-
由 Zheng_Bicheng 提交于
* 之前的pr合并了大量错误代码,重新提交一份 * 之前的pr合并了大量错误代码,重新提交一份 * 修正格式问题 * 改回原来的格式 * 按照要求修改 * 按照要求修改格式 * 修复注释的问题 * 更新格式 * 测试自动格式化 * 修正英文注释 * fix docs build error * pre-commit * for docs build * for docs build * 修复mantissa计算错误的bug * 修复误判exponent可能存在负数,导致计算量增加的情况 Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
-
由 Leo Guo 提交于
Add index_select, index_select_grad, reduce_min kernel and their unittests for kunlun. Add registers of index_select, index_select_grad, reduce_min, sqrt, sqrt_grad to xpu2_op_list.test=kunlun. (#46557)
-
由 Nyakku Shigure 提交于
-
由 Nyakku Shigure 提交于
* [CodeStyle][F401] remove unused import in unittests/collective * empty commit, test=document_fix * empty commit
-
由 carryyu 提交于
* fix P40 topk: Make the optimized topk compatible with P40. * fix P40 topk: Make the optimized topk compatible with P40. * fix P40 topk: Make the optimized topk compatible with P40.
-
由 yeliang2258 提交于
* remove calibration file path * remove useless code
-
由 光明和真理 提交于
Co-authored-by: Nliupeiyu <liupeiyu@cambricon.com>
-
由 zhaoyingli 提交于
* [AutoParallel] fix reshard when train with eval * fix mppp
-
由 ming1753 提交于
-
由 Nyakku Shigure 提交于
-
由 傅剑寒 提交于
-
由 HongyuJia 提交于
* select highest priority layout * opt performance, save virtual table find
-
由 HongyuJia 提交于
* add datatype check for ParseKernelKeyByInputArgs * polish error message * Actually, einsum has vector<Tensor> inpute with DataType::COMPLEX64, see test_einsum_v2.py * headerfile remove enforce.h
-
由 zhangbo9674 提交于
* test * test * refine check pr is_comment chanege * test
-
由 ronnywang 提交于
* [CustomDevice] add to_static, amp ut * update * fix failed ut * update
-
由 Weilong Wu 提交于
[Eager, Performance optimization] support mod / matmul ( % and @ operator) to sink to Cpp layer (#46565) * [Eager, Performance optimization] support mod ( % operator) to sink to Cpp layer * fix mod logic * support matmul math operator * rm LOG(warning), use VLOG(6) * fix conflicts mistake
-
由 houj04 提交于
* [XPU] update xpu cmake to 0923. test=kunlun * [XPU] update xpu cmake to 0928. test=kunlun
-
由 risemeup1 提交于
* check change of unittest before checking coverage rate,test=coverage * modify paddle_build.sh * adding test_list.py
-
- 28 9月, 2022 15 次提交
-
-
由 sneaxiy 提交于
-
由 zhaoyingli 提交于
-
由 zhaoyingli 提交于
-
由 zhaoyingli 提交于
* [AutoParallel] fix dist_split * add unittest * update cmakelist
-
由 Chen Weihang 提交于
* remove needless using tensor * remove needless using tensor * resolve conflict * replace tensor using * fix format error * revert needless changing * fix rocm and npu compile error * fix cinn compile error * fix format error * fix mkldnn format error * fix mkldnn format error * fix cinn compile error * fix cinn compile error * fix cinn compile error * resolve conflict
-
由 Ruibiao Chen 提交于
* Convert GradMergeAllReduceOpHandle in GraphToBlock * Set FLAGS_CONVERT_GRAPH_TO_PROGRAM to False
-
由 Jiabin Yang 提交于
* fix p_norm n=1 issue * fix p norm test error
-
由 Yuang Liu 提交于
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
* change BackendSet from 64bits to 32bits * fix _MSC_VER error, __lzcnt32->__lzcnt * fix __GNUC__ error, __builtin_clzl->__builtin_clz
-
由 Weilong Wu 提交于
[Eager, Performance optimization] support less_than & less_equal( < & <= operator) to sink to Cpp layer (#46542)
-
由 zmxdream 提交于
-
由 Leo Chen 提交于
-
由 Jacek Czaja 提交于
* added fc int8 tests * CI fix * added skipping UTs for GPUs * fixes for CI * added support for residual connections inside fc * fix for quant int8 bias * - lint Co-authored-by: Njakpiase <jakpia21@gmail.com>
-
由 limingshu 提交于
-