- 10 Apr, 2023 19 commits
-
-
Committed by Vvsmile
* adjust default tolerance of output and grad
* fix a bug in the grad of OpTest
* fix the type used when setting the default value in OpTest, both forward and backward
* add default
* fix test_sum_op
* adjust tolerance
* fix the tolerance of eager
* add bf16 and fp16 to the activation tests
* remove some fixes
* fix activation
* fix fp16
* fix gelu
* fix the activation tests
* add bfloat16 specialization to singrad and cosgrad
* fix bugs
* fix bugs
* add unittest
* add skip
* add fp/bf to rrelu/rrelu_grad
* git add rrelu
* fix bugs
-
Committed by qizhaoaoe
* add fp16 and bf16 support for instance_norm
* fix /= operator, which did not support bf16
* fix instance_norm_grad kernel and unittests
* fix fp32 unittests
* fix instance_norm_kernel and unittests
* fix instance_norm_grad_kernel and unittest threshold
* add fp16/bf16 for instance_norm_grad_grad op
* add bf16 dtype check
* fix conflicts
* fix cpu support for fp32 op and fix type in instance_norm_grad_kernel
* fix type in instance_norm_kernel
* fix bf16 outputs in unittests and refine code
* fix dx computation
* delete unused params and header includes
* add fp16/bf16 for static graph
* fix device condition for instance_norm op
* fix instance_norm_grad_grad and bf16 op tests
* fix op_test so that bf16 grads can be compared with fp32
* remove updates
* add self-defined grad
-
Committed by Wang Xin
-
Committed by huangjiyi
* update
* fix bug
* fix ci-windows-openblas
* fix test_partial_sum_op
* fix codestyle
-
Committed by jjyaoao
-
Committed by Zero Rains
* fix divide-by-zero bug for softmax_with_cross_entropy
* change the unit test approach
* runs but is slow; the cause of the slowdown is not yet known
* remove some useless comments
* correct the copyright
* remove some useless changes
* if repeat_times == 1, do not use BroadcastKernel
-
Committed by cyberslack_lee
-
Committed by Asthestarsfalll
* Optimize the performance of logsumexp
* Support zero-dim tensor
-
Committed by lishicheng1996
-
Committed by zyfncg
-
Committed by Wang Xin
* add autogen code support for affine_grid op
* update op_compat.yaml for affine_grid
* update op_compat.yaml for affine_grid
* fix AffineGridGradInferMeta
* fix CI error
* update AffineGridInferMeta
-
Committed by Galaxy1458
* delete [-Wno-error=terminate], test=develop
* remove GPUps[-Wterminate], test=develop
* remove some -Wno-, test=develop
* modify ~MatmulDescriptor
* mess
-
Committed by risemeup1
-
Committed by jjyaoao
-
Committed by jjyaoao
-
Committed by jjyaoao
-
Committed by jjyaoao
-
Committed by 张春乔
* mv WITH_ASCEND_CL
* mv WITH_ASCEND
* rollback
* remove WITH_ASCEND
* remove WITH_ASCEND
-
Committed by wangzhen38
* [bug fix] fix pow composite
* [bug fix] for ci
-
- 09 Apr, 2023 4 commits
-
-
Committed by ronnywang
* [PHI CAPI] support complex dtype kernel
* update
-
Committed by Chitsing KUI
-
Committed by shaojie_wang
-
Committed by scotty
* add autogen code support for matrix_nms
* update
-
- 08 Apr, 2023 3 commits
-
-
Committed by kangguangli
* add strategy force_sequential_run
* fix
* fix
* fix
* fix
* fix
-
Committed by 张春乔
* mv WITH_ASCEND_CL
* mv WITH_ASCEND
* rollback
-
Committed by RedContritio
-
- 07 Apr, 2023 13 commits
-
-
Committed by lishicheng1996
-
Committed by Ruibiao Chen
* Isolate DenseTensor::set_type from header file
* Fix selected_rows
-
Committed by shaojie_wang
* register bf16 for communication ops
* fix bfloat16 type finding compile error in c_allreduce_max_op
-
Committed by Zhenghai Zhang
-
Committed by TaoTao Li
fix merge conflicts
-
Committed by Xiaoxu Chen
-
Committed by RedContritio
-
Committed by risemeup1
* fix_build_ci_error
* fix_build_ci_error
* fix_build_ci_error
-
Committed by YuanRisheng
-
Committed by wenzhe.wang
fix bugs
Co-authored-by: Nwenzhe.wang <wenzhe.wang@xdxct.com>
-
Committed by Wang Xin
-
Committed by jameszhang
-
Committed by engineer1109
-
- 06 Apr, 2023 1 commit
-
-
Committed by yuehuayingxueluo
-