- 03 Mar, 2021 9 commits
-
-
Committed by Qi Li
* [ROCM] update fluid elementwise op for rocm (part10), test=develop * update, test=develop * address review comments, test=develop
-
Committed by Qi Li
* [ROCM] update fluid operators for rocm (part3), test=develop * fix clang format error, test=develop
-
Committed by Qi Li
-
Committed by Qi Li
-
Committed by Pei Yang
-
Committed by Qi Li
-
Committed by Qi Li
-
Committed by Zhou Wei
-
Committed by Qi Li
-
- 02 Mar, 2021 10 commits
-
-
Committed by Shang Zhizhou
-
Committed by Qi Li
-
Committed by tangwei12
* fix sycn training error Change-Id: Ie2feebcf0b5b2984fd59cfcdde0c817840e203d2
-
Committed by Qi Li
-
Committed by Qi Li
* [ROCM] update fluid operators for rocm (part5), test=develop * address review comments, test=develop * fix typo, test=develop
-
Committed by Pei Yang
* add n-d input support for trt scale converter * add flatten for ut * fix dims
-
Committed by Shang Zhizhou
* support trt serialize when load model from memory * delete conv_bn_fuse_pass before tensorrt, with which trt serialize engine id is not stable * Revert "delete conv_bn_fuse_pass before tensorrt, with which trt serialize engine id is not stable" performance degradation, fix in the future This reverts commit fa6cd17e60b15df351efda379ddd00e9e9c1fea9. * add delete conv_bn * delete path when delete_cache_files
-
Committed by Gradie
* lamb_op_xpu;test=kunlun * modify lamb_op_xpu.cc;test=kunlun * delete atol lamb_op_xpu; test=kunlun * update xpu.cmake;test=kunlun * test_error 1e-5,lamb_op_xpu;test=kunlun * error1e-5,lamb_op_xpu,test=kunlun * delete atol lamb_xpu;test=kunlun * modify atol,lamb_op_xpy;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu, XPUOptest;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu,modify xpu_cmake; test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu,modify xpucmake;test=kunlun
-
Committed by danleifeng
* topo and memory performance for heterps; test=develop * add trainwithprofiler in heter trainier; test=develop
-
Committed by Qi Li
-
- 01 Mar, 2021 10 commits
-
-
Committed by cucuzg
* add clip_by_norm on kunlun, *test=kunlun * opt matmul and matmul_v2 on kunlun, *test=kunlun
-
Committed by Wilber
-
Committed by wuhuanzhou
-
Committed by 石晓伟
-
Committed by wuhuanzhou
* optimize unity build, test=develop * fix compilation error on Windows, test=develop * fix compilation error, test=develop * fix code style error, test=develop
-
Committed by jiangcheng
-
Committed by alncat
-
Committed by Chen Weihang
-
Committed by Qi Li
-
Committed by niuliling123
* Optimized the adaptive_avg_pool2d op when output_size == 1
-
- 28 Feb, 2021 1 commit
-
-
Committed by 石晓伟
-
- 27 Feb, 2021 2 commits
- 26 Feb, 2021 7 commits
-
-
Committed by Jiabin Yang
-
Committed by Jiabin Yang
* remove remove_unsupport_dtype * remove remove_unsupport_dtype * remove test dtype * add more include * change dtype.h's enum as enum class to avoid conflict with inference lib * make enum as enum class * remove additional test * merge develop * polish code
-
Committed by WangXi
-
Committed by Chen Weihang
* split build op marco & polish details * revert register api del * fix other unittest
-
Committed by tangwei12
Change-Id: I6210ce9c60bed48f3323c47b16500302b66cedf2
-
Committed by Qi Li
-
Committed by Qi Li
-
- 25 Feb, 2021 1 commit
-
-
Committed by Qi Li
-